Open-Source Data Catalog Amundsen with Mark Grover @ Stemma


Episode Artwork
1.0x
0% played 00:00 00:00
Jan 11 2022 41 mins   3

In this episode of Building The Backend we hear from Mark Grover founder @ Stemma, co-creator of Amundsen. Stemma is a fully managed data catalog, powered by the leading open-source data catalog, Amundsen.

Below are top 3 value bombs:

  • Automated data catalogs are critical to help wrangle the growing data across organizations. (i.e. Being able to identify out of 150 columns on this table only 10 are being used downstream)
  • Tribal knowledge and context cannot be automated - data catalogs cannot be 100% automated.
  • Amundsen is an open-source data catalog originally created at Lyft. Stemma has created a managed version of Amundsen.

Help me improve the podcast by completing this 60 second survey: https://buildingthebackend.com/survey