The many layers of Data Lineage

Written by Ronald Baan

Ronald is a data enthusiast who spends his time sharing his passion in data with others.

4 September 2022

Nice article in Medium on #datalineage by Borja Vázquez Barreiros.
Data lineage is a tricky one, though so important if you want to do more with data. Besides determining what kind of lineage you need, collecting the data needed for the data lineage there is then: how to make this lineage usable!

This article makes a case for a Google Maps approach with different layers for the different uses of data lineage. Interesting thought, especially since it does not assume 1 solution for everyone.

In the article, the context is primarily the data warehouse. In a modern data landscape with cloud and data lake, data lineage is possibly even more important, as users are now even more diverse and probably elsewhere in the organization. So, yes, priority!

And this: “If we want to remove all barriers, we need to think first and foremost about data modeling.” YES!

You may also like…

Layers of Knowledge (Graph)

Layers of Knowledge (Graph)

You can model reality intricately, you can also do it smartly and then make sure systems can easily handle it as well....

Data Lake House

Data Lake House

In case you're pretty content with your data lake (or not at all), it's time to upgrade the implementation around the...

When Data is not Data

When Data is not Data

It remains funny and sometimes this mistake is made in data science or data engineering.