Sunday, November 27, 2022

Atlan + Airflow: Better Pipeline Monitoring and Data Lineage with Our Latest Integration

One morning at 8 am, I woke up to the Cabinet Minister of India calling me. He said, “Prukalpa, the number on this dashboard doesn’t seem right.”

Frantic, I opened up my laptop and loaded the dashboard, only to realize the number was clearly off. And yet, at that moment, there was nothing I could do to explain it. I could feel myself losing the credibility and hard-earned trust that had taken months to build.

I called my Project Manager, who was incredible at stakeholder management but couldn’t understand the nitty-gritty of data. She called our Data Analyst, who looked at the dashboard and said, “Looks like something broke down in the pipeline.” Our Analyst then called our only Data Engineer, who pulled out logs from Apache Airflow. But he couldn’t troubleshoot it because he didn’t know what the variables meant and didn’t have the data context.

It took us 8 hours and 4 people to figure out what went wrong. We lost time that day.

But more importantly, we lost trust. Trust with our customer. Trust in our team.

Trust is often not about things breaking. In years of working with data, I’ve learned that data will always be chaos. But when things break and you find out too late, or you can’t explain why something broke, that’s what breaks trust.

Imagine if, at that moment when the Cabinet Minister called me, I could quickly open a dashboard and say, “Yes, looks like the pipeline didn’t run on time today. We’ve got an alert and it has already been escalated to data engineering.” Or even better, imagine if the dashboard had an alert on it, signaling to the minister that something was wrong and he shouldn’t use it.

Today we’re excited to announce that Atlan natively integrates with Apache Airflow. For data teams everywhere, this means more transparency and trust, and less time spent debugging pipelines after a broken dashboard or mismatched metrics.

Atlan + Airflow: Building an ecosystem of trust and transparency

With this integration, data teams can build better data engineering experiences centered around building knowledge of and trust in their data.

First, Atlan’s integration with Airflow brings much-needed pipeline context to data assets.

Now you can share any kind of metadata from Airflow pipelines to Atlan data asset profiles, where data analysts, scientists, and business users have access to it. This opens up pipeline context and makes it fully transparent, so that data teams and consumers always know the status of the data pipeline associated with each data asset.

Here are some great context fields that we’ve seen people bring from Airflow to Atlan:

  • Freshness: When was my table last updated?
  • Run schedule: Did the pipeline run as expected?
  • Pipeline status: Was the last pipeline run successful?

Custom Airflow metadata on an Atlan asset profile
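
To make those three fields concrete, here is a minimal Python sketch of how they might be derived from a single pipeline run. Everything in it is an assumption for illustration: `PipelineRun`, `build_pipeline_context`, the field names, and the 15-minute grace period are hypothetical and are not part of the Airflow or Atlan APIs. In a real DAG, values like these could be gathered in an Airflow task callback; that wiring, and the call that would send the result to Atlan, are left out because this post does not describe them.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone
from typing import Optional


@dataclass
class PipelineRun:
    """Hypothetical record of one pipeline run (not an Airflow or Atlan type)."""
    pipeline_id: str
    state: str                         # e.g. "success" or "failed"
    expected_start: datetime           # when the schedule said the run should start
    actual_start: Optional[datetime]   # None if the run never started
    finished_at: Optional[datetime]    # None if the run has not finished


def build_pipeline_context(run: PipelineRun, now: datetime,
                           grace: timedelta = timedelta(minutes=15)) -> dict:
    """Summarize one run into freshness, run schedule, and pipeline status."""
    # Freshness: only a successful run counts as an update. After a failed
    # run, freshness cannot be determined from this record alone.
    last_updated = run.finished_at if run.state == "success" else None

    # Run schedule: "on time" means the run started within the grace period.
    ran_on_schedule = (
        run.actual_start is not None
        and run.actual_start - run.expected_start <= grace
    )

    return {
        "pipeline_id": run.pipeline_id,
        "pipeline_status": run.state,
        "ran_on_schedule": ran_on_schedule,
        "last_updated": last_updated.isoformat() if last_updated else None,
        "hours_since_update": (
            round((now - last_updated).total_seconds() / 3600, 2)
            if last_updated else None
        ),
    }


if __name__ == "__main__":
    scheduled = datetime(2022, 11, 27, 6, 0, tzinfo=timezone.utc)
    run = PipelineRun(
        pipeline_id="daily_sales",     # hypothetical pipeline name
        state="success",
        expected_start=scheduled,
        actual_start=scheduled + timedelta(minutes=5),
        finished_at=scheduled + timedelta(minutes=35),
    )
    print(build_pipeline_context(run, now=datetime.now(timezone.utc)))
```

Keeping the summary as a plain dictionary means the same fields could be attached to an asset profile, logged, or checked by an alert, whichever the team needs.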

Atlan already connects to data warehouses (e.g. Snowflake, Redshift) and BI tools (e.g. Tableau and Looker). Bringing Airflow into this ecosystem also means that data teams can now map relationships across all of their data. Whether you’re loading in new data, revising a pipeline, or setting up a dashboard, you can now construct and visualize data lineage from end to end.

Tableau assets linked with source Snowflake tables
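
End-to-end lineage can be thought of as a directed graph: each asset points to the assets it reads from. The sketch below is only an illustration of that idea, not how Atlan stores or computes lineage. The asset names and the `UPSTREAM` mapping are made up, and the traversal is an ordinary breadth-first search.

```python
from collections import deque

# Hypothetical lineage edges: each asset maps to the assets it reads from.
UPSTREAM = {
    "tableau:sales_dashboard": ["snowflake:sales_summary"],
    "snowflake:sales_summary": ["airflow:daily_sales_dag"],
    "airflow:daily_sales_dag": ["snowflake:raw_orders"],
    "snowflake:raw_orders": [],
}


def upstream_assets(asset, upstream=UPSTREAM):
    """Return every asset that feeds `asset`, nearest first (breadth-first)."""
    seen = {asset}
    order = []
    queue = deque([asset])
    while queue:
        current = queue.popleft()
        for parent in upstream.get(current, []):
            if parent not in seen:      # guard against cycles and duplicates
                seen.add(parent)
                order.append(parent)
                queue.append(parent)
    return order


if __name__ == "__main__":
    # Walk back from a dashboard to everything it depends on.
    for name in upstream_assets("tableau:sales_dashboard"):
        print(name)
```

Walking the graph from a dashboard back to its sources is the same question the people in the opening story had to answer by phone: which pipeline and which tables sit behind this number?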

Less time debugging, more time building

Getting an urgent call about broken data is one of the worst experiences for a data team. Instead of calling everyone who has ever touched the data, you can now diagnose the problem in seconds.

All it takes is opening a data asset profile and checking the pipeline status and metrics. No more hours of scrambling or broken trust: Atlan and Airflow’s integration lets you see all of your data and its context in one place.

Ready to get started with this integration? Check out a demo of Atlan.

Here are two resources to help you get started with bringing Airflow and Atlan together:


