InGeek CulturebyAlex GordienkoApache Airflow CheatsheetApache Airflow is a system to programmatically author, schedule, and monitor data pipelines. It has been six years since version 1.0.0 of…Oct 26, 20211181Oct 26, 20211181
InSelectFrombyPaul FryAirflow Metadata: how-to gather key runtime statistics in real-timeAnd how to gather this metadata for each DAG runDec 6, 2021541Dec 6, 2021541
InLevel Up CodingbyJuan NathanielAirflow: Decorators for a Clean Data PipelineHow to abstract away complexity in your data pipeline using Airflow decoratorsAug 26, 2021133Aug 26, 2021133
InThe Storyblocks Tech BlogbyChas DeVeasBlue-Green ETLs with Airflow Task GroupsHow Storyblocks achieved zero-downtime updates of application data in our data warehouse using Task Groups in Apache Airflow.Sep 27, 202141Sep 27, 202141
InTDS ArchivebyJuan GesinoAbstracting Data Loading with Airflow DAG FactoriesCreating an abstraction layer for improved scalability and usability for loading Google Sheets data with AirflowNov 30, 2021562Nov 30, 2021562
InAnalytics VidhyabyAjith ShettyWhat’s inside DELTA LAKEDelta Lake is the term you would have heard about or read about in 100s of blogs or you may have even used it in your project.Jul 31, 2021232Jul 31, 2021232
Adenilson CastroMeet Apache Airflow — #01Sharing some highlights of what I’ve learned on the Data Engineering journey so far. Here you’ll find some core concepts of Apache Airflow.Jun 7, 202115Jun 7, 202115
Chetan HiraparaApache Airflow scaling a DAGApache airflow is workflow management tool developed by Airbnb in 2014 for programmatically author, schedule and monitor workflows. Apache…Jun 24, 202169Jun 24, 202169
Kuan-Chih WangSetting Up Apache Airflow Celery Executor ClusterApache airflow is one of the most common tools for routine task execution such as data ETL pipeline and workflow orchestration. When…Jun 8, 202140Jun 8, 202140
Adenilson CastroApache Airflow: The ETL — #02In the previous post, we’ve learned some core concepts regarding the Apache Airflow, like the DAG and the Operators. In this post, we’ll…Jul 20, 202127Jul 20, 202127
InTDS ArchivebyZach AlexanderAutomating Emails in Apache Airflow: A How-To GuideA simple walkthrough of writing a one-step DAG to automate an e-mail with Python’s SMTP library, Gmail, and Apache AirflowJul 2, 2021921Jul 2, 2021921
InTDS ArchivebyPedro MadrugaGetting started with Task Groups in Airflow 2.0A simple pipeline with two groups of tasks, using the @taskgroup decorator of the TaskFlow API from Airflow 2.Aug 24, 20211141Aug 24, 20211141
InTDS ArchivebyL. D. Nicolas MayBreak Up a Big Airflow DAG into Multiple FilesModularize Chunks of Your Large Airflow DAG for Easy Reüse and MaintainabilityAug 11, 20212633Aug 11, 20212633
InTDS ArchivebyNicholas LeongApache Airflow — A New Way To Write DAGsETL Frameworks to Scale Your Data PipelinesJun 7, 20211233Jun 7, 20211233
InTDS ArchivebyAxel FurlanHow to build a DAG Factory on AirflowA guide to building efficient DAGs with half of the codeMar 14, 20213195Mar 14, 20213195
InTDS ArchivebyAntony HenaoData Engineers Shouldn’t Write Airflow DagsMay 29, 20219128May 29, 20219128
InMagratheabyMarcos MarxHow to develop data pipeline in Airflow through TDD (test-driven development)I’ve been reading a lot about DataOps and MLOps methodologies lately. One of the pillars of these methodologies is to improve development…Feb 3, 20212303Feb 3, 20212303