Creating & Monitoring Big Data Pipelines with Apache Airflow

Helping customer to build, schedule & monitor their Big data pipelines to stich end-to-end Big data operations

Duration

3 Days

Level

Intermediate Level

Design and Tailor this course

As per your team needs

Edit Content

Big data systems are becoming more and more complex each day. Even simpler big data system involves various stages such as ingestion, transformations, analytics etc. and also involves various stakeholders such as big data engineers, data scientists and data analytics. So it becomes necessary to stitch together all big data tasks into a pipeline and monitor them to make them a more scalable, less error prone and autonomous system.

The Creating & Monitoring Big Data Pipelines with Apache Airflow training course is designed to teach data engineers what they need to know to create, schedule and monitor data pipelines using the de facto platform known as Apache Airflow by programmatically authoring, scheduling and creating workflows. The course begins with the core functionalities of Apache Airflow and then moves on to building data pipelines. The course then moves into more advanced topics around Apache Airflow such as start_date and schedule_time, dealing with time zones, alerting on failures and much more. The course concludes with a look at how to handle monitoring and security with Apache Airflow, as well as managing and deploying workflows in the cloud.

Purpose:

Promote an in-depth understanding of how to use Apache Airflow to create, schedule and monitor data pipelines.

Productivity Objectives:

Upon completion of this course, you should be able to: