书籍 Data Pipelines with Apache Airflow的封面

Data Pipelines with Apache Airflow

Bas Harenslak, Julian de Ruiter

出版时间

2020-06-04

ISBN

9781617296901

评分

★★★★★
书籍介绍
Data Pipelines with Apache Airflow is your essential guide to working with the powerful Apache Airflow pipeline manager. Expert data engineers Bas Harenslak and Julian de Ruiter take you through best practices for creating pipelines for multiple tasks, including data lakes, cloud deployments, and data science. Part desktop reference, part hands-on tutorial, this book teaches you the ins-and-outs of the Directed Acyclic Graphs (DAGs) that power Airflow, and how to write your own DAGs to meet the needs of your projects. You’ll learn how to automate moving and transforming data, managing pipelines by backfilling historical tasks, developing custom components for your specific systems, and setting up Airflow in production environments. With complete coverage of both foundational and lesser-known features, when you’re done you’ll be set to start using Airflow for seamless data pipeline development and management. what's inside Framework foundation and best practices Airflow's execution and dependency system Testing Airflow DAGs Running Airflow in production
用户评论
书中涉及到的技术决策讨论和实际经验蛮相符
写的还是挺好的,由浅入深,学习Airflow必读。