Contribute Media
A thank you to everyone who makes this possible: Read More

Building Large Scale Data Pipelines by Apache Airflow|黃泰瑋 (Tai-Wei Huang)|PyCon APAC 2022

Description

PyCon APAC 2022|一般演講 Talks|國泰金控 Cathay Financial Holdings / 美光科技 Micron 冠名贊助

✏️ 共筆 Note:https://hackmd.io/@pycontw/SyzsUaXJs 🖐🏻 Slido:https://app.sli.do/event/pMDXNVF4SZe7pgNHJ1oC6w 🪧 投影片 Slides:https://drive.google.com/file/d/17J_4FKu1s26rfpTO6MGmq1Qh5jUrkiJh/view?usp=sharing 💬 語言 Language:中文演講/英文投影片 Chinese talk w. English slides 🎯 層級 Level:中階 Intermediate 🔎 分類 Category:資料庫 Databases

💡 摘要 Abstract 💡 本演講將說明如何透過 Airflow DAG 大規模擴展 ETL Pipeline 以及調控各項參數,達成每日更新 0.7~1T 的資料,並透過 Airflow 蒐集與定義 Data Downtime 計算出 Data SLA,也會佐以講者 3 年來辛酸血淚的開發與維運經驗,讓聽眾可以少踩一些坑,安心提早下班。

🚀 講者介紹 About Speaker - 黃泰瑋 (Tai-Wei Huang) 🚀 Tai-Wei Huang is a data engineer at E.SUN Bank. His work mostly focuses on data pipeline, data quality and every thing about data.

#python #pycontw #pyconapac2022 #apache #airflow #mlaas #dag

Follow “PyCon Taiwan” ⭐️ Official Website: https://tw.pycon.org ⭐️ Facebook: https://www.facebook.com/pycontw ⭐️ Instagram: https://www.instagram.com/pycontw ⭐️ Twitter: https://twitter.com/PyConTW ⭐️ LinkedIn: https://www.linkedin.com/company/pycontw ⭐️ Blogger: https://pycontw.blogspot.com

Details

Improve this page