"Building a Python Data Pipeline with Apache Flink" - Caito Scherr (PyCascades 2021)


Any symbiotic relationship among very different creatures has unique challenges, but can result in something even more powerful than the sum of its parts. Combining Python with Apache Flink, particularly for Machine Learning, has its complications, but can also produce an incredibly fast, portable, scalable, and highly flexible data pipeline.

This talk covers the structure and technical features of a Python-Flink pipeline. It also covers getting started and, more importantly, addressing the common mistakes and hurdles of building one. This includes which features to use and how to leverage the strengths of each framework for your specific use case. For instance, when would you use regular Python, and when would you want to use PyFlink? Are there cases where you would NOT want to use some of the abstraction or automation tools available for these frameworks?

Attendees will come away with an introduction to working with Apache Flink from Python, plus pragmatic tips and tricks for a smoother, faster, more enjoyable (because this should be fun!) dive into this symbiotic relationship.

This talk is geared towards those who are new to Flink, but it is applicable to anyone with beginner to advanced Python experience.

After three amazing in-person conferences, this time we're moving PyCascades online.

PyCascades is a regional PyCon in the Pacific Northwest, celebrating the west coast Python developer and user community. Our organizing team includes members of the Vancouver, Seattle, and Portland Python user groups.

Videos are released as CC BY-NC-SA 4.0.

Produced by Next Day Video Australia:


Sat Feb 20 15:35:00 2021 at Interactive Track
