Contribute Media
A thank you to everyone who makes this possible: Read More

Duckdb for Data Scientists: Handling Large Data Files in Jupyter Notebook

Description

Duckdb is a free, open-source analytical database management system (DBMS) that supports SQL. It is fast and memory-efficient and has support of various data formats including CSV, parquet and JSON, and databases including MySQL and Postgres.

Adding Duckdb to your data science toolbox can help you to speed up your data analysis with the comfort of the old reliable SQL within your Jupyter Notebook.

In this talk, I will cover:

  • What is Duckdb and why you should use it
  • How to incorporate Duckdb into your data science workflow
  • Getting started with Duckdb for data science

Details

Improve this page