SciPy 2024

Mehdi Ouazza

Mehdi (aka mehdio) is a data enthusiast with nearly a decade of experience in data engineering for companies of all sizes. He's not your average data guy, injecting humor and fun into his work to make complex topics easier to digest. When he's not actively contributing to the data community through his blog, YouTube, and social media, you can find him off-beat, marching to the beat of his own data drum. In 2023, Mehdi joined MotherDuck as a developer advocate, bringing his data engineering expertise to supercharge DuckDB.


Sessions

07-09
08:00
240min
All the SQL a Pythonista needs to know: an introduction to SQL and DataFrames with DuckDB
Guen Prawiroatmodjo, Alex Monahan, Mehdi Ouazza, Elena Felder

Structured Query Language (SQL for short) is a programming language for managing data in a database system and an essential part of any data engineer’s toolkit. In this tutorial, you will learn how to use SQL to create databases and tables, insert data into them, and write queries that extract, filter, join, and run calculations on that data. We will use DuckDB, a new open-source, embedded, in-process database system that combines cutting-edge database research with dataframe-inspired ease of use. DuckDB is only a pip install away (with zero dependencies) and runs right on your laptop. You will learn how to use DuckDB with your existing Python tools like pandas, Polars, and Ibis to simplify and speed up your pipelines. Lastly, you will learn how to use SQL to create fast, interactive data visualizations, and how to teach your data to fly and share it via the cloud.

Tutorials
Ballroom A