Data Engineering with DuckDB & MotherDuck

Name: Data Engineering with DuckDB & MotherDuck
Rating: 4.6 (40 reviews)

Learn how to build data workflows that run on your laptop, in the cloud, or across both using DuckDB and MotherDuck.

Created byAndreas Kretz

Last updated 10/2025

English

What you'll learn

Set up DuckDB locally and use it for fast analytical queries
Connect to MotherDuck and query cloud data with the same engine
Compare local vs. cloud execution using EXPLAIN and EXPLAIN ANALYZE
Build a Python-based ELT workflow with DuckDB
Export data to CSV and Parquet, and re-query directly from files
Combine local and cloud data in a single “dual execution” query
Visualize real-world data in Python (NYC elevator complaints project)
Use DuckLake to manage Parquet data with schema evolution and transactions

Course content

7 sections • 19 lectures • 1h 23m total length

Welcome & Course Overview1:28
Get an overview of what you’ll learn in this course and how DuckDB and MotherDuck fit into modern data engineering workflows.
What Are DuckDB and MotherDuck?2:23
Understand what makes DuckDB unique as a lightweight analytical database and how MotherDuck extends it to the cloud for collaboration and scale.
Setup & Installation5:28
Step-by-step instructions for installing DuckDB locally, setting up the DuckDB UI, and connecting your environment to MotherDuck.

Attaching to MotherDuck5:13
Connect your local DuckDB setup to the cloud with MotherDuck, configure authentication, and attach shared databases.
Cloud Query Execution6:15
Run your first queries in MotherDuck and see how data processing in the cloud compares to running locally.
Comparing Local vs Cloud Compute6:33
Use EXPLAIN and EXPLAIN ANALYZE to compare execution plans and understand how DuckDB and MotherDuck split computation.
Using MotherDuck via UI3:48
Discover how to sign in to MotherDuck directly from the DuckDB UI and work with local and cloud tables side by side.

Setting Up DuckDB in Python3:19
Install and configure DuckDB for Python and verify everything works by running your first simple query.
Reading and Transforming Data6:23
Build a small ELT pipeline in Python: load CSV data, clean column names, normalize values, and add calculated fields.
Writing Results Locally3:12
Export your clean data into CSV and Parquet files, and query them directly with DuckDB to confirm your results.

Connecting Python to MotherDuck2:58
Securely connect your Python scripts to MotherDuck using tokens and environment variables.
Reusing ELT with Remote Compute4:12
Run the exact same ELT workflow you built locally, now in the cloud, using MotherDuck compute.
Hybrid Workflows (Local + Cloud)6:47
Combine local analysis and cloud data in one workflow to detect hotspots, visualize data, and run hybrid queries.

Requirements

Basic SQL and Python knowledge
No prior DuckDB or cloud setup required

Description

In this hands-on course, you’ll start by exploring DuckDB locally: querying CSV and Parquet files, building persistent databases, and analyzing data right from your terminal or the built-in DuckDB UI. You’ll then connect to MotherDuck, the cloud platform built around DuckDB, and learn how to scale analytics, share data, and collaborate without switching tools.

You’ll build hands-on ELT workflows using the DuckDB CLI, Python, and MotherDuck. From analyzing local CSV files to running cloud-scale data pipelines. You’ll see how hybrid execution works, compare local versus cloud compute, and learn to move effortlessly between environments while maintaining a single, simple toolset.

Along the way, you’ll work on a real-world project analyzing NYC 311 elevator service requests, combining local and cloud datasets to generate insights, visualize hotspots, and identify business opportunities.

Finally, you’ll explore DuckLake, DuckDB’s new integrated lakehouse format, which adds schema evolution, snapshots, and transactions to your Parquet data. You’ll understand where Duck Lake fits into the modern data stack and how it connects to cloud storage like S3.

By the end of this course, you’ll have a complete setup, reusable SQL and Python scripts, and the confidence to use DuckDB and MotherDuck together for modern, scalable data engineering workflows.

Who this course is for:

Data Engineers, Data Scientists, and Analysts exploring modern, lightweight data stacks
Anyone who wants to understand how DuckDB and MotherDuck fit into real data platforms

Data Engineering with DuckDB & MotherDuck

What you'll learn

Explore related topics

Course content

Introduction to DuckDB & MotherDuck3 lectures • 9min

Working with DuckDB Locally (CLI + UI)3 lectures • 15min

Connecting to and Working with MotherDuck4 lectures • 22min

ELT Workflow with Python and DuckDB3 lectures • 13min

Scaling the Workflow with MotherDuck3 lectures • 14min

Bonus - Duck Lake Table Format2 lectures • 9min

Wrap-up1 lecture • 2min

Requirements

Description

Who this course is for: