
Get an overview of what you’ll learn in this course and how DuckDB and MotherDuck fit into modern data engineering workflows.
Understand what makes DuckDB unique as a lightweight analytical database and how MotherDuck extends it to the cloud for collaboration and scale.
Step-by-step instructions for installing DuckDB locally, setting up the DuckDB UI, and connecting your environment to MotherDuck.
Learn how to query CSV files directly with DuckDB, no separate server or data warehouse needed.
Explore the built-in graphical interface for DuckDB, preview data, run queries, and visualize basic trends.
Create persistent DuckDB databases from your CSV files and reuse them for fast analytical work.
Connect your local DuckDB setup to the cloud with MotherDuck, configure authentication, and attach shared databases.
Run your first queries in MotherDuck and see how data processing in the cloud compares to running locally.
Use EXPLAIN and EXPLAIN ANALYZE to compare execution plans and understand how DuckDB and MotherDuck split computation.
Discover how to sign in to MotherDuck directly from the DuckDB UI and work with local and cloud tables side by side.
Install and configure DuckDB for Python and verify everything works by running your first simple query.
Build a small ELT pipeline in Python: load CSV data, clean column names, normalize values, and add calculated fields.
Export your clean data into CSV and Parquet files, and query them directly with DuckDB to confirm your results.
Securely connect your Python scripts to MotherDuck using tokens and environment variables.
Run the exact same ELT workflow you built locally, now in the cloud, using MotherDuck compute.
Combine local analysis and cloud data in one workflow to detect hotspots, visualize data, and run hybrid queries.
Learn about Duck Lake, the new lakehouse layer for DuckDB and MotherDuck, and how it enables schema evolution and transactions on Parquet.
Set up a Duck Lake database backed by S3, create tables, insert and query data, and explore snapshots and metadata.
Review everything you’ve learned, from local analytics to hybrid workflows and see how to apply these tools in your own projects.
In this hands-on course, you’ll start by exploring DuckDB locally: querying CSV and Parquet files, building persistent databases, and analyzing data right from your terminal or the built-in DuckDB UI. You’ll then connect to MotherDuck, the cloud platform built around DuckDB, and learn how to scale analytics, share data, and collaborate without switching tools.
You’ll build hands-on ELT workflows using the DuckDB CLI, Python, and MotherDuck. From analyzing local CSV files to running cloud-scale data pipelines. You’ll see how hybrid execution works, compare local versus cloud compute, and learn to move effortlessly between environments while maintaining a single, simple toolset.
Along the way, you’ll work on a real-world project analyzing NYC 311 elevator service requests, combining local and cloud datasets to generate insights, visualize hotspots, and identify business opportunities.
Finally, you’ll explore DuckLake, DuckDB’s new integrated lakehouse format, which adds schema evolution, snapshots, and transactions to your Parquet data. You’ll understand where Duck Lake fits into the modern data stack and how it connects to cloud storage like S3.
By the end of this course, you’ll have a complete setup, reusable SQL and Python scripts, and the confidence to use DuckDB and MotherDuck together for modern, scalable data engineering workflows.