About DuckDB
The in-process columnar database for analytical SQL anywhere your code runs.
DuckDB started in 2018 at the Database Architectures group at CWI in Amsterdam, the same research lab where MonetDB came from, with Mark Raasveldt and Hannes Mühleisen as the original authors. The first public release shipped in 2019 and version 1.0 (codename Snow Duck) landed on 3 June 2024 with a stable on-disk format that future versions read back. The MIT-licensed code is held by the DuckDB Foundation and built by DuckDB Labs in Amsterdam; MotherDuck ships the managed cloud version and is one of the gold sponsors backing the project.
The engine is in-process and columnar. There is no server to run, no port to open and no replication to manage; DuckDB is a library that lives inside Python, R, Node, Java, Rust, the CLI, your data app or a serverless function. Inside that process it stores tables column by column, runs a vectorized executor on batches of rows and reads Parquet, CSV and JSON files directly from the local disk, an HTTPS URL or an S3 bucket with predicate and projection pushdown. The same binary scans a 200 GB Parquet dataset on a laptop and powers an analytical endpoint in a Lambda, and the same SQL works against a local file or a MotherDuck warehouse without rewriting the query.