Tuesday, October 08, 2024

DuckDB

DuckDB with Hannes Mühleisen - Software Engineering Daily podcast

DuckDB is an open-source column-oriented relational database that was first released in 2019. It is designed for high performance on complex queries against large databases and focuses on online analytical processing (OLAP) workloads.

Hannes Mühleisen is the co-creator of DuckDB and the CEO and co-founder of DuckDB Labs. He joins the show to talk about drawing inspiration from SQLite, why DuckDB was written in C++, the novel data processing scenarios it enables, and more.

Professor of Data Engineering
Radboud University Nijmegen, Netherlands
Freie Universität Berlin, Doctor of Philosophy (PhD), Computer Science


DuckDB is a fast in-process analytical database
DuckDB supports a feature-rich SQL dialect complemented with deep integrations into client APIs.
DuckDB v1.0.0 was released in June 2024.

// Get the top-3 busiest train stations in May
const duckdb = require('duckdb');
const db = new duckdb.Database(':memory:');
db.all(
  `SELECT station_name, count(*) AS num_services
    FROM 'http://blobs.duckdb.org/train_services.parquet'
    WHERE monthname(date) = 'May'
    GROUP BY ALL
    ORDER BY num_services DESC
    LIMIT 3;`,
  (err, res) => {
    if (err) {
      console.error("Error", err);
    } else {
      console.table(res);
    }
  }
);