#duckdb
This profile in ‘Significance’ on DuckDB co-founder Hannes Mühleisen is quite interesting, and has helpful insights about data quality and the changing meaning of “big data.” Also some good professional advice in here for statisticians.

academic.oup.com/jrssig/artic...
Is big data dead?
Abstract. Data, ducks and statistics – Sandra Alba gathers dispatches from Amsterdam and Auckland
academic.oup.com
November 9, 2025 at 5:47 PM
I got similar feedback for my "DuckDB in Action" book… People were very surprised by what is possible these days.
November 10, 2025 at 10:55 AM
🧭 Great blog post on DuckDB for data scientists!

codecut.ai/deep-dive-in...
A Deep Dive into DuckDB for Data Scientists
Discover how DuckDB simplifies data querying with zero configuration and outperforms pandas for large datasets.
codecut.ai
November 5, 2025 at 9:54 AM
We took Canada’s Spatial Access Measures dataset (big, clunky CSVs) → turned it into a single GeoParquet file.

Add DuckDB-WASM + deck.gl & you get
- instant queries
- smooth maps
- no backend

Public data, but actually usable.
developmentseed.org/blog/2025-10...

@saadiqmohiuddin.bsky.social
November 6, 2025 at 6:38 PM
DuckDB continues to be a miracle!
pg_lake just went open source! (Apache 2.0)

pg_lake is a set of extensions (from Crunchy Data Warehouse) that add comprehensive Iceberg support and data lake access to Postgres, with @duckdb.org transparently integrated into the query engine.

Announcement blog: www.snowflake.com/en/engineeri...
November 5, 2025 at 8:51 AM
What's new this week in AI, data science, bioinfo: LLMs for learning to code, #Rstats updates (R Data Scientist, R Weekly), RAG, layoffs & AI, DuckDB for data science, Anthropic+Iceland, AI in science, AI in drug discovery, Nextflow & pathogen surveillance doi.org/10.59350/28g...
Weekly Recap (Nov 7, 2025)
LLMs for learning to code, R updates (R Data Scientist, R Weekly), RAG, layoffs & AI, DuckDB for data science, Anthropic+Iceland, AI in science, AI in drug discovery, Nextflow & pathogen surveillance
doi.org
November 7, 2025 at 2:04 PM
🦆 Episode 4 in Season 2 of the DuckDB in Research series is out now!

🎙️ Adaptive Factorization in DuckDB (@duckdb.org), with Paul Groß

🔗 Listen now on Spotify: open.spotify.com/episode/20kz...
Adaptive Factorization in DuckDB with Paul Groß
Spotify video
open.spotify.com
November 6, 2025 at 1:17 PM
📢 The final episode of DuckDB in Research's second season is out!

➗ This week, host Jack Waudby interviews Paul Groß, PhD student at CWI Amsterdam, to explore his work on adaptive factorization and worst-case optimal joins.

🎧 Listen to the episode at duckdb.org/science/paul...
November 6, 2025 at 12:52 PM
The fact that I can do things like this in #duckdb along with all of the other magical and ridiculously fast things I can do in DuckDB is truly just amazing.

```sql
CREATE OR REPLACE PROPERTY GRAPH network_attack_graph
VERTEX TABLES (
ip_vertices LABEL IP,
tag_vertices LABEL Tag
)
EDGE TABLES ( […]
```
Original post on mastodon.social
mastodon.social
November 7, 2025 at 1:34 PM
Server for DuckDB with MySQL and Postgres protocols
November 6, 2025 at 6:57 PM
Wow, what a presentation by @hannes.muehleisen.org on the history of data architecture and how it changed from 1985 to 2025, both in general and with DuckDB.

I took a lot of notes, some of which are illustrated below in my current Obsidian Vault.
November 4, 2025 at 8:58 AM
Developed a new #RStats package to access sub-national boundary spatial data (UN OCHA COD and geoBoundaries datasets) from the excellent fieldmaps project. Using #DuckDB and #geoarrow to efficiently query remote parquet and convert to sf #RSpatial #GIS

github.com/epicentre-ms...
GitHub - epicentre-msf/fieldmaps: R Interface to Fieldmaps Data
R Interface to Fieldmaps Data. Contribute to epicentre-msf/fieldmaps development by creating an account on GitHub.
github.com
November 4, 2025 at 2:01 PM
📯 I’m excited to share the news about my upcoming book:

📘 Spatial Data Management with DuckDB: From SQL Basics to Advanced Geospatial Analytics

🔗 GitHub Repository: github.com/giswqs/duckd...
🌐 Book Website: duckdb.gishub.org

The book will be released later this month. Stay tuned for updates!
November 3, 2025 at 1:04 PM
A list of open source data tools I love working with:

- DuckDB
- Polars
- SQLMesh
- dlthub

What would you add? 🤔

#data
November 4, 2025 at 2:01 PM
🎨 Are you creating a website or presentation that mentions DuckDB? Check out our design manual for advice on how to use our logo: duckdb.org/design/manual/
DuckDB Design Manual
DuckDB Design Manual: Explore DuckDB’s visual identity, logo usage, color system, and accessibility standards. Download official assets and learn how to maintain a consistent brand across all platform...
duckdb.org
November 3, 2025 at 10:45 PM
This is really cool. I added the example to my second brain, and it just works.

www.ssp.sh/brain/run-du...
November 2, 2025 at 1:45 PM
Probably my favorite slide from #foss4gna. Given that all the map work I do is on web or mobile, it's essential that all the #gis capabilities come directly from the server/database, most often #postgis or #duckdb
November 5, 2025 at 6:50 PM
New database leaderboard from Yellowbrick ranks the quality of DBMS optimizer estimates and plans. They only evaluate TPC-H for now and report results for Postgres + DuckDB + MSSQL: sql-arena.com/components/p...
Repo: github.com/sql-arena/db...
LinkedIn Group: www.linkedin.com/groups/15775...
November 3, 2025 at 5:07 PM
YamlQL transforms any YAML structure (Kubernetes manifests, config files, Helm values) into relational tables (via DuckDB) so you can run SQL queries or use AI-driven natural language queries over your configs

https://ku.bz/dGSTnVnN4
November 3, 2025 at 3:11 PM
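
The general idea of turning nested config into rows that SQL can query can be sketched in a few lines. This is not YamlQL's implementation. As assumptions for illustration, the sketch swaps in Python's standard-library `sqlite3` for DuckDB, uses a hand-written dict in place of a parsed YAML manifest, and the names `flatten` and `config_values` are hypothetical.

```python
import sqlite3

# Stand-in for a parsed YAML document (hypothetical manifest, not from YamlQL).
manifest = {
    "kind": "Deployment",
    "metadata": {"name": "web", "labels": {"app": "web"}},
    "spec": {
        "replicas": 3,
        "containers": [
            {"name": "nginx", "image": "nginx:1.27"},
            {"name": "sidecar", "image": "envoy:1.31"},
        ],
    },
}


def flatten(node, prefix=""):
    """Yield (path, value) pairs for every scalar leaf in a nested structure."""
    if isinstance(node, dict):
        for key, child in node.items():
            yield from flatten(child, f"{prefix}.{key}" if prefix else key)
    elif isinstance(node, list):
        for index, child in enumerate(node):
            yield from flatten(child, f"{prefix}[{index}]")
    else:
        yield prefix, str(node)


rows = list(flatten(manifest))

# SQLite is swapped in for DuckDB so the sketch needs only the standard library.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE config_values (path TEXT, value TEXT)")
conn.executemany("INSERT INTO config_values VALUES (?, ?)", rows)

# Plain SQL over the flattened config: list every container image.
images = [
    value
    for (value,) in conn.execute(
        "SELECT value FROM config_values WHERE path LIKE '%.image' ORDER BY path"
    )
]
print(images)
```

A single path/value table is the simplest possible shape. A real tool would more likely emit one table per repeated structure (for example, one row per container) so that joins and aggregates read naturally.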
DuckDB-WASM offers browser-based querying, but users have reported that it lacks feature parity with standard DuckDB. This can lead to unexpected performance issues & increased development complexity. Know its limits! 🤔 #WebAssembly 2/6
November 2, 2025 at 4:00 AM
Harvard Law Library has created a brilliant website for their Data.gov archive—there's a static S3-style data store, and the front-end is a DuckDB-Wasm in-browser database using HTTP range requests to retrieve the needed data. In short, there's...no website? Kind of?
Rethinking Data Discovery for Libraries and Digital Humanities | Library Innovation Lab
Woman using a Macey vertical filing cabinet (detail, 1903). Source: Wikimedia Commons.
lil.law.harvard.edu
October 31, 2025 at 5:51 PM
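
Why range requests make a "no backend" site workable can be shown with a toy model. The sketch below is an assumption-laden illustration, not the Library Innovation Lab's code or DuckDB-Wasm's reader: the file layout (data blocks, a JSON footer, a 4-byte length) is hypothetical and only loosely modeled on footer-indexed formats such as Parquet, and `fetch_range` is an in-memory stand-in for an HTTP GET carrying a `Range: bytes=start-end` header.

```python
import json
import struct

# Build a toy "remote" file: data blocks, then a JSON footer that indexes them,
# then a 4-byte little-endian footer length. The layout is hypothetical.
blocks = {
    "agency_a": b"a" * 5000,
    "agency_b": b"b" * 7000,
    "agency_c": b"c" * 3000,
}
body = b""
index = {}
for name, payload in blocks.items():
    index[name] = [len(body), len(payload)]  # [offset, length]
    body += payload
footer = json.dumps(index).encode()
remote_file = body + footer + struct.pack("<I", len(footer))

bytes_fetched = 0


def fetch_range(start, end):
    """Stand-in for an HTTP GET with `Range: bytes=start-end` (end inclusive)."""
    global bytes_fetched
    chunk = remote_file[start : end + 1]
    bytes_fetched += len(chunk)
    return chunk


# 1. Read the last 4 bytes to learn the footer size.
size = len(remote_file)  # a real client could learn this from a HEAD request
(footer_len,) = struct.unpack("<I", fetch_range(size - 4, size - 1))

# 2. Read the footer to find where each block lives.
footer_start = size - 4 - footer_len
remote_index = json.loads(fetch_range(footer_start, size - 5))

# 3. Read only the block the query needs.
offset, length = remote_index["agency_b"]
result = fetch_range(offset, offset + length - 1)

print(f"fetched {bytes_fetched} of {size} bytes")
```

The client transfers the footer plus one block rather than the whole file, which is the property that lets a static object store serve queries with no server-side query engine.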
🚀 DuckDB Labs is hiring a DevOps Engineer and a Sales Engineer in Amsterdam 🇳🇱

🤔 Want to join one of the fastest-growing database startups?
Work with the team that disrupted analytical data management and grow with us in your new role.

👉 Apply now until November 7: duckdblabs.com#jobs
Services for DuckDB
DuckDB Labs provides services for the DuckDB data management system directly from its creators.
https://duckdblabs.com#jobs
October 31, 2025 at 2:18 PM
Use DuckDB-WASM to query TB of data in browser https://lil.law.harvard.edu/blog/2025/10/24/rethinking-data-discovery-for-libraries-and-digital-humanities/ #HackerNews #DuckDB #WASM #Data #Query #Browser #Libraries #Digital #Humanities

mastodon.social
October 31, 2025 at 7:14 PM
📢 A new DuckDB in Research podcast episode is out.

📈 In this week's episode, Jack Waudby interviews Mihail Stoian (@mihailstoian.bsky.social), PhD student at the Data Systems Lab, UT Nuremberg about the Parachute approach for robust query processing.

🎧 Listen at duckdb.org/science/miha...
October 30, 2025 at 10:04 AM
The OPEN SOURCE ANALYTICS CONFERENCE is coming up!

Don’t miss your chance to see great talks on Apache Doris, DuckDB, Apache Superset, Apache Iceberg, Lakekeeper, AI, and more.

Register here: osacon.io

#opensource #developers #ai
OSA CON
The Open Source Analytics Conference (OSA CON) is the go-to conference for all things open-source analytics.
osacon.io
October 29, 2025 at 8:47 PM