Wes McKinney
wesmckinney.com
Wes McKinney
@wesmckinney.com
Principal Architect @posit.co, GP Composed Ventures, Co-founder Voltron Data. Open source: Apache Arrow, pandas, Ibis. "Python for Data Analysis" book
roborev does GitHub now! Just shipped the initial GitHub PR experience, multi-agent / multi-prompt reviews that synthesize to a single coherent review response:

www.roborev.io/integrations...
GitHub Integration
Automatically review GitHub PRs and post results as bot comments
www.roborev.io
February 9, 2026 at 8:22 PM
Reposted by Wes McKinney
Had a great time on @posit.co 's Test Set pod w/ @mchow.com @hadley.nz @wesmckinney.com!

We talk about moving between R, SQL, python and the strengths of different analytical tools for diff data tasks. You won't believe what proprietary language gets a shout-out (Stata!)

posit.co/thetestset/e...
Episode 14 – Emily Riederer: Column selectors, data quality, and learning in public - Posit
posit.co
January 29, 2026 at 1:59 PM
roborev can now analyze and systematically refactor your code for you — this is essential to managing the poor code quality of agents in rapidly expanding agentic code bases. Game changer for me

www.roborev.io/guides/assis...
Code Analysis and Assisted Refactoring
Run built-in code analysis and automatically apply fixes with roborev analyze and roborev fix
www.roborev.io
January 29, 2026 at 6:33 PM
roborev now supports Windows (x64 and ARM)! Lots of new features and quality-of-life features, too (such as `y` to copy-paste/yank review to clipboard to paste into your agent session)

www.roborev.io/installation/
Installation
Install roborev on your system
www.roborev.io
January 23, 2026 at 5:29 PM
A funny thing is happening: the more I build with agents, the less I want to use Python. I explore this in my latest "From Human Ergonomics to Agent Ergonomics"

wesmckinney.com/blog/agent-e...
From Human Ergonomics to Agent Ergonomics – Wes McKinney
wesmckinney.com
January 20, 2026 at 3:04 PM
Reposted by Wes McKinney
Join Wes McKinney (@wesmckinney.com) and the Pixeltable @pixeltable.net team, Marcel Kornacker and Alison Hill (@apreshill.com), for a fireside chat hosted by Hugo Bowne-Anderson on Dec 16!

They will discuss data processing and #AI workflows for multimodal data 📊

Register: luma.com/2y04b6nf
Building Multimodal AI Workflows with Pixeltable · Luma
The challenge with multimodal AI isn't calling models. It's everything else. Videos need to become frames. Audio needs transcription. Embeddings need to stay…
luma.com
December 12, 2025 at 4:20 PM
Reposted by Wes McKinney
Super interesting @wesmckinney.com insight: AI may stagnate Open Source - because users will be much more inclined to adopt software that AI tools can help them with, rather than newer tools that it isn't trained on yet.

Kind of a "snake eating tail" problem as far as generating new training data.
December 6, 2025 at 8:53 PM
Reposted by Wes McKinney
“Parquet is great… until GPUs, multimodal data & million-column schemas show up.”

The creator of pandas/Arrow @wesmckinney.com digs into Arrow vs Parquet, new columnar + table formats, DataFusion/DuckDB, metadata headaches, and what AI coding agents mean for open source infra.

Episode link below
December 2, 2025 at 5:30 PM
Reposted by Wes McKinney
🚀 Launching Supermetal — data replication that just works.

Sync databases to warehouses in real-time or batch — no Kafka, no JVM, no Debezium. Built in Rust & Apache Arrow.

Try it → trial.supermetal.io
Launch post → supermetal.io/blog/launch
Supermetal - High Performance Data Replication Platform
Move terabytes of data with minimal resources using our Rust-based CDC platform. Single binary, zero dependencies.
supermetal.io
November 5, 2025 at 6:50 PM
Reposted by Wes McKinney
The future of data connectivity is columnar. Today we launched
@columnar.tech to accelerate the shift from slow, row-oriented APIs like ODBC and JDBC to >10x faster alternatives powered by @arrow.apache.org. Learn more 👇
Announcing Columnar
Back to the future of data connectivity
columnar.tech
October 29, 2025 at 10:51 PM
Reposted by Wes McKinney
It was such a pleasure to join @hadley.nz, @wesmckinney.com, and @mchow.com on THE TEST SET! You can check out the two parts of our conversation here:

🍕 posit.co/thetestset/e...
🤖 posit.co/thetestset/e...
October 27, 2025 at 2:48 PM
Reposted by Wes McKinney
Our SIGMOD paper with our friends at Tsinghua + @wesmckinney.com + @pateljm.bsky.social on creating a next generation open-source data file format is out. F3 is a future-proof file format avoids the mistakes of Parquet.
📄 Paper: db.cs.cmu.edu/papers/2025/...
📁 Code: github.com/future-file-...
October 1, 2025 at 1:49 PM
Reposted by Wes McKinney
In September the @columnar.tech crew are headed to PyData Paris 2025 and the first ever Apache Arrow Summit. The organizer @quantstack.bsky.social is a dedicated supporter of @arrow.apache.org. We’re delighted to be sponsoring the event.
Welcome to our new sponsor Columnar !
August 18, 2025 at 2:30 PM
Reposted by Wes McKinney
Data science junkies, get ready! 🚀 "The Test Set" #podcast trailer is here for your viewing pleasure.

Tune in July 1st and every Tuesday after for new episodes with hosts @mchow.com, @hadley.nz, and @wesmckinney.com as they welcome thought leaders in #DataScience.

Subscribe now: pos.it/thetestset
June 18, 2025 at 4:58 PM
Reposted by Wes McKinney
🚀 Introducing **Bauplan**

A serverless, code-native platform for building data and AI pipelines — directly on your object store. No clusters. No notebooks. No GUI based workflows.

Just Python + SQL + S3.

👉 www.bauplanlabs.com/blog/hello-b...
April 16, 2025 at 2:14 PM
Reposted by Wes McKinney
1/ We just raised $17M to build the multimodal data stack for Physical AI! 🚀

Lead: pointnine.com
With: costanoa.vc, Sunflower Capital,
@seedcamp.com
Angels including: @rauchg.blue, Eric Jang, Oliver Cameron, @wesmckinney.com , Nicolas Dessaigne, Arnav Bimbhet

Thesis: rerun.io/blog/physica...
March 20, 2025 at 6:13 PM