Lightnews — Scholar-powered news

Alex Miller

@alexmillerdb.bsky.social

Does anyone know of a good webapp or discord bot or something to help manage a reading group? Something that keeps a list of suggesting things to read, can do voting on the next thing to read, and maybe has a bit of curation support for when the to-read list gets unmanageable?

February 1, 2026 at 2:59 AM

Reposted by Alex Miller

South Bay Systems

@southbaysystems.xyz

Our next event will be on January 21st, featuring speakers from (the just-finishing) CIDR! Come to Databricks to hear about:
* DuckDB on xNVMe by @pinartozun.bsky.social of ITU
* Spilling in QP by Maximilian Kuschewski of TUM
* NPUs in DBs by Alexander Baumstark of TU-Ilmenau
luma.com/8a54z94d

South Bay Systems: Innovative Data Systems Research · Luma

Welcome to another edition of South Bay Systems! This time we bring you three wonderful talks from authors at the just-finishing Conference in Innovative Data…

luma.com

January 5, 2026 at 7:46 PM

Alex Miller

@alexmillerdb.bsky.social

I’ve recently seen multiple, unrelated instances of people referencing Bf-trees. Good job, @benjdd.com.

December 30, 2025 at 8:09 PM

Alex Miller

@alexmillerdb.bsky.social

Asking a coding agent to run `cargo build` and read referenced source files for context has made LLMs significantly more helpful and accurate at actually understanding why a compilation error is happening and being able to explain an appropriate fix. Much better than copy-pasting into online LLMs.

December 28, 2025 at 8:10 PM

Alex Miller

@alexmillerdb.bsky.social

It’s frustrating how a bunch of database research from 1980s and before basically doesn’t exist anymore because it’s not on the internet, and it’s not even in the IEEE/ACM indexes of published work.

November 24, 2025 at 8:13 PM

Alex Miller

@alexmillerdb.bsky.social

Does anyone have links to good writing on the sort of soft skills you learn from working in larger organizations about how to work in larger organizations as an IC? The overall space of soft skills dealing with the pretty common ways that large corporations behave.

November 21, 2025 at 6:55 PM

Alex Miller

@alexmillerdb.bsky.social

Looking forward to reading about which disaggregated architecture HorizonDB aligned itself with

November 19, 2025 at 7:02 PM

Alex Miller

@alexmillerdb.bsky.social

So what’s the feature set difference between pgDog, Neki, and multigres?

November 15, 2025 at 11:56 PM

Reposted by Alex Miller

South Bay Systems

@southbaysystems.xyz

Our next event is on November 19th at StarTree’s office in downtown Mountain View. Come hear about Morel from Julian Hyde and and Query Optimization as a Service from Yuanyuan Tian!
luma.com/xygolo9c

South Bay Systems: Morel / Query Optimization as a Service · Luma

Welcome to another edition of South Bay Systems! This time we bring you two wonderful talks: Julian Hyde will be speaking about Morel, a new functional…

luma.com

November 7, 2025 at 1:40 AM

Reposted by Alex Miller

South Bay Systems

@southbaysystems.xyz

The recording from our last South Bay Systems meetup is now available!
youtu.be/f1bz3efUJpM

Apache Pinot on Object Storage & JSON in Apache Doris

YouTube video by South Bay Systems

youtu.be

November 1, 2025 at 5:56 PM

Reposted by Alex Miller

Tyler Hillery

@tylerhillery.com

@abigalekim.bsky.social @xiangpeng.systems and I are kicking off Madison Systems with a coffee chat on Sunday, Nov 9th. Come nerd out on systems!

luma.com/v69tvpla

Madison Systems Coffee Chat · Luma

If you’re working on or are interested in anything in the space of software internals (compilers, databases, operating systems, etc.), come grab a cup of…

luma.com

October 23, 2025 at 6:52 PM

Alex Miller

@alexmillerdb.bsky.social

[PVLDB] Enhancing Transaction Processing through Indirection Skipping
www.vldb.org/pvldb/v...

Whereas VMCache improve pointer swizzing's complexity by removing the swizzling, this work points out that page and frame hints are highly effective, and okay if they're wrong.

October 17, 2025 at 4:02 AM

Alex Miller

@alexmillerdb.bsky.social

This reminded me I've been sitting on draft blog posts about Copy-and-Patch JIT compilation for a while, and so I've finally published the first chunk of it: a minimal tutorial and explanation of how and why Copy-and-Patch actually works.

Start at transactional.blog/copy-and-pat...

October 13, 2025 at 11:26 PM

Reposted by Alex Miller

South Bay Systems

@southbaysystems.xyz

South Bay Systems returns on October 27th at Adobe in downtown San Jose. We have an Analytics-on-Object-Storage double feature this time starring two different Apache projects: Apache Pinot and Apache Doris. (Talk descriptions below.)

Register now!
luma.com/9o6bahgc

South Bay Systems: Apache Pinot on Object Storage / Variants in Apache Doris · Luma

Welcome to another edition of South Bay Systems! This time, we'll have a double feature! First we'll have Songqiao Su and Raghav Yadav talking about…

luma.com

October 13, 2025 at 6:47 PM

Reposted by Alex Miller

South Bay Systems

@southbaysystems.xyz

There was an accident with the recording where audio wasn't captured, so instead we can offer a recording from one of Jakob's practice runs on twitch: www.twitch.tv/videos/25845...

October 7, 2025 at 5:26 PM

Reposted by Alex Miller

Qian Li

@qianli.dev

Had a fun time at the South Bay Systems meetup last night. Thanks @yugabytedb.bsky.social for hosting!

@codedrift.social gave a great talk on WebAssembly: what it is (and isn't), how it connects to WASI, and promising projects. He cuts through a lot of the hype vs. reality. Recording coming soon.

October 3, 2025 at 10:38 PM

Alex Miller

@alexmillerdb.bsky.social

[ASPLOS'25] Fusion: An Analytics Object Store Optimized for Query
Pushdown
www.cs.princeton.edu...

Tightly integrating an Iceberg catalog with an object store means that one could make file-format aware erasure coding decisions, to permit pushing down filters and aggregations.

September 28, 2025 at 11:42 PM

Alex Miller

@alexmillerdb.bsky.social

[VLDB] Towards Principled, Practical Document Database Design
www.vldb.org/pvldb/v...

If you've ever wished that there was a document database equivalent for relational databases' 3NF-style schema design guidance, then this is the paper for you.

September 23, 2025 at 5:23 PM

Alex Miller

@alexmillerdb.bsky.social

[arXiv] On the Theoretical Limitations of Embedding-Based Retrieval
arxiv.org/abs/2508.2...

It's impossible to retrieve all combinations of pairs of documents post-embedding. Thus, there's usecases that vector search won't do well at. Conversely, BM25 excels in these cases.

September 21, 2025 at 3:38 AM

Alex Miller

@alexmillerdb.bsky.social

I text-to-speech papers often, and www.paper2audio.com finally did the one thing that I was hoping AI would enable: replace tables/figures/diagrams with a summary of what is being shown. It makes table/diagram-heavy papers actually comprehensible. There's iOS and Android apps, and it's free.

September 11, 2025 at 9:21 PM

Alex Miller

@alexmillerdb.bsky.social

[VLDB] NaviX: A Native Vector Index Design for Graph DBMSs With Robust Predicate-Agnostic Search Performance
www.vldb.org/pvldb/v...

It feels like a follow-on/improvement to ACORN. Also interesting to see HNSW built directly on a graph database working well.

September 5, 2025 at 5:11 AM

Alex Miller

@alexmillerdb.bsky.social

Someone should go implement a bulk loading into btree mechanism relying on man7.org/linux/man-pa... to be able to prepare a tree of data, and then just atomically drop it into the main btree file as a sub-tree, as that'd be pretty cool to read about.

August 21, 2025 at 7:44 PM

Alex Miller

@alexmillerdb.bsky.social

There’s surprisingly been no good citation for follower reads and the trade-offs therein. Super excited that this finally got published. law-theorem.com had “Coming soon!” for a few years 😭

PVLDB @pvldb.bsky.social · Aug 15

Vol:18 No:9 → The LAW theorem: Local Reads and Linearizable Asynchronous Replication
👥 Authors: Emmanouil Giortamis, Antonios Katsarakis, Vasilis Gavrielatos, Pramod Bhatotia, Aleksandar Dragojevic, Boris Grot, Vijay Nagarajan,...
📄 PDF: https://www.vldb.org/pvldb/vol18/p2831-giortamis.pdf

Thumbnail: The LAW theorem: Local Reads and Linearizable Asynchronous Replication

August 20, 2025 at 4:51 PM

Alex Miller

@alexmillerdb.bsky.social

For anyone else trying to catch up on DBSP, my recommended flow of learning is:
1. Watch the talk: www.youtube.com/watch?v=omOH... (h/t @wslim.bsky.social)
2. Read the spec/book: mihaibudiu.github.io/work/dbsp-sp... (h/t @avi.im)
3. Read the VLDB paper

List is ordered by assumed knowledge of reader

August 19, 2025 at 9:21 PM

Alex Miller

@alexmillerdb.bsky.social

In the Postgres-style MVCC vs MySQL-style MVCC debates, I'd really love to see an implementation of time-separated btrees (dl.acm.org/doi/pdf/10.1...) evaluated. It's CoW-BTree style "your path down the tree prunes out versions you don't want to see", but update-in-place and copies only on splits.

August 17, 2025 at 6:29 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news