www.vldb.org/pvldb/v...
Whereas VMCache improves on pointer swizzling's complexity by removing the swizzling, this work points out that page and frame hints are highly effective, and it's okay if they're wrong.
Pushdown
www.cs.princeton.edu...
Tightly integrating an Iceberg catalog with an object store means that one could make file-format-aware erasure coding decisions, which permits pushing down filters and aggregations.
www.vldb.org/pvldb/v...
If you've ever wished that there was a document database equivalent for relational databases' 3NF-style schema design guidance, then this is the paper for you.
"Extending SQL to Return a Subdatabase" (dl.acm.org/doi/pdf/10.1...) also seems related?
arxiv.org/abs/2508.2...
It's impossible to retrieve all combinations of pairs of documents post-embedding. Thus, there are use cases that vector search won't do well at. Conversely, BM25 excels in these cases.
www.vldb.org/pvldb/v...
It feels like a follow-on/improvement to ACORN. Also interesting to see HNSW built directly on a graph database working well.
> We compared the end-to-end performance of GaussDB with System-X and CockRoachDB on TPC-C with 10,000 warehouses.
arxiv.org/abs/2403.0...
TreeTracker gives a very simple breakdown of what the core differences are between a naive binary join and an optimal multi-way join.
openproceedings.org/...
Huawei Taurus folk plugged Orca into MySQL to get a better query optimizer for OLAP queries in MySQL.
arxiv.org/abs/2502.0...
"if it doesn't work on the cloud does it even matter?" but applied to kernel bypass networking.
You’re also welcome to join eatonphil.com/discord.html
arxiv.org/abs/2209.0...
Which was an interesting skim, partly just because I've never seen trits (the base-3 analogue of bits) applied to solve a problem effectively before.
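For anyone who hasn't run into trits: a minimal sketch of converting an integer to and from base-3 digits. This only illustrates what a trit is; it is not taken from the linked paper, and the function names `to_trits` and `from_trits` are made up for the example.

```python
def to_trits(n: int) -> list[int]:
    """Return the base-3 digits of a non-negative integer, most significant first."""
    if n < 0:
        raise ValueError("n must be non-negative")
    if n == 0:
        return [0]
    digits = []
    while n > 0:
        n, remainder = divmod(n, 3)  # peel off the least significant trit
        digits.append(remainder)
    return digits[::-1]


def from_trits(trits: list[int]) -> int:
    """Inverse of to_trits: fold base-3 digits back into an integer."""
    value = 0
    for t in trits:
        value = value * 3 + t
    return value
```

Each trit takes one of three values (0, 1, 2), so a trit carries log2(3), roughly 1.58, bits of information.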
arxiv.org/abs/2505.0...
I've seen a lot of Morton/Hilbert curves before, but learning a space-filling curve and adapting it to the data distribution is new to me.
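For contrast with a learned curve, here is a minimal sketch of the fixed Morton (Z-order) curve in two dimensions, which just interleaves the bits of the coordinates regardless of how the data is distributed. The function name `morton_encode` and the 16-bit default are assumptions for illustration; nothing here comes from the linked paper.

```python
def morton_encode(x: int, y: int, bits: int = 16) -> int:
    """Interleave the low `bits` bits of x and y into one Z-order key.

    Bit i of x lands at position 2*i, bit i of y at position 2*i + 1.
    """
    code = 0
    for i in range(bits):
        code |= ((x >> i) & 1) << (2 * i)
        code |= ((y >> i) & 1) << (2 * i + 1)
    return code


# Sorting points by their Morton key groups spatially nearby points together.
points = [(3, 3), (0, 0), (1, 0), (0, 1), (2, 3)]
ordered = sorted(points, key=lambda p: morton_encode(*p))
```

The mapping is the same for every dataset, which is exactly the part a learned curve replaces.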
Databases via Dynamic Partitioning
arxiv.org/abs/2505.0...
Which fuses RBAC into the vector partitioning scheme, so that it's not just a post-filter on results.