return documents that match any token in the query string
faster than BM25 when you just need a binary match (e.g. "showing 36 of 131,072" in your search UI) as it skips score computation entirely
docs: tpuf.link/containsany
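a minimal sketch of the idea, not turbopuffer's API: a toy inverted index where an any-token match is just a union of postings sets, with no scores computed. the whitespace tokenizer and the names `build_index` and `contains_any` are assumptions made for illustration.

```python
from collections import defaultdict


def build_index(docs):
    """Map each token to the set of doc ids containing it (toy whitespace tokenizer)."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in text.lower().split():
            index[token].add(doc_id)
    return index


def contains_any(index, query):
    """Return ids of docs matching at least one query token. No scoring is done."""
    matches = set()
    for token in query.lower().split():
        matches |= index.get(token, set())
    return matches


docs = {1: "fast vector search", 2: "object storage", 3: "full text search"}
index = build_index(docs)
matches = contains_any(index, "search kafka")
```

`len(matches)` is all a "showing N of M" counter needs, which is why skipping the scoring step can pay off.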
fix: precompute it in a flat array
trade memory for cache-friendly access → 8% faster puffin'
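the post doesn't say what was precomputed, so this is only a generic sketch of the pattern: take a value that would otherwise be looked up or derived per access and lay it out in one contiguous array indexed by id. `precompute_flat` and its parameters are hypothetical names.

```python
from array import array


def precompute_flat(values_by_id, size, default=0.0):
    """Copy sparse per-id values into one contiguous array of doubles.

    Costs `size` slots of memory up front; afterwards each lookup is a
    single indexed read instead of a hash lookup or recomputation.
    """
    flat = array("d", [default]) * size
    for item_id, value in values_by_id.items():
        flat[item_id] = value
    return flat


flat = precompute_flat({0: 1.5, 3: 2.5}, size=5)
```

python won't show the cache effect itself; the point is the layout, where ids with no value still occupy a slot so that access stays a plain offset.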
100B+ vectors @ 50ms p50 / 200ms p99 latency
blog: tpuf.link/ann-v3
tpuf now dynamically adjusts the encoding of postings lists based on density, like roaring bitmaps
result → up to 26% faster search for queries whose terms match many documents
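a rough sketch of density-based encoding in the roaring style, not tpuf's actual format: doc ids are split into 65,536-id chunks, sparse chunks stay as sorted arrays, dense chunks become bitmaps. the 4096 cutoff is roaring's; the thresholds and encodings tpuf uses aren't stated in the post, and `encode_postings` / `contains` are hypothetical names.

```python
from bisect import bisect_left

ARRAY_MAX = 4096  # Roaring's cutoff: above this a chunk is stored as a bitmap


def encode_postings(doc_ids):
    """Group ids by their high 16 bits and pick an encoding per chunk."""
    chunks = {}
    for doc_id in sorted(set(doc_ids)):
        chunks.setdefault(doc_id >> 16, []).append(doc_id & 0xFFFF)
    encoded = {}
    for high, lows in chunks.items():
        if len(lows) <= ARRAY_MAX:
            encoded[high] = ("array", lows)  # sparse: sorted 16-bit values
        else:
            bitmap = 0
            for low in lows:
                bitmap |= 1 << low  # dense: one bit per possible id
            encoded[high] = ("bitmap", bitmap)
    return encoded


def contains(encoded, doc_id):
    """Membership test that dispatches on the chunk's encoding."""
    container = encoded.get(doc_id >> 16)
    if container is None:
        return False
    kind, data = container
    low = doc_id & 0xFFFF
    if kind == "array":
        i = bisect_left(data, low)
        return i < len(data) and data[i] == low
    return (data >> low) & 1 == 1


# 5000 ids land in chunk 0 (dense), a single id lands in chunk 1 (sparse)
encoded = encode_postings(list(range(0, 10000, 2)) + [70000])
```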
• tighter compression
• less KV overhead
• better MAXSCORE interaction
up to 10x smaller indexes → up to 20x faster text search!
tpuf.link/fts-index
takeaways:
- sometimes longer queries are *faster*
- the longer the query, the less efficiently it scales
- fitted curves on raw data are immensely gratifying
tpuf.link/bm25-latency
~10x lower index queue time → new documents get indexed sooner → faster queries on new data with less WAL scanning
built entirely on object storage, no kafka
(chart: max create_index time in queue, gcp us-east4)
"This doesn't exist without turbopuffer. If we used a traditional search provider, our bill would be in the millions each year." - Justin Watts, Distinguished Engineer
tpuf.link/telus
use ContainsTokenSequence in your tpuf FTS queries to match documents on an exact phrase
docs: tpuf.link/phrase-match
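what "exact phrase" means in token terms, as a minimal sketch: the query tokens must appear in the document consecutively and in order. this isn't turbopuffer's implementation or query syntax; `matches_phrase` and the whitespace tokenizer are assumptions for illustration.

```python
def matches_phrase(doc_text, phrase):
    """True if the phrase's tokens occur consecutively, in order, in the doc."""
    doc_tokens = doc_text.lower().split()
    phrase_tokens = phrase.lower().split()
    n = len(phrase_tokens)
    if n == 0:
        return True
    # Slide a window of n tokens across the doc and compare it to the phrase.
    return any(
        doc_tokens[i:i + n] == phrase_tokens
        for i in range(len(doc_tokens) - n + 1)
    )


doc = "the quick brown fox"
```

`matches_phrase(doc, "quick brown")` is true while `matches_phrase(doc, "brown quick")` is false: same tokens, wrong order.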
combine it with group_by and filters for olap-style puffin'
today: long LLM queries, wide SIMD lanes
for FTS v2, we use a vectorized MAXSCORE algorithm instead of WAND, because dumb & serial beat smart & random algorithms on modern CPUs
tpuf.link/maxscore
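for intuition, a scalar textbook-style MAXSCORE sketch, not tpuf's vectorized version: terms are sorted by their maximum score contribution, and once the low-impact ("non-essential") terms can't lift a document past the current top-k threshold on their own, documents that appear only in those lists are skipped. assumes positive per-term scores and non-empty postings; `maxscore_top_k` and the dict-based postings layout are hypothetical.

```python
import heapq


def maxscore_top_k(postings, k):
    """postings: {term: {doc_id: score}}. Returns top-k (score, doc_id), best first."""
    # Sort terms by upper bound; prefix[i] = sum of the i smallest upper bounds.
    terms = sorted(postings, key=lambda t: max(postings[t].values()))
    prefix = [0.0]
    for t in terms:
        prefix.append(prefix[-1] + max(postings[t].values()))

    heap = []            # min-heap holding the current top-k
    threshold = 0.0      # k-th best score so far (0 until the heap is full)
    first_essential = 0  # terms[:first_essential] are non-essential

    for doc in sorted(set().union(*postings.values())):
        essential = terms[first_essential:]
        if not any(doc in postings[t] for t in essential):
            continue  # only in non-essential lists: cannot beat the threshold
        score = sum(postings[t].get(doc, 0.0) for t in essential)
        # Add non-essential terms, highest bound first, while the doc can still win.
        for j in range(first_essential - 1, -1, -1):
            if score + prefix[j + 1] <= threshold:
                break
            score += postings[terms[j]].get(doc, 0.0)
        if len(heap) < k:
            heapq.heappush(heap, (score, doc))
        elif score > heap[0][0]:
            heapq.heapreplace(heap, (score, doc))
        if len(heap) == k:
            threshold = heap[0][0]
        while first_essential < len(terms) and prefix[first_essential + 1] <= threshold:
            first_essential += 1
    return sorted(heap, reverse=True)


postings = {
    "a": {1: 0.5, 2: 0.4, 3: 0.3, 4: 0.2},
    "b": {2: 2.0, 5: 1.5},
    "c": {1: 3.0, 6: 2.5},
}
top = maxscore_top_k(postings, k=2)
```

in this run docs 3 and 4 are never scored: by the time they come up, term "a" alone can't beat the threshold.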
"We went from 3 minutes to <10 seconds, with much better recall, which means fewer agent loops." - Maria Vinokurskaya, Founding Engineer
tpuf.link/playerzero
tpuf's WAL scan is now up to 2x faster
turbopuffer is now on par with Tantivy and Lucene for many queries, more to come
v2 now in beta. 2 PRs away from all query plans being implemented. will be enabled in prod for everyone shortly.
copy your documents wherever you need to puff with a 50% discount - no more restrictions
good for manual backups, testing, and branching
as part of our recent patch_by_filter launch, we clarified the ACID properties users can expect from turbopuffer
read more here: turbopuffer.com/docs/guaran...
simplicity scales
[v3 in beta, unfiltered search, 1024D, k=10, 92% recall]