Interested in randomized algorithms and space-efficient data structures
https://igor.martayan.org
@curiouscoding.nl and I have been optimizing the computation of minimizers down to the smallest detail.
The result is an order of magnitude faster than existing methods ; processing an entire human genome takes only 4s on my laptop! 🧵
RECOMB-CG: 13 February
RECOMB-RSG: 15 February
RECOMB-Privacy: 9 March
RECOMB-Seq: 12 March (abstract registration)
RECOMB-Arch: 12 March (abstract registration)
RECOMB-Genetics: 13 March
#RECOMB2026 #deadlines
RECOMB-CG: 13 February
RECOMB-RSG: 15 February
RECOMB-Privacy: 9 March
RECOMB-Seq: 12 March (abstract registration)
RECOMB-Arch: 12 March (abstract registration)
RECOMB-Genetics: 13 March
#RECOMB2026 #deadlines
arxiv.org/abs/2602.03525
TLDR:
ZOR filters are STATIC filters with false positives.
-Almost memory optimal: <1% overhead over the theoretical lower bound (!!!)
-Fast queries: ~100 ns
-Construction cannot fail
A thread:
arxiv.org/abs/2602.03525
TLDR:
ZOR filters are STATIC filters with false positives.
-Almost memory optimal: <1% overhead over the theoretical lower bound (!!!)
-Fast queries: ~100 ns
-Construction cannot fail
A thread:
BiRank & QuadRank: single-cache-miss rank queries that are double the throughput of other Rust crates and fully saturate the memory bandwidth.
Side effect: QuadFm is smaller and 2-4x faster than the next-best FM-index.
github.com/RagnarGrootK...
raw.githubusercontent.com/RagnarGrootK...
BiRank & QuadRank: single-cache-miss rank queries that are double the throughput of other Rust crates and fully saturate the memory bandwidth.
Side effect: QuadFm is smaller and 2-4x faster than the next-best FM-index.
github.com/RagnarGrootK...
raw.githubusercontent.com/RagnarGrootK...
www.biorxiv.org/content/10.6...
www.biorxiv.org/content/10.6...
mirdita.org
mirdita.org
For anyone still using Bowtie2 for filtering or depletion of host sequences or specifics, I can recommend Deacon from @bedec.bsky.social . It is so much faster and easier than Bowtie2, and its performance is equal or better (tested with metagenomes and mitogenomes).🧬 & 🖥️
For anyone still using Bowtie2 for filtering or depletion of host sequences or specifics, I can recommend Deacon from @bedec.bsky.social . It is so much faster and easier than Bowtie2, and its performance is equal or better (tested with metagenomes and mitogenomes).🧬 & 🖥️
We introduce new ideas to revisit the notion of sampling with window guarantees, also known as minimizers.
A thread:
We introduce new ideas to revisit the notion of sampling with window guarantees, also known as minimizers.
A thread:
You can find it at gitlab (gitlab.com/rahmannlab/c...) or install it via PyPI or Bioconda.
You can find it at gitlab (gitlab.com/rahmannlab/c...) or install it via PyPI or Bioconda.
1/
1/
We invented a new algorithm analysis framework to find out.
We invented a new algorithm analysis framework to find out.
It's the first tool that builds on Sassy, the approximate-DNA-searching tool that @rickbitloo.bsky.social and myself developed earlier this year, specifically with this application in mind.
www.biorxiv.org/content/10.1...
Want to get started? github.com/rickbeeloo/b...
It's the first tool that builds on Sassy, the approximate-DNA-searching tool that @rickbitloo.bsky.social and myself developed earlier this year, specifically with this application in mind.
www.biorxiv.org/content/10.1...
www.biorxiv.org/content/10.1...
I want to see the generated assembly and which lines are hot.
I want to see the generated assembly and which lines are hot.
www.nature.com/articles/d41...
www.nature.com/articles/d41...
engineering.fb.com/2025/10/06/d...
engineering.fb.com/2025/10/06/d...