Roland Faure
rfaure.bsky.social
Roland Faure
@rfaure.bsky.social
Sequence bioinfomatician, algorithms, methods.
Postdoc in Institut Pasteur in Rayan Chikhi's lab
Our preprint on our new metagenomic HiFi assembler Alice is out 🥳 Based on a *new sketching method* (🧵1/6)
👉 Preprint www.biorxiv.org/content/10.1...
👉 Github github.com/rolandfaure/...
Alice: fast and haplotype-aware assembly of high-fidelity reads based on MSR sketching
We introduce Mapping-friendly Sequence Reduction (MSR) sketches, a sketching method for high-fidelity (HiFi) long reads, and Alice, an assembler that operates directly on these sketches. MSR produces ...
www.biorxiv.org
October 3, 2025 at 2:51 PM
Reposted by Roland Faure
🌎👩‍🔬 For 15+ years biology has accumulated petabytes (million gigabytes) of🧬DNA sequencing data🧬 from the far reaches of our planet.🦠🍄🌵

Logan now democratizes efficient access to the world’s most comprehensive genetics dataset. Free and open.

doi.org/10.1101/2024...
September 3, 2025 at 8:39 AM
Reposted by Roland Faure
Preprint out for myloasm, our new nanopore / HiFi metagenome assembler!

Nanopore's getting accurate, but

1. Can this lead to better metagenome assemblies?
2. How, algorithmically, to leverage them?

with co-author Max Marin @mgmarin.bsky.social, supervised by Heng Li @lh3lh3.bsky.social

1 / N
High-resolution metagenome assembly for modern long reads with myloasm https://www.biorxiv.org/content/10.1101/2025.09.05.674543v1
September 7, 2025 at 11:35 PM
Reposted by Roland Faure
I am happy to share our new preprint introducing MADRe - a pipeline for Metagenomic Assembly-Driven Database Reduction, enabling accurate and computationally efficient strain-level metagenomic classification.

🔗https://www.biorxiv.org/content/10.1101/2025.05.12.653324v1
1/9
May 16, 2025 at 8:37 AM
Reposted by Roland Faure
Starting #RECOMBseq with @rayanchikhi.bsky.social 's keynote. Here stressing our responsibility as scientists to enable access to a common good: genomic data
April 24, 2025 at 12:42 AM
Reposted by Roland Faure
Side note: you could, speaking purely theoretically, also fit every microbe onto an SD card, which is within the weight limit for a carrier pigeon. For some distances, it would be faster than the internet for transmitting sequence libraries
7/
April 9, 2025 at 9:10 PM
Reposted by Roland Faure
So glad this is finally out. The method has been instrumental in allowing us to compress the AllTheBacteria data - ~2 million bacterial genomes shrink from 3Terabytes (gzipped) to 100Gb using phylogenetic compression. Great work by @brinda.eu
April 9, 2025 at 10:27 PM
Reposted by Roland Faure
Do you (like me) create a bunch of conda environments, then later forget what they're for, when they were last updated, or which tools are in them?

If so, you might this little project: github.com/rrwick/conda...
GitHub - rrwick/condaenvlist: a simple tool for listing conda environments with descriptions
a simple tool for listing conda environments with descriptions - rrwick/condaenvlist
github.com
March 27, 2025 at 4:34 AM
So glad to have participated in #DSB2025, what a great workshop! For some mysterious reason it was the first time I attended after 3 years of sequence research. Thanks to all participants & organizers 😃
March 7, 2025 at 7:41 PM
Reposted by Roland Faure
Ragnar's made some incredible optimizations on the computation of minimizers, can't wait to see how these improvements will benefit bioinfo tools!
Nice result to end the day (night*):
After discussions with @imartayan.bsky.social, the SIMD minimizer code now also does proper canonical (revcomp) minimizers:

~1ns/bp for fwd minis
+0.4ns/bp with collect and dedup
+0.6ns/bp with canonical hashes.

Super happy how it's only 2x slower in the end!
December 13, 2024 at 3:50 PM
Reposted by Roland Faure
Amazing ideas here www.biorxiv.org/content/bior... from
@yoann.bsky.social
and collaborators.

Reorganize minimizers to allow kmers dichotomic search. That's brilliant.

#bioinformatics 🧬🖥️
December 4, 2024 at 12:10 PM
So glad to have successfully defended my Ph.D. last week 😀 Work on producing haplotype-resolved metagenomic assemblies using noisy long reads (HairSplitter) and high-fidelity long reads (Alice assembler, unpublished yet).
Thanks to my advisors Dominique Lavenier and Jean-François Flot ❤️
December 4, 2024 at 8:32 AM