Pierre Peterlongo
@pierrepeterlongo.bsky.social
Inria Senior researcher.
Head of the https://team.inria.fr/genscale/ at Inria and Irisa.
Algorithmics for sequencing data analyses, genomics and metagenomics.
Head of the https://team.inria.fr/genscale/ at Inria and Irisa.
Algorithmics for sequencing data analyses, genomics and metagenomics.
Pinned
🌎👩🔬 For 15+ years biology has accumulated petabytes (million gigabytes) of🧬DNA sequencing data🧬 from the far reaches of our planet.🦠🍄🌵
Logan now democratizes efficient access to the world’s most comprehensive genetics dataset. Free and open.
doi.org/10.1101/2024...
Logan now democratizes efficient access to the world’s most comprehensive genetics dataset. Free and open.
doi.org/10.1101/2024...
❗ I clearly consider this result as THE most important result achieved over this last decade for exploiting and democratizing genomic data.
I think there will be a "before" and an "after" logan and logan-search
github.com/IndexThePlan...
logan-search.org
Have a look at this thread
I think there will be a "before" and an "after" logan and logan-search
github.com/IndexThePlan...
logan-search.org
Have a look at this thread
Reposted by Pierre Peterlongo
The Metagraph paper is out in Nature; it showed up in my feeds today! Congratulations to Mikhail Karasikov, @gxxxr.bsky.social, @akkah21.bsky.social and all of the other authors (whom I'd love to follow on Bluesky if I can find you ;P) www.nature.com/articles/s41...
Efficient and accurate search in petabase-scale sequence repositories - Nature
MetaGraph enables scalable indexing of large sets of DNA, RNA or protein sequences using annotated de Bruijn graphs.
www.nature.com
October 9, 2025 at 2:40 PM
The Metagraph paper is out in Nature; it showed up in my feeds today! Congratulations to Mikhail Karasikov, @gxxxr.bsky.social, @akkah21.bsky.social and all of the other authors (whom I'd love to follow on Bluesky if I can find you ;P) www.nature.com/articles/s41...
Reposted by Pierre Peterlongo
Preprint out for myloasm, our new nanopore / HiFi metagenome assembler!
Nanopore's getting accurate, but
1. Can this lead to better metagenome assemblies?
2. How, algorithmically, to leverage them?
with co-author Max Marin @mgmarin.bsky.social, supervised by Heng Li @lh3lh3.bsky.social
1 / N
Nanopore's getting accurate, but
1. Can this lead to better metagenome assemblies?
2. How, algorithmically, to leverage them?
with co-author Max Marin @mgmarin.bsky.social, supervised by Heng Li @lh3lh3.bsky.social
1 / N
High-resolution metagenome assembly for modern long reads with myloasm https://www.biorxiv.org/content/10.1101/2025.09.05.674543v1
September 7, 2025 at 11:35 PM
Preprint out for myloasm, our new nanopore / HiFi metagenome assembler!
Nanopore's getting accurate, but
1. Can this lead to better metagenome assemblies?
2. How, algorithmically, to leverage them?
with co-author Max Marin @mgmarin.bsky.social, supervised by Heng Li @lh3lh3.bsky.social
1 / N
Nanopore's getting accurate, but
1. Can this lead to better metagenome assemblies?
2. How, algorithmically, to leverage them?
with co-author Max Marin @mgmarin.bsky.social, supervised by Heng Li @lh3lh3.bsky.social
1 / N
❗ I clearly consider this result as THE most important result achieved over this last decade for exploiting and democratizing genomic data.
I think there will be a "before" and an "after" logan and logan-search
github.com/IndexThePlan...
logan-search.org
Have a look at this thread
I think there will be a "before" and an "after" logan and logan-search
github.com/IndexThePlan...
logan-search.org
Have a look at this thread
🌎👩🔬 For 15+ years biology has accumulated petabytes (million gigabytes) of🧬DNA sequencing data🧬 from the far reaches of our planet.🦠🍄🌵
Logan now democratizes efficient access to the world’s most comprehensive genetics dataset. Free and open.
doi.org/10.1101/2024...
Logan now democratizes efficient access to the world’s most comprehensive genetics dataset. Free and open.
doi.org/10.1101/2024...
September 4, 2025 at 12:35 PM
❗ I clearly consider this result as THE most important result achieved over this last decade for exploiting and democratizing genomic data.
I think there will be a "before" and an "after" logan and logan-search
github.com/IndexThePlan...
logan-search.org
Have a look at this thread
I think there will be a "before" and an "after" logan and logan-search
github.com/IndexThePlan...
logan-search.org
Have a look at this thread
📜 Excited to share insights from our recent paper: "Kaminari: a resource-frugal index for approximate colored k-mer queries". The study aims to efficiently identify documents containing a query string, focusing on DNA strings. www.biorxiv.org/content/10.1... 🧬 🖥️ 1/8
May 27, 2025 at 12:06 PM
📜 Excited to share insights from our recent paper: "Kaminari: a resource-frugal index for approximate colored k-mer queries". The study aims to efficiently identify documents containing a query string, focusing on DNA strings. www.biorxiv.org/content/10.1... 🧬 🖥️ 1/8
Maybe the simplest idea to decrease overestimations of a counting bloom filter. A trivial observation + 10 lines of code.
I'm surprised it has not been described before. Please comment if this is not the case.
Blog post here:
pierrepeterlongo.github.io/2025/03/17/m... 🧪🧬🖥️
I'm surprised it has not been described before. Please comment if this is not the case.
Blog post here:
pierrepeterlongo.github.io/2025/03/17/m... 🧪🧬🖥️
March 18, 2025 at 4:31 PM
Maybe the simplest idea to decrease overestimations of a counting bloom filter. A trivial observation + 10 lines of code.
I'm surprised it has not been described before. Please comment if this is not the case.
Blog post here:
pierrepeterlongo.github.io/2025/03/17/m... 🧪🧬🖥️
I'm surprised it has not been described before. Please comment if this is not the case.
Blog post here:
pierrepeterlongo.github.io/2025/03/17/m... 🧪🧬🖥️
Today I wanted to know the number of unique 27-mers in the hg38 human genome (spoiler there are 2.49 billion). I found no tool for doing this. So I wrote that github.com/pierrepeterl...
It may help.
Please use it / improve it.
🧬💻 #bioinformatics
It may help.
Please use it / improve it.
🧬💻 #bioinformatics
GitHub - pierrepeterlongo/unique_kmer_counter: Count number of unique kmers from fasta or fasta.gz files
Count number of unique kmers from fasta or fasta.gz files - pierrepeterlongo/unique_kmer_counter
github.com
January 30, 2025 at 4:37 PM
Today I wanted to know the number of unique 27-mers in the hg38 human genome (spoiler there are 2.49 billion). I found no tool for doing this. So I wrote that github.com/pierrepeterl...
It may help.
Please use it / improve it.
🧬💻 #bioinformatics
It may help.
Please use it / improve it.
🧬💻 #bioinformatics
Reposted by Pierre Peterlongo
We are back in the Town Theatre for a great lecture on Alignment, by @rayanchikhi.bsky.social! 🧬💻 #evomics2025 #genomics #bioinformatics
January 8, 2025 at 9:45 AM
We are back in the Town Theatre for a great lecture on Alignment, by @rayanchikhi.bsky.social! 🧬💻 #evomics2025 #genomics #bioinformatics
bsky.app/profile/pier...
Applications for this position are still open. If you're passionate about large-scale science, we'd love to hear from you.
🧬 & 🖥️
Applications for this position are still open. If you're passionate about large-scale science, we'd love to hear from you.
🧬 & 🖥️
🚨🚨🚨
We are hiring
🚨🚨🚨
After the creation of logan-search (see: bsky.app/profile/pier...) we propose a 2-years engineer position for continuing the development and optimizations.
With @rayanchikhi.bsky.social and @tlemane.bsky.social
Details + applications: recrutement.inria.fr/public/class...
We are hiring
🚨🚨🚨
After the creation of logan-search (see: bsky.app/profile/pier...) we propose a 2-years engineer position for continuing the development and optimizations.
With @rayanchikhi.bsky.social and @tlemane.bsky.social
Details + applications: recrutement.inria.fr/public/class...
January 8, 2025 at 11:27 AM
bsky.app/profile/pier...
Applications for this position are still open. If you're passionate about large-scale science, we'd love to hear from you.
🧬 & 🖥️
Applications for this position are still open. If you're passionate about large-scale science, we'd love to hear from you.
🧬 & 🖥️
🚨🚨🚨
We are hiring
🚨🚨🚨
After the creation of logan-search (see: bsky.app/profile/pier...) we propose a 2-years engineer position for continuing the development and optimizations.
With @rayanchikhi.bsky.social and @tlemane.bsky.social
Details + applications: recrutement.inria.fr/public/class...
We are hiring
🚨🚨🚨
After the creation of logan-search (see: bsky.app/profile/pier...) we propose a 2-years engineer position for continuing the development and optimizations.
With @rayanchikhi.bsky.social and @tlemane.bsky.social
Details + applications: recrutement.inria.fr/public/class...
December 12, 2024 at 2:30 PM
🚨🚨🚨
We are hiring
🚨🚨🚨
After the creation of logan-search (see: bsky.app/profile/pier...) we propose a 2-years engineer position for continuing the development and optimizations.
With @rayanchikhi.bsky.social and @tlemane.bsky.social
Details + applications: recrutement.inria.fr/public/class...
We are hiring
🚨🚨🚨
After the creation of logan-search (see: bsky.app/profile/pier...) we propose a 2-years engineer position for continuing the development and optimizations.
With @rayanchikhi.bsky.social and @tlemane.bsky.social
Details + applications: recrutement.inria.fr/public/class...
Reposted by Pierre Peterlongo
🚨 Call for Papers: RECOMB-seq 2025 🚨
🗓️ Dates: April 24-25, 2025
📍 Location: Seoul, South Korea
Key deadlines:
🔹 Abstract registration: Jan 24, 2025
🔹 Submission: Jan 31, 2025
More details: recomb-seq.github.io/papers/
🗓️ Dates: April 24-25, 2025
📍 Location: Seoul, South Korea
Key deadlines:
🔹 Abstract registration: Jan 24, 2025
🔹 Submission: Jan 31, 2025
More details: recomb-seq.github.io/papers/
Call for Papers
RECOMB-seq is the RECOMB Satellite Conference on Biological Sequence Analysis
recomb-seq.github.io
December 11, 2024 at 11:18 AM
🚨 Call for Papers: RECOMB-seq 2025 🚨
🗓️ Dates: April 24-25, 2025
📍 Location: Seoul, South Korea
Key deadlines:
🔹 Abstract registration: Jan 24, 2025
🔹 Submission: Jan 31, 2025
More details: recomb-seq.github.io/papers/
🗓️ Dates: April 24-25, 2025
📍 Location: Seoul, South Korea
Key deadlines:
🔹 Abstract registration: Jan 24, 2025
🔹 Submission: Jan 31, 2025
More details: recomb-seq.github.io/papers/
🗓️Tomorrow, Friday December 13, Khodor HANNOUSH, from the @genscaleteam.bsky.social team, will defend his thesis entitled “Dynamic Pan-genome Graphs”. Details by following this link:
www.irisa.fr/date/2024-12...
www.irisa.fr/date/2024-12...
Graphes dynamiques de pangénome | le site web de l'IRISA
www.irisa.fr
December 12, 2024 at 1:55 PM
🗓️Tomorrow, Friday December 13, Khodor HANNOUSH, from the @genscaleteam.bsky.social team, will defend his thesis entitled “Dynamic Pan-genome Graphs”. Details by following this link:
www.irisa.fr/date/2024-12...
www.irisa.fr/date/2024-12...
Amazing ideas here www.biorxiv.org/content/bior... from
@yoann.bsky.social
and collaborators.
Reorganize minimizers to allow kmers dichotomic search. That's brilliant.
#bioinformatics 🧬🖥️
@yoann.bsky.social
and collaborators.
Reorganize minimizers to allow kmers dichotomic search. That's brilliant.
#bioinformatics 🧬🖥️
December 4, 2024 at 12:10 PM
Amazing ideas here www.biorxiv.org/content/bior... from
@yoann.bsky.social
and collaborators.
Reorganize minimizers to allow kmers dichotomic search. That's brilliant.
#bioinformatics 🧬🖥️
@yoann.bsky.social
and collaborators.
Reorganize minimizers to allow kmers dichotomic search. That's brilliant.
#bioinformatics 🧬🖥️
Reposted by Pierre Peterlongo
I made a starter pack for algorithmic genomics. It's certainly incomplete, but already has a ton of awesome peeps. Let me know if you know people I should add (with a focus on algorithms and data structures in genomics)
go.bsky.app/TRWCnZs
go.bsky.app/TRWCnZs
November 12, 2024 at 2:03 PM
I made a starter pack for algorithmic genomics. It's certainly incomplete, but already has a ton of awesome peeps. Let me know if you know people I should add (with a focus on algorithms and data structures in genomics)
go.bsky.app/TRWCnZs
go.bsky.app/TRWCnZs
🧬🔍There are 50 petabases of freely-available DNA sequencing data. We introducing Logan Search which allows you to search for any DNA sequence in minutes, bringing Earth’s largest genomic resource to your fingertips.
🏔️ logan-search.org 🏔️
#Genomics #Bioinformatics #OpenScience
🏔️ logan-search.org 🏔️
#Genomics #Bioinformatics #OpenScience
November 11, 2024 at 7:29 PM
🧬🔍There are 50 petabases of freely-available DNA sequencing data. We introducing Logan Search which allows you to search for any DNA sequence in minutes, bringing Earth’s largest genomic resource to your fingertips.
🏔️ logan-search.org 🏔️
#Genomics #Bioinformatics #OpenScience
🏔️ logan-search.org 🏔️
#Genomics #Bioinformatics #OpenScience