Lightnews — Scholar-powered news

Jouni Sirén

@jltsiren.bsky.social

VG will soon start adding headers to the GAF files it generates. The specifics are still uncertain, but if you maintain a GAF parser, it may be a good idea to skip lines starting with "@". Here is a draft specification for the vg flavor of GAF.

github.com

October 31, 2025 at 3:27 AM

Reposted by Jouni Sirén

Mohsen Zakeri

@mohsenzakeri.bsky.social

1/6 Movi 2 is here: faster and more space-efficient for pangenome queries. Its fastest mode uses half the memory of Movi 1 while running ~30% faster. github.com/mohsenzakeri...

GitHub - mohsenzakeri/Movi: Fast, Cache-Efficient, and Scalable Queries on Pangenomes

Fast, Cache-Efficient, and Scalable Queries on Pangenomes - mohsenzakeri/Movi

github.com

October 21, 2025 at 8:00 PM

Reposted by Jouni Sirén

Rob Patro

@robp.bsky.social

For the weekend crowd. I'm hiring a postdoc! If you're interested in algorithms, data structures and high-dimensional inference, and if you want to invent new methods for genomics and implement them in high-performance, robust and easy-to-use software, do I have a lab for you; ours!

Rob Patro @robp.bsky.social · Oct 10

Hi bioinformatics, genomics and CS friends! Please help me spread the word. I'm hiring a postdoc! Come work on cutting edge method development in algorithmic genomics with me and my group at @umdscience.bsky.social! 🖥️🧬

Rob Patro @robp.bsky.social · Oct 8

And it's posted! If you're interested and eligible, please consider applying through the UMD portal: umd.wd1.myworkdayjobs.com/en-US/UMCP/j....

If you're a PI working in algorithmic genomics (& you can recommend my lab to your top graduating students ;P), please let them know!

October 11, 2025 at 1:10 PM

Reposted by Jouni Sirén

Xian Chang

@xian-chang.bsky.social

🦒Long read giraffe is out!🦒
Mapping long reads to pangenome graphs is ~10x faster than with GraphAligner, with veeery slightly better mapping accuracy, short variant calling, and SV genotyping than GraphAligner or Minimap2

bioRxiv Bioinfo @biorxiv-bioinfo.bsky.social · Oct 1

Rapid, accurate long- and short-read mapping to large pangenome graphs with vg Giraffe https://www.biorxiv.org/content/10.1101/2025.09.29.678807v1

October 2, 2025 at 6:28 AM

Reposted by Jouni Sirén

Giulio Ermanno Pibiri

@jermp.bsky.social

We are glad to announce that the next workshop “Data Structures in Bioinformatics” (DSB 2026) will take place in Venice, Italy, on *February 18-19*, 2026. dsb-meeting.github.io/DSB2026/ Book the dates! #DSB26

DSB 2026 Venice - February 18-19

Workshop Data Structures in Bioinformatics

dsb-meeting.github.io

September 1, 2025 at 6:10 PM

Jouni Sirén

@jltsiren.bsky.social

GBZ-base has been a side project for me for a couple of years. It's basically a GBZ graph stored in SQLite instead of a custom file format. You can convert a GBZ graph to GBZ-base quickly and then extract subgraphs around nodes / reference positions on a laptop. 1/n

GitHub - jltsiren/gbz-base: Prototype for an immutable pangenome graph in SQLite

Prototype for an immutable pangenome graph in SQLite - jltsiren/gbz-base

github.com

August 28, 2025 at 12:49 AM

Reposted by Jouni Sirén

Rob Patro

@robp.bsky.social

Last talk of the day (before posters) "Lossless Pangenome Indexing Using Tag Arrays" presented by Parsa Eskandar! #WABI25

August 20, 2025 at 8:00 PM

Jouni Sirén

@jltsiren.bsky.social

There was a workshop on 25 years of the FM-index and the CSA after SEA. I would have liked to attend, but I had other commitments. The invited speakers were Giovanni Manzini and Roberto Grossi, as the other purpose of the workshop was to present them Festschrifts for their 60th birthdays. 1/6

SEA 2025

regindex.github.io

August 8, 2025 at 9:49 AM

Jouni Sirén

@jltsiren.bsky.social

A new preprint on indexing pangenome graphs using an FM-index of the haplotypes and a tag array. Joint work with Parsa Eskandar and @benedictpaten.bsky.social.

Lossless Pangenome Indexing Using Tag Arrays

Pangenome graphs represent the genomic variation by encoding multiple haplotypes within a unified graph structure. However, efficient and lossless indexing of such structures remains challenging due t...

www.biorxiv.org

May 15, 2025 at 8:22 PM

Jouni Sirén

@jltsiren.bsky.social

We use personalized references with our Giraffe aligner. Each chromosome is partitioned into a sequence of blocks. We sample the most relevant haplotypes in each block using kmer counts. Mapping to this personalized reference improves variant calling accuracy. www.nature.com/articles/s41...

Personalized pangenome references - Nature Methods

This work introduces a k-mer-based approach to customizing a pangenome reference, making it more relevant to a new sample of interest. This method enhances the accuracy of genotyping small variants an...

www.nature.com

March 4, 2025 at 11:00 PM

Jouni Sirén

@jltsiren.bsky.social

Coming up soon in vg: faster GAF sorting. The old algorithm was spending too much time parsing and serializing alignments. The new algorithm just deals with blobs and integer keys. With that and some algorithmic improvements, you can now expect to sort 30x short reads in 15-20 minutes on a laptop.

January 25, 2025 at 3:05 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news