Stephen Hwang
stephenhwang.bsky.social
Stephen Hwang
@stephenhwang.bsky.social
Debugging wet code 💻🧬
@XDBio, JHMI; Langmead Lab
Reposted by Stephen Hwang
We ran Mumemto on 474 human assemblies from @humanpangenome.bsky.social to find syntenic regions using MUMs. Mumemto scales remarkably well to large pangenomes thanks to compressed-space algos! It took under 2 days across 7 nodes (each using ~500 GB memory).
February 26, 2025 at 6:01 PM
Reposted by Stephen Hwang
Excited to share a preprint for (w/ @benlangmead.bsky.social) our new tool, Mumemto, on biorxiv! Mumemto finds multi-MUMs across pangenomes (i.e. mummer but for pangenomes). It can rapidly visualize synteny, identify misassemblies, and accelerate core genome and multiple alignment, highlighting SVs.
Mumemto: efficient maximal matching across pangenomes
Aligning genomes into common coordinates is central to pangenome analysis and construction, but it is also computationally expensive. Multi-sequence maximal unique matches (multi-MUMs) are guideposts ...
www.biorxiv.org
January 6, 2025 at 3:27 PM
Reposted by Stephen Hwang
Very excited to see Movi (by @mohsenzakeri.bsky.social) now out in iScience: www.cell.com/iscience/ful.... Movi builds on the "move structure" pangenome index, a compressed full-text index and close cousin to r-index. Compared to r-index, the move structure is simpler and more cache-efficient.
Movi: A fast and cache-efficient full-text pangenome index
Biocomputational method; Classification of bioinformatical subject; Genomic analysis
www.cell.com
December 11, 2024 at 4:48 PM
Reposted by Stephen Hwang
1/5 We (Nate Brown, @oahmed.bsky.social, Travis Gagie, and @benlangmead.bsky.social) developed Movi, a cache-efficient full-text pangenome index.  It's the fastest full-text index for pangenomes, particularly appropriate for adaptive sampling where time budget is important.
November 7, 2023 at 6:46 PM