Sina Majidian
banner
sinamajidian.bsky.social
Sina Majidian
@sinamajidian.bsky.social
On the academic job market | How are species compared to one another across different genomic regions? Postdoc at Langmead Lab, Johns Hopkins | Comparative #genomics at scale | Formerly at UNIL/SIB/WUR | sinamajidian.github.io
Specificity, length and luck drive gene rankings in association studies
nature.com/articles/s41586-025-09703-7
November 9, 2025 at 12:52 PM
Next, Johanna von Wachsmann
@johannavw.bsky.social presented Gemsparcl—Rapid and consistent genome clustering for navigating bacterial diversity with millions of genomes" #GI2025
November 8, 2025 at 4:23 PM
Jim Shaw @jimshaw.bsky.social presented "High-resolution metagenome assembly for modern long reads
with myloasm"
This efficient tool uses SNPmers and leverages high quality ONT and PacBio reads for metagenome assembly.
myloasm: doi.org/10.1101/2025.09.05.674543
November 8, 2025 at 4:21 PM
Anshul Kundaje @anshulkundaje.bsky.social presented "Deep learning models of regulatory DNA—A comparison of model
design choices" #GI2025 He focused more on task-specific models & showed multi-task models lack causal interpretability.
ChromBPNet: doi.org/10.1101/2024.12.25.630221
encodeproject.org
November 8, 2025 at 4:16 PM
Today is the last day of Genome Informatics #GI2025. The first talk was presented by Genrietta Yagudayeva on "A reproducible RNA-seq pipeline for mitogenomics and barcoding phylogenetics in neglected biodiversity"
November 8, 2025 at 4:09 PM
Yana Safonova (Penn State) delivered the keynote "Enabling biomedical discoveries through immunogenomics
approaches"

Mammalian Immune Luci: doi.org/10.1093/molbev/msaf152
PatchWorkPlot: doi.org/10.1093/bioinformatics/btaf504
November 7, 2025 at 10:34 PM
Ryan Moreno from @sroyyors.bsky.social Lab presented "Integrating single-cell omics data across species using matrix factorization regularized by gene-level phylogenies"

A very cool application of gene orthology across species for single-cell expression analysis! #orthology #GI2025 #singlecell
November 7, 2025 at 10:23 PM
Li Song @mourisl.bsky.social presented "Quality control of single-cell ATAC-seq data without peak calling using Chromap"

Chromap-QC: biorxiv.org/content/10.1101/2025.07.15.664951
Chromap: nature.com/articles/s41467-021-26865-w #GI2025
November 7, 2025 at 7:18 PM
Hanchen Wang presented "Biomni—A general-purpose biomedical AI agent" #GI2025
doi.org/10.1101/2025.05.30.656746
November 7, 2025 at 6:57 PM
Matthew Nguyen delivered a great talk on "Refining Kraken2 long-read taxonomic classifications using convolutional neural networks"
November 7, 2025 at 6:41 PM
Yijie Kang (CSHL, Stony Brook) from @pkoo562.bsky.social Lab presented "Decoding the sequence basis of Pol II elongation with deep learning"
November 7, 2025 at 3:05 PM
Third day of Genome Informatics #GI2025 began with an exciting session on “AI, ML and Integrative Genomics” chaired by Irene Kaplow & Thomas Pierrot.
The first talk, by Irene Kaplow, focused on Challenges in Predicting Enhancer Activity Differences Between Species
doi.org/10.1186/s12864-022-08450-7
November 7, 2025 at 2:28 PM
Second's day concluded by fantastic talk by Cristina Martin Linares on "Minimal reconstruction of SpliceAI using distilled matryoshka sparse autoencoders"

They showed that matryoshka SAEs arxiv.org/abs/2503.17547 improves upon openSpliceAI elifesciences.org/reviewed-preprints/107454. #GI2025
November 7, 2025 at 1:08 PM
Nicola De Maio presented "Maximum likelihood phylogenetics at pandemic scales" and discussed the importance of scalable phylogenetics in genomic epidemiology. #GenomeInformatics #GI2025
MAPLE: nature.com/articles/s41588-023-01368-0
November 7, 2025 at 12:56 PM
Nicolae Sapoval @nsapoval.bsky.social presented "Theoretical and empirical performance of pseudo-likelihood- based Bayesian inference of species trees under the multispecies coalescent"
A fantastic theory talk, offering intuitive insights!
Paper: doi.org/10.1101/2025.01.28.635282
November 6, 2025 at 8:28 PM
"PhyloFisher v2—Advancing accuracy and reproducibility in deep phylogenomics" was presented by Robert E. Jones.
doi.org/10.1371/journal.pbio.3001365 & doi.org/10.1002/cpz1.969
Software: github.com/TheBrownLab/PhyloFisher
November 6, 2025 at 8:18 PM
The "Algorithmic and evolutionary biology" session is chaired by Nicola De Maio and Erin Molloy.
Erin Molloy delivered the first talk titled "Towards scalable reconstruction of species phylogenies under the network multispecies coalescent"
TreeQMC: doi.org/10.1093/sysbio/syaf009
November 6, 2025 at 8:15 PM
The keynote speaker, Marinka Zitnik @marinkazitnik.bsky.social , delivered her talk to a full house on "Empowering biomedical discovery with AI scientists"
Paper: arxiv.org/abs/2509.23426
Platform: aiscientist.tools
November 6, 2025 at 7:58 PM
The morning session concluded with a talk by Megan Le: DeKnot—Local haplotype-resolved assembly with k-syncmer-
based multiplex De Bruijn graphs
The goal is to perform local haplotype assembly by iteratively growing lists of k’ consecutive closed syncmers.
November 6, 2025 at 6:46 PM
Haonan Wu gives a talk on "A k-mer-based estimator of the substitution rate between repetitive sequences"
www.biorxiv.org/content/10.1...
This work tackles the issue of Mash which ignores repeats in the genome, providing better distance estimation #GI2025
November 6, 2025 at 4:38 PM
Mile Sikic @msikic.bsky.social presents "AI for genomes—Rethinking de novo assembly" genome.cshlp.org/content/35/4/839
They devised a bidirectional message-passing procedure in GNN for the problem of genome assembly
November 6, 2025 at 4:22 PM
Harun Mustafa presents "Efficient, accurate, SRA-scale indexing and query" at #GI2025
MetaGraph is a highly compressed representation
of all public biological sequences!
nature.com/articles/s41586-025-09603-w
Try it online: metagraph.ethz.ch/search
November 6, 2025 at 4:09 PM
Ke Chen presents "A scalable and improved heuristic for flow decomposition"
They developed a new algorithm to decompose a directed graph (generated from reads) into a minimum # weighted paths, with application to metagenome and transcriptome assembly.
November 6, 2025 at 2:46 PM
Second day of Genome Informatics #GI2025 began with the session “Genome Assembly and Sequence Algorithms" Yun William Yu presented “Average-case Analysis of Seed-Chain-Extend under Random Mutations"
genome.cshlp.org/content/33/7/1175
providing theoretical guarantees for the popular seed-chain-extend
November 6, 2025 at 2:19 PM
Nicole Brown gave a fantastic talk on Identifying introgressions across pangenomes with Panagram

It uses k-mer conservation to annotate genomic variation across hundreds of genomes, followed by normalization of k-mer profiles to identify introgression events
github.com/kjenike/pana... #GI2025
November 6, 2025 at 2:52 AM