Sanjeeva Reddy Dodlapati
banner
saneevareddy.bsky.social
Sanjeeva Reddy Dodlapati
@saneevareddy.bsky.social
Machine Learning Researcher, working on Genomics and Drug Discovery.
A large-scale study identified 380 regulatory SNVs linked to cancer risk, impacting mitochondrial translation, DNA repair, and Rho signaling, with CRISPR validation highlighting therapeutic targets. #CancerGenomics #PrecisionMedicine #CancerGenomics #PrecisionMedicine
www.nature.com/articles/s41...
Functional analysis of cancer-associated germline risk variants - Nature Genetics
Analysis of 4,041 single-nucleotide variants (SNVs) linked to 13 cancers performed in primary human cell types identifies 380 potentially regulatory SNVs and their putative target genes. Editing one S...
www.nature.com
February 18, 2025 at 1:18 AM
AI-powered liquid biopsies are revolutionizing cancer detection! 🧬✨ Discover how deep learning is unlocking hidden patterns in cell-free DNA (cfDNA) for earlier, more accurate, and non-invasive cancer diagnostics.

Read more 👇

sanjeevareddydodlapati.substack.com/p/revolution...
Revolutionizing Cancer Diagnostics: Deep Learning and AI in cfDNA Analysis
Unlocking the Future of Liquid Biopsy: How AI and Deep Learning Are Transforming Cancer Detection with cfDNA Analysis
sanjeevareddydodlapati.substack.com
February 13, 2025 at 6:58 PM
Reposted by Sanjeeva Reddy Dodlapati
Am I the only one feeling schadenfreude about USA's arrogance about it's AI superiority, as they got their butt kicked by DeepSeek-R1, and that the facade of needing 100K GPUs for training was an intellectually lazy and uninspiring solution that crumbled before the world's eyes? 👀
January 31, 2025 at 12:37 PM
Reposted by Sanjeeva Reddy Dodlapati
Ever wondered how the complexity of human biology—from our organs to our very DNA—unfolds at different levels? Let’s decode life together!
#ComputationalGenomics #Genomics #AIinScience #Bioinformatics #HumanBiology #Omics #MolecularBiology #ScienceCommunication
open.substack.com/pub/sanjeeva...
Unraveling Human Biology: A Journey from the Organism to the Atom—Laying the Groundwork for Digital Biology
Setting the Foundation for Computational Genomics and Beyond
open.substack.com
January 10, 2025 at 2:29 AM
Reposted by Sanjeeva Reddy Dodlapati
We've a postdoc opening for our lab at the Broad: Cambridge MA!

Must have experience in toxicology + data science

Work on the wonderful OASIS dataset we are producing... Cell Painting, transcriptomics, proteomics in various liver cell and tissue models!

broad.io/mlcbpostdoc
December 18, 2024 at 3:50 PM
The Alarming Impact of Microplastics on Human Health and the Environment 🧵
1/ Microplastics are everywhere: in our food, water, air, and even inside our bodies. But what are they, where do they come from, and how do they affect us? Let’s dive in. 🧬🌍 #Microplastics #Health #Environment
December 18, 2024 at 3:22 PM
Reposted by Sanjeeva Reddy Dodlapati
NEW PREPRINT

A detailed overview of 32 popular predictive performance metrics for prediction models

arxiv.org/abs/2412.10288
December 16, 2024 at 8:44 AM
Reposted by Sanjeeva Reddy Dodlapati
Excited to be presenting Orthrus with Ruain Shi and Keren Isaev @karini925.bsky.social today! We will be presenting our spotlight at the workshop on AI for new drug modalities #NeurIPS2024

Come chat about a new approach to mRNA representation learning!
December 15, 2024 at 5:21 PM
Reposted by Sanjeeva Reddy Dodlapati
This work finds that costly foundation models are outperformed by CNNs in genomics benchmarks, and are outperformed by *linear* autoregression in time-series benchmarks. 👀
Specialized Foundation Models Struggle to Beat Supervised Baselines
Following its success for vision and text, the "foundation model" (FM) paradigm -- pretraining large models on massive data, then fine-tuning on target tasks -- has rapidly expanded to domains in the ...
arxiv.org
December 12, 2024 at 7:26 AM
Reposted by Sanjeeva Reddy Dodlapati
I've been working to make designing regulatory DNA that exhibits desired characteristics easier for everyone. With the following series of tools, you can go from a blank slate to designed edits in ~30 minutes using only a V100. That includes file downloading and model training.
December 10, 2024 at 5:58 PM
Reposted by Sanjeeva Reddy Dodlapati
Also want to point to this other recent preprint that also shows that optimized ab-initio CNN models beat DNALMs even on the (relatively pointless) surrogate tasks used in the DNALM papers. CNNs also beat several other foundation models in other domains.

arxiv.org/abs/2411.02796

#bioMLeval
December 12, 2024 at 6:14 AM
Reposted by Sanjeeva Reddy Dodlapati
Protein and DNA Conformational Changes Contribute to Specificity of Cre Recombinase https://www.biorxiv.org/content/10.1101/2024.12.11.627928v1
December 12, 2024 at 2:45 AM
Reposted by Sanjeeva Reddy Dodlapati
Excited about L2G, led by Wenduo Cheng. We leverage LLMs to beat genomic FMs and strong supervised baselines on a wide range of benchmarks. L2G uses cross-modal transfer (rather than vanilla fine-tuning), and neural architecture search to learn a genomic-specific embedder model.
Can we bypass the resource bottleneck of pretraining genomic Foundation Models? Our work L2G repurposes language LLMs for genomics via cross-modal transfer, matching fine-tuned genomic FMs. Kudos to Wenduo & fantastic collab w/ @atalwalkar.bsky.social. L2G, language to genome; L2G, life’s too good!
December 11, 2024 at 7:36 PM
Reposted by Sanjeeva Reddy Dodlapati
I trained a sparse autoencoder on the middle layer residual stream of my genome language model and found human-interpretable latent features that consistently detect specific DNA motifs!

🧵1/8
December 12, 2024 at 2:47 AM