Kuan-Hao Chao
kuanhaochao.bsky.social
Senior Deep Learning Scientist at Illumina | CS PhD candidate at @jhu.edu @jhucompsci.bsky.social

Teaching machines to learn biology 🧬💻

https://khchao.com/
Thank you so much Ben. I’m deeply grateful for your mentorship; you’ve shaped how I think and do science. I hope we get to collaborate again soon!
August 26, 2025 at 1:15 PM
It’s happening today! Thrilled to share my PhD journey with friends and colleagues—thank you all for being part of it!
August 25, 2025 at 3:06 PM
Dive into the full thread here 👇
Friends, I’m excited to release OpenSpliceAI, an open‐source, efficient, and modular framework for splice site prediction. It reimplements and extends SpliceAI (Jaganathan et al., 2019) using the modern PyTorch framework. 1/

Mihaela Pertea, @stevensalzberg.bsky.social, Anqi Liu, Alan Mao
OpenSpliceAI: An efficient, modular implementation of SpliceAI enabling easy retraining on non-human species https://www.biorxiv.org/content/10.1101/2025.03.20.644351v1
July 23, 2025 at 2:03 PM
March 25, 2025 at 1:49 PM
A huge thank you to my advisors, Mihaela Pertea and @stevensalzberg.bsky.social for their invaluable mentorship and support. Special shoutout to collaborators and friends Anqi Liu and Alan Mao at Hopkins — this project wouldn’t be possible without you! 11/
March 24, 2025 at 1:58 PM
OpenSpliceAI offers researchers a comprehensive suite of tools for studying transcript splicing—from creating training datasets and training models to predicting splice sites and assessing the impact of genetic variants.
🔗 Explore our documentation here: ccb.jhu.edu/openspliceai/ 10/
March 24, 2025 at 1:57 PM
We showed general patterns of donor and acceptor sites across hundreds of splice site motifs learned by OpenSpliceAI. Moreover, it confidently predicts cryptic splicing events—such as acceptor gain in MYBPC3 and novel exon gain in OPA1. 9/
March 24, 2025 at 1:56 PM
In silico mutagenesis (ISM) confirms that OpenSpliceAI focuses on the same key regions and patterns as SpliceAI for splice site prediction at U2SURP and DST, and that it effectively captures a splicing enhancer. We also demonstrate its ability to capture the full gene span of CFTR. 8/
March 24, 2025 at 1:56 PM
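For readers new to ISM, here is a minimal sketch of the general idea: mutate each base to every alternative and record how much a score changes. This is not OpenSpliceAI's code. The function names are hypothetical, and `toy_donor_score` is a made-up stand-in for a real model's splice site probability.

```python
def in_silico_mutagenesis(sequence, score_fn, alphabet="ACGT"):
    """Return {(position, alt_base): score(mutant) - score(reference)}."""
    ref_score = score_fn(sequence)
    effects = {}
    for pos, ref_base in enumerate(sequence):
        for alt in alphabet:
            if alt == ref_base:
                continue  # skip the reference base itself
            # Substitute a single base and re-score the mutated sequence.
            mutant = sequence[:pos] + alt + sequence[pos + 1:]
            effects[(pos, alt)] = score_fn(mutant) - ref_score
    return effects


def toy_donor_score(sequence):
    """Hypothetical scorer (assumption): 1.0 if 'GT' sits at positions 4-5, else 0.0."""
    return 1.0 if sequence[4:6] == "GT" else 0.0


# Toy input: the GT dinucleotide occupies positions 4 and 5.
effects = in_silico_mutagenesis("CAAGGTAAGT", toy_donor_score)
```

Positions whose substitutions change the score the most are the ones the scorer depends on. With a real model, `score_fn` would return the predicted donor or acceptor probability at a fixed site of interest.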
We improved model reliability by applying temperature scaling for calibration. Measured with expected calibration error and reliability diagrams, the calibrated probabilities are smoother and align more closely with the empirical distribution! 7/
March 24, 2025 at 1:55 PM
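A rough sketch of what temperature scaling and expected calibration error (ECE) compute, for anyone curious. This is an illustration under stated assumptions, not OpenSpliceAI's implementation: all function names are hypothetical, and a simple grid search stands in for whatever optimizer the real calibration step uses.

```python
import math


def softmax(logits, temperature=1.0):
    """Softmax over one row of logits after dividing by the temperature."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def mean_nll(logit_rows, labels, temperature):
    """Average negative log-likelihood of the true labels at a given temperature."""
    losses = [-math.log(softmax(row, temperature)[y])
              for row, y in zip(logit_rows, labels)]
    return sum(losses) / len(losses)


def fit_temperature(logit_rows, labels):
    """Pick the temperature in [0.5, 5.0] minimizing NLL (grid search; an assumption)."""
    grid = [0.5 + 0.05 * i for i in range(91)]
    return min(grid, key=lambda t: mean_nll(logit_rows, labels, t))


def expected_calibration_error(prob_rows, labels, n_bins=10):
    """Weighted average gap between confidence and accuracy over confidence bins."""
    bins = [[] for _ in range(n_bins)]
    for probs, y in zip(prob_rows, labels):
        conf = max(probs)
        pred = probs.index(conf)
        idx = min(int(conf * n_bins), n_bins - 1)
        bins[idx].append((conf, 1.0 if pred == y else 0.0))
    ece = 0.0
    for members in bins:
        if members:
            avg_conf = sum(c for c, _ in members) / len(members)
            accuracy = sum(a for _, a in members) / len(members)
            ece += len(members) / len(labels) * abs(avg_conf - accuracy)
    return ece
```

On held-out data you would fit one temperature, divide every logit row by it, and compare ECE before and after. A fitted temperature above 1 indicates the uncalibrated model was overconfident.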
OpenSpliceAI also supports transfer learning. In our experiments, models pre-trained on human data and then applied to a species of interest reach near-optimal performance in just one epoch, drastically reducing compute time while improving predictions for species with smaller genomes. 6/
March 24, 2025 at 1:55 PM
Comparing species-specific training versus running SpliceAI directly, our results confirm that one-size-fits-all generalization isn’t enough. OpenSpliceAI’s ability to retrain on specific species sets it apart from SpliceAI. 5/
March 24, 2025 at 1:54 PM
Our benchmarks show that OpenSpliceAI outperforms SpliceAI in elapsed time, memory usage, and GPU peak memory across various gene lengths. Thanks to dynamic PyTorch graphs, batch prediction, and optimized engineering, full-chromosome predictions are now a reality! 4/
March 24, 2025 at 1:53 PM
Built with six modular components, OpenSpliceAI lets you:

• Create species-specific datasets
• Train custom models
• Calibrate predictions
• Apply transfer learning from human models
• Predict on genes / entire chromosomes
• Assess variant impacts on cryptic splicing
3/
March 24, 2025 at 1:52 PM
We’re introducing OpenSpliceAI along with a suite of pre-trained models for Human-MANE, mouse, zebrafish, honeybee, and Arabidopsis. OpenSpliceAI provides a user‐friendly toolkit for studying transcript splicing in any species of interest! 2/

🔗 GitHub: github.com/Kuanhao-Chao...
GitHub - Kuanhao-Chao/OpenSpliceAI: 🤖 Open‑source deep-learning-based splice‑site predictor that decodes splicing patterns across species
March 24, 2025 at 1:52 PM