Leo Zang
banner
leozang.bsky.social
Leo Zang
@leozang.bsky.social
Protein Designer | Share Reading Notes (AI+Protein/RNA/DNA)
www.leozang.com
Computational protein design
- "This Primer provides an introduction to the main approaches in computational protein design, covering both physics-based and machine-learning-based tools. It aims to be accessible to biological, physical and computer scientists alike."
www.nature.com/articles/s43...
March 5, 2025 at 9:14 PM
Protein-Based Degraders: From Chemical Biology Tools to Neo-Therapeutics
- "we provide a comprehensive and critical review of studies that have used proteins and peptides to mediate the degradation and hence the functional control of otherwise challenging disease-relevant protein targets.
January 30, 2025 at 5:33 PM
Inference-Time Alignment in Diffusion Models with Reward-Guided Generation: Tutorial and Review
arxiv.org/abs/2501.09685
January 23, 2025 at 3:41 AM
Massively parallel characterization of transcriptional regulatory elements
- Develope an optimized lentiMPRA (lentiviral massively parallel reporter assay) method to test regulatory activity of >680,000 sequences across three cell types (HepG2, K562, WTC11)
Link: www.nature.com/articles/s41...
January 17, 2025 at 7:12 AM
A review of deep learning models for the prediction of chromatin interactions with DNA and epigenomic profiles | @BriefingBioinfo
Link: academic.oup.com/bib/article/...
December 27, 2024 at 5:05 AM
Concept Bottleneck Language Models For protein design
- Introduce CB-pLM (Concept Bottleneck Protein Language Models) from 24M to 3B, trained on UniRef50 and SwissProt over 718 concepts (including Cluster name, Biological process, and Biopython-derived features, etc.)
arxiv.org/abs/2411.06090
December 14, 2024 at 10:29 PM
Using artificial intelligence to document the hidden RNA virosphere
- PRIME, protein language model (same as ESM-2 650M architecture) pretrained on 96 million sequences with optimal growth temperatures (OGTs annotated by [1]) with MLM, MSE, and Correlation Loss
Link: www.science.org/doi/10.1126/...
November 27, 2024 at 10:30 PM
Getting aligned on representational alignment
- "In this Perspective, we survey the exciting recent developments in representational alignment research in the fields of cognitive science, neuroscience, and machine learning"
Link: arxiv.org/abs/2310.13018
November 27, 2024 at 9:49 PM
Discovery and significance of protein-protein interactions in health and disease | @cellpressnews.bsky.social Review
Link: www.cell.com/cell/fulltex...
November 21, 2024 at 5:03 AM
InterPLM: Discovering Interpretable Features in Protein Language Models via Sparse Autoencoders
www.biorxiv.org/content/10.1...
- Use sparse autoencoders (SAEs) to extract and analyze interpretable features from ESM-2-8M
November 19, 2024 at 1:44 AM
miRBench: A Comprehensive microRNA Binding Site Prediction Training and Benchmarking Dataset
Preprint: www.biorxiv.org/content/10.1...
GitHub:
github.com/katarinagres...
November 16, 2024 at 9:16 PM
AlphaBind, a Domain-Specific Model to Predict and Optimize Antibody-Antigen Binding Affinity
- Encode antibody and antigen with ESM2-nv (ESM2 but on NVIDIA), concatenate embeddings and feed into a lightweight transformer (4 attention heads, 7 layers) to predict binding affinity
November 16, 2024 at 9:01 PM