Yehlin Cho
yehlincho.bsky.social
Yehlin Cho
@yehlincho.bsky.social
Protein Hunter enables multimer binder design, multi-motif scaffolding, partial redesign, and nucleic acid binder design — offering a general pipeline for protein design that can be applied to any AF3-style models, existing or in development.
October 13, 2025 at 3:45 PM
Additionally, Protein Hunter supports all-atom molecular binder design. We show in silico success rates for four small molecules, where iterative cycles of Boltz2 and LigandMPNN achieve the highest AF3 success rates.
October 13, 2025 at 3:45 PM
We also demonstrate the success of the pipeline on cyclic peptides, exemplified with the MDM2 target.
Macrocyclic peptide design can be achieved through cyclic positional encodings.
October 13, 2025 at 3:45 PM
However, diffusion-based models favor α-helical topologies (reflecting training bias), reducing structural diversity. To enhance β-sheet content, we applied a negative helix bias to Pairformer pair features before diffusion, increasing sheet-rich samples.
October 13, 2025 at 3:45 PM
Repeating this process significantly improves the in silico success rates of AlphaFold3 and the designability of both unconditional and conditional (binder) design tasks.
October 13, 2025 at 3:45 PM
Protein Hunter: Starting from an all "X" sequence, we find that diffusion-based structure prediction models can hallucinate reasonable looking structures, which can be further improved through iterative sequence design and structure prediction, similar to AF2Cycler and LASErMPNN.
October 13, 2025 at 3:45 PM
And they do it remarkably well with an all-“X” sequence. ❌😮
AF3-style models treat unknown PDB residues as X tokens and explicitly handle non-canonical amino acids and ligands, enabling folding of undefined sequences while minimizing bias from amino acid specific features.
October 13, 2025 at 3:45 PM
It actually folds into a structure and binds near the target!

We found that AF3-like structure prediction models (Boltz, Chai, AF3) can hallucinate proteins within their diffusion modules.
October 13, 2025 at 3:45 PM
Have you ever wondered what AF3-like structure prediction models would produce when given a random protein sequence and a target of your choice?

Would it form a completely disordered structure that wraps around the target, or would it still fold and bind to it?
October 13, 2025 at 3:45 PM
Thrilled to announce our new preprint, “Protein Hunter: Exploiting Structure Hallucination within Diffusion for Protein Design,” in collaboration with @Griffin, @GBhardwaj8 and @sokrypton.org

🧬Code and notebooks will be released by the end of this week.
🎧Golden- Kpop Demon Hunters
October 13, 2025 at 3:45 PM
🚀 Excited to release BoltzDesign1!

✨ Now with LogMD-based trajectory visualization.
🔗 Demo: rcsb.ai/ff9c2b1ee8
Feedback & collabs welcome! 🙌

🔗: GitHub: github.com/yehlincho/Bo...
🔗: Colab: colab.research.google.com/github/yehli...
@sokrypton.org @martinpacesa.bsky.social
June 3, 2025 at 1:31 AM
5. BoltzDesign1 can be used to design sequences and structures that AlphaFold3 predicts to bind to metal ions, nucleic acids, and other biomolecules
April 8, 2025 at 12:17 PM
3. By utilizing only the Pairformer and Confidence module, our method generates highly diverse binders, with high AlphaFold3 success rates, strong cross-model and self-consistency, as demonstrated by benchmarks on four small-molecule targets from the RFDiffusionAA benchmark set.
April 8, 2025 at 12:17 PM
2. Instead of optimizing single structures, we optimize directly on the distogram, shaping the probability distributions of atomic distances. We show that the distogram effectively captures interactions between proteins and their targets, serving as a proxy for confidence scores
April 8, 2025 at 12:17 PM
1. We introduce BoltzDesign1, which inverts the Boltz-1 model—an open-source reproduction of AlphaFold3—to enable the design of protein binders for diverse molecular targets without requiring model fine-tuning.
April 8, 2025 at 12:17 PM
Excited to share our preprint “BoltzDesign1: Inverting All-Atom Structure Prediction Model for Generalized Biomolecular Binder Design” — a collaboration with
@martinpacesa.bsky.social, @Zhidian Zhang, @Bruno E. Correia, and @sokrypton.org

🧬 Code will be released in a couple weeks
April 8, 2025 at 12:17 PM