Lei Wang
wanglei-hust.bsky.social
Lei Wang
@wanglei-hust.bsky.social
Postdoctoral fellow, Huazhong University of Science and Technology #bioinformatics
MetaFold-RNA: Accurate prediction of RNA secondary structure using a meta-learning-guided deep network www.biorxiv.org/content/10.1...
MetaFold-RNA: Accurate prediction of RNA secondary structure using a meta-learning-guided deep network
Accurately predicting the secondary structure of ribonucleic acid (RNA) is a critical step toward deciphering its biological roles and engineering novel RNA-based technologies. However, achieving high...
www.biorxiv.org
September 22, 2025 at 1:11 AM
Reposted by Lei Wang
Excited to re-share work from
@yoakiyama.bsky.social and Zhidian Zhang on MSA pairformer. (1/4)
August 5, 2025 at 7:39 AM
Reposted by Lei Wang
Our models, code, and data are openly available on Github, Zenodo, and Huggingface.

github.com/microsoft/da...
huggingface.co/collections/...
zenodo.org/records/1526...
GitHub - microsoft/dayhoff
Contribute to microsoft/dayhoff development by creating an account on GitHub.
github.com
July 25, 2025 at 10:05 PM
Help needed: submitted a manuscript to Nature Methods, and although I received a notification that it was rejected, I never received a formal decision letter. Has anyone experienced something similar? Any suggestions on how to phrase it? 🙏

#ManuscriptSubmission #NatureMethods
July 14, 2025 at 3:24 PM
Reposted by Lei Wang
SSAlign: Ultrafast and Sensitive Protein Structure Search at Scale https://www.biorxiv.org/content/10.1101/2025.07.03.662911v1
July 6, 2025 at 4:47 AM
Reposted by Lei Wang
"SSAlign, a protein structure retrieval tool that leverages protein language models to jointly encode sequence and structural information...On large-scale datasets such as AFDB50, SSAlign outpaces Foldseek by two to three orders of magnitude in search speed"

www.biorxiv.org/content/10.1...
SSAlign: Ultrafast and Sensitive Protein Structure Search at Scale
The advent of highly accurate structure prediction techniques such as AlphaFold3 is driving an unprecedented expansion of protein structure databases. This rapid growth creates an urgent demand for novel search tools, as even the current fastest available methods like Foldseek face significant limitations in sensitivity and scalability when confronted with these massive repositories. To meet this challenge, we have developed SSAlign, a protein structure retrieval tool that leverages protein language models to jointly encode sequence and structural information, and adopts a two-stage alignment strategy optimized with multi-GPU and multi-process parallelization. On large-scale datasets such as AFDB50, SSAlign outpaces Foldseek by two to three orders of magnitude in search speed, offering unmatched scalability for high-throughput structural analysis. Compared to Foldseek, SSAlign retrieves substantially more high-quality matches on Swiss-Prot and achieves marked performance improvements on SCOPe40, with relative AUC increases of +20.2% at the family level and +33.3% at the superfamily level, demonstrating significantly enhanced sensitivity and recall. In sum, SSAlign achieves TM-align-comparable accuracy with Foldseek-surpassing speed and coverage, offering an efficient, sensitive, and scalable solution for large-scale structural biology and structure-based drug discovery. ### Competing Interest Statement The authors have declared no competing interest. National Natural Science Foundation of China, 62172172 Hubei Provincial Natural Science Foundation of China, 2025AFB159 The Postdoctoral Fellowship Program of CPSF, GZC20240545
www.biorxiv.org
July 6, 2025 at 3:34 PM