Andrew Green
banner
afg781.bsky.social
Andrew Green
@afg781.bsky.social
Dad, Nerd, Dog-haver, Brass Band-o

Using LLMs and AI to supercharge curation in ncRNA for rnacentral.bsky.social @ https://bsky.app/profile/embl.org , occasionally succeeding

Owns too many Raspberry Pis
Reposted by Andrew Green
🎉 RNAcentral Release 26 is here! This release introduces our biggest structural change yet: gene-level entries for ncRNAs across 204 organisms.
For the first time, you can explore RNA data at the gene level, not just individual sequences.
🧵👇
October 8, 2025 at 10:09 AM
I'll be presenting the @rnacentral.bsky.social & @rfamdb.bsky.social poster "Integrating the RNA Universe: Advances and Future Directions in RNAcentral and Rfam Resources" today at #ismbeccb2025! Come and say hi at board C-235 if you'd like to chat about our current status and future plans! 🧪
July 23, 2025 at 8:09 AM
Reposted by Andrew Green
If you're a #lifescientist looking to develop your skills in #datascience, including using @python.org and the applications of #AI and #machinelearning then this course is for you!

Applications are closing soon - you have until 2 March: www.ebi.ac.uk/training/eve...

🧬🖥️🧪 #GeneSky
February 18, 2025 at 9:16 AM
This is my first first-author publication while at @ebi.embl.org! Its a good one, using LLMs to do some literature curation in non-coding RNAs. We've got big plans to do even more cool stuff with LLMs in the near future!
1/12 Excited to share our new paper in DATABASE on LitSumm - our system that uses large language models to automatically generate high-quality literature summaries for non-coding RNAs! 🧬📚
February 7, 2025 at 2:41 PM
Gratifying that I'm still better at prompt engineering (for given values of better) than Claude
January 21, 2025 at 8:46 PM
First impressions of the 2nd section test piece for the brass band championship regionals this year: its a banger! That last section is great!

I'm sure 3 more months of rehearsal on it will knock that attitude out of me though
December 4, 2024 at 10:57 AM
Fiddling about with LLMs and their weirdness... Why do loads of tokenizers have one single token for 34 consecutive spaces?

Go here: huggingface.co/spaces/Xenov... and try pasting in " " (without the "). It's token ID 9898 for GPT4 and Llama3
November 28, 2024 at 10:14 PM