Keshav Ramji
keshavramji.bsky.social
Post-training Alignment at IBM Research AI | Prev: Penn CS + Wharton
If these topics excite you, reach out about joining us next summer!
(repost welcome) The Generative Model Alignment team at IBM Research is looking for interns for next summer! Two candidates for two topics:

🍰Reinforcement Learning environments for LLMs

🐎Speculative and non-autoregressive generation for LLMs

interested/curious? DM or email ramon.astudillo@ibm.com
October 7, 2025 at 11:04 PM
Excited to share our new paper on language model self-improvement!

Paper: arxiv.org/abs/2505.16927

We introduce Self-Taught Principle Learning (STaPLe), a new approach for LMs to generate their own constitutions by learning the principles that are most effective for self-correcting their responses.
May 23, 2025 at 9:33 PM
I'm at #ICLR2025 🇸🇬 and will be presenting Conformal Language Model Reasoning with Coherent Factuality (arXiv to come soon) this afternoon (4/24, poster session 2)! This work is with my amazing collaborators Max Rubin-Toles, Maya Gambhir, @aaroth.bsky.social, and @surbhigoel.bsky.social!
April 23, 2025 at 11:52 PM
Reposted by Keshav Ramji
Announcement #1: our call for papers is up! 🎉
colmweb.org/cfp.html
And excited to announce the COLM 2025 program chairs @yoavartzi.com @eunsol.bsky.social @ranjaykrishna.bsky.social and @adtraghunathan.bsky.social
December 17, 2024 at 3:48 PM
I'll be at NeurIPS tomorrow and Saturday 🇨🇦! DM me if you're working on alignment, reasoning, data-centric methods, or uncertainty quantification and would like to chat!
December 12, 2024 at 10:28 PM