🍰Reinforcement Learning environments for LLMs
🐎Speculative and non-auto regressive generation for LLMs
interested/curious? DM or email ramon.astudillo@ibm.com
Paper: arxiv.org/abs/2505.16927
We introduce Self-Taught Principle Learning (STaPLe), a new approach for LMs to generate their own constitutions, by learning the principles that are most effective to self-correct their responses.
Paper: arxiv.org/abs/2505.16927
We introduce Self-Taught Principle Learning (STaPLe), a new approach for LMs to generate their own constitutions, by learning the principles that are most effective to self-correct their responses.
colmweb.org/cfp.html
And excited to announce the COLM 2025 program chairs @yoavartzi.com @eunsol.bsky.social @ranjaykrishna.bsky.social and @adtraghunathan.bsky.social
colmweb.org/cfp.html
And excited to announce the COLM 2025 program chairs @yoavartzi.com @eunsol.bsky.social @ranjaykrishna.bsky.social and @adtraghunathan.bsky.social