Paper: arxiv.org/abs/2505.16927
We introduce Self-Taught Principle Learning (STaPLe), a new approach for LMs to generate their own constitutions, by learning the principles that are most effective to self-correct their responses.
Paper: arxiv.org/abs/2505.16927
We introduce Self-Taught Principle Learning (STaPLe), a new approach for LMs to generate their own constitutions, by learning the principles that are most effective to self-correct their responses.