PhD Candidate 🤘@UTAustin | previously @IBMResearch @sjtu1896 | NLP for social good
🚀TL;DR: We introduce Situated-PRInciples (SPRI), a framework that automatically generates input-specific principles to align responses — with minimal human effort.
🧵
🚀TL;DR: We introduce Situated-PRInciples (SPRI), a framework that automatically generates input-specific principles to align responses — with minimal human effort.
🧵
This work was done during my internship at IBM Research, and it wouldn’t have been possible without a top-notch team and my amazing advisor 👏
This work was done during my internship at IBM Research, and it wouldn’t have been possible without a top-notch team and my amazing advisor 👏
SPRI turns out to work great for tasks that require complex principles, showcasing on-par performance as expert-guided methods.
SPRI turns out to work great for tasks that require complex principles, showcasing on-par performance as expert-guided methods.
In each stage, a base model and a critic model are used to create principles and responses from scratch through critique-refine.
In each stage, a base model and a critic model are used to create principles and responses from scratch through critique-refine.
Can we guide responses with context-situated principles instead?
Introducing SPRI, a system that produces principles tailored to each query, with minimal to no human effort.
arxiv.org/pdf/2502.03397
Can we guide responses with context-situated principles instead?
Introducing SPRI, a system that produces principles tailored to each query, with minimal to no human effort.
arxiv.org/pdf/2502.03397