maxsimchowitz.bsky.social
@maxsimchowitz.bsky.social
Can a language model improve itself without an external verifier? We pose self-improvement as a computational challenge and show how self-training might surmount it. Joint work with @djfoster.bsky.social and MSR.

Self-Improvement in Language Models: The Sharpening Mechanism
arxiv.org/abs/2412.01951
Recent work in language modeling has raised the possibility of self-improvement, where a language model evaluates and refines its own generations to achieve higher performance without external feedba...
December 14, 2024 at 5:48 PM