Zhongqi (Nick) Yue, a great post-doc in my lab, has led the development of EARL—a new reinforcement learning framework for LLMs to interact with external environments, greatly improving over text-only interaction in reasoning tasks.