jvnickerson.bsky.social
@jvnickerson.bsky.social
Is the idea to then run another RL cycle on a reasoning LLM using the just-out of reach problems?
January 30, 2025 at 12:38 AM
I'm running 32B on an M4 Mac and it's pretty good.
January 29, 2025 at 2:28 AM