Azure
realazure.bsky.social
It is possible that these similarities arose because other models were fine-tuned or primed on R1 thinking traces before reinforcement learning.

Repository here: github.com/cpldcpu/llmb...
April 5, 2025 at 8:14 AM
That is a very neat idea to extend the latent states available for "reasoning". It feels a bit unnatural to force models to output text for reasoning steps when some intermediate concepts may not be easily expressed in written language.
December 11, 2024 at 9:00 AM