Lightnews — Scholar-powered news

Azure

@realazure.bsky.social

8 followers 140 following 4 posts

Posts Replies Media Videos

Azure

@realazure.bsky.social

It is possible that these similarities were caused by other models being fine-tuned or primed on R1 thinking traces, before reinforcement learning.

Repository here: github.com/cpldcpu/llmb...

github.com

April 5, 2025 at 8:14 AM

Azure

@realazure.bsky.social

That is a very neat idea to extend the latent states available for "reasoning". It feels a bit unnatural to force the models to output text for reasoning steps, even if some intermediate concepts can maybe not be easily expressed in written language.

December 11, 2024 at 9:00 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news