My favorite RL run yet over 7+ years of doing RL.
The biggest fully open RL run ever?
Gold stars on downstream evals is our original release, this latest one is the final checkpoint on the plot.
My favorite RL run yet over 7+ years of doing RL.
The biggest fully open RL run ever?
Gold stars on downstream evals is our original release, this latest one is the final checkpoint on the plot.
at $2/H100 hour, Olmo 3 start to end would cost $2.75M
allenai.org/papers/olmo3
at $2/H100 hour, Olmo 3 start to end would cost $2.75M
allenai.org/papers/olmo3
iOS: apps.apple.com/us/app/doki-...
Android: play.google.com/store/apps/d...
iOS: apps.apple.com/us/app/doki-...
Android: play.google.com/store/apps/d...
Please mass report this Instagram post, this person has fed my art through that garbage sora Al
www.instagram.com/p/DP_zCx6DjRH/
[ #art | #furryfandom | #furry | #artist | #artwork | #furryfyp ]
Please mass report this Instagram post, this person has fed my art through that garbage sora Al
www.instagram.com/p/DP_zCx6DjRH/
[ #art | #furryfandom | #furry | #artist | #artwork | #furryfyp ]
knowledge is, quite literally, power
I'm sat here imagining how my clients would feel if I added 30% to each invoice, because I "really got into gambling".
I'm sat here imagining how my clients would feel if I added 30% to each invoice, because I "really got into gambling".
Best fully open 32B reasoning model & best 32B base model. 🧵
Best fully open 32B reasoning model & best 32B base model. 🧵
Why does some grind or hard challenge feel joyful in game A, but a chore in game B? You enjoy *existing* in game A. The "thing to do" serves that joy.
Pet the dog. Enjoy being
Why does some grind or hard challenge feel joyful in game A, but a chore in game B? You enjoy *existing* in game A. The "thing to do" serves that joy.
Pet the dog. Enjoy being