A fully speculative exercise going all the way to AGI, but still grounded in current research and the coming end of pretraining as we know it. And with a radically different set of premises. vintagedata.org/blog/posts/r...
Really thoughtful take from @chiphuyen.bsky.social in The Pragmatic Engineer Podcast:
Perhaps software engineering will change, like writing changed hundreds of years ago thanks to printing
Full: www.youtube.com/watch?v=98o_...
For me, there are three big stories: itcanthink.substack.com/p/2024-robot...
Separating different classes of AI agents from a long history of reinforcement learning.
Why we can be optimistic about AI agents but also extremely critical of the terrible communication around them to date.
Plus, some policy guidance.
willwhitney.com/computing-in...
tomhipwell.co/blog/sora/
This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement learning and sequential decision making, covering value-based RL, policy-gradient methods, model-based methods, and various other topics.
arxiv.org/abs/2412.05265
The latest 1206 release is #1 in ALL categories. You can try it here: aistudio.google.com/app/prompts/...
2B: huggingface.co/Qwen/Qwen2-V...
7B: huggingface.co/Qwen/Qwen2-V...
72B: huggingface.co/Qwen/Qwen2-V...
🐠 to interact with text at multiple levels of abstraction
🐡 inspired by the fish-eye lens