dippedrusk.com/posts/2024-0...
He'll be talking about "Dream Machine: Emergent Capabilities from Video Foundation Models".
Live stream: youtu.be/oilWwsXZamA
7pm GMT+1 / 10am PST (Mon Dec 2nd)
M. Elsayed, G. Vasan, A. R. Mahmood, is one of those papers I wish I had written 😅
This paper seems to allow us to do RL with NNs as it should have always been done. Everyone should read it!
arxiv.org/abs/2410.14606
As you react/respond to the author rebuttal, can you please articulate the answers to these questions in 1-2 sentences each?
1. Why not a lower score
2. Why not a higher score
This significantly helps bring everyone (authors/reviewers/AC/SAC) on the same page.
Link: blog.neurips.cc/2024/11/27/a...
Generative Adversarial Nets
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, Yoshua Bengio
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever, Oriol Vinyals, Quoc V. Le
We are releasing SmolVLM: a new 2B small vision language model made for on-device use, fine-tunable on a consumer GPU, and immensely memory efficient 🤠
We release three checkpoints under Apache 2.0: SmolVLM-Instruct, SmolVLM-Synthetic and SmolVLM-Base huggingface.co/collections/...
Please take a moment to read the authors' rebuttal and the other reviews, and ask clarifying questions or request further evidence that is still missing.
Many (junior) authors have put a ton of effort into this and may get discouraged by lack of engagement!
1. Self-Play Preference Optimization (SPO).
2. Direct Nash Optimization (DNO).
🧵 1/3.
The last was a position paper on RLHF/alignment.
This week I will share papers (in pairs) on the topic of "game theory or social choice meets alignment/RLHF".
🧵 1/3.