Philip Bontrager
banner
pbontrager.bsky.social
Philip Bontrager
@pbontrager.bsky.social
AI researcher & engineer @Meta working on @PyTorch torchtune in NYC; interests in generative models, RL, and evolutionary strategies

💻 https://github.com/pbontrager 📝 https://tinyurl.com/philips-papers
In the Alice In Wonderland (github.com/LAION-AI/AIW) reasoning and generalization benchmark, DeepSeek R1 appears to perform much more like o1 mini than o1 -preview. (Plot from laion-ai)
January 25, 2025 at 5:25 PM
This made me look up chess capabilities 40 years ago, turns out they were already expert level then, and didn’t really become super human until 10 years after Kasparov. So 40 years is pretty good for your analogy.
December 17, 2024 at 1:24 PM
The way you can tell if an image is AI generated or not is by looking at the hands. If the hands look weird they’re probably human drawn.
December 14, 2024 at 12:13 AM
I'm still trying to figure out what's happening in their model. They state that "latent frames from the video are passed to a large transformer dynamics model, trained with a causal mask". Do they do diffusion through time or use a autoregressive model for time and diffusion per frame?
December 4, 2024 at 4:31 PM
I'm being lazy, I want to understand the algorithm without doing a deep dive into the paper. I appreciate the extra work to share it here.

Me on the other hand, I once thought that putting this in a paper was a good idea 🫥
December 3, 2024 at 5:28 PM
You got off easy 😅
November 27, 2024 at 9:47 PM
To be fair, it wasn’t really a full convo
November 22, 2024 at 10:13 PM