Lightnews — Scholar-powered news

Harris Chan

@harrischan.bsky.social

61 followers 18 following 4 posts

Research Scientist at @GoogleDeepMind, ML PhD @UofT/@VectorInst. EngSci Grad. Former Canadian Rubik's Cube Champion

Posts Replies Media Videos

Harris Chan

@harrischan.bsky.social

Here's my attempt at visualizing the training pipeline for DeepSeek-R1(-Zero) and the distillation to smaller models.

Note they retrain DeepSeek-V3-Base with the new 800k curated data instead of continuing to finetune the checkpoint from the first round of cold-start SFT + RL

January 21, 2025 at 1:11 AM

Harris Chan

@harrischan.bsky.social

If you can imagine it, you can play it in Genie 2 🧞

Our foundation world model is capable of generating interactive worlds controllable with keyboard/mouse actions, starting from a single prompt image

So proud to have been part of this work led by @jparkerholder.bsky.social and @rockt.ai 🙏

Diagram of Genie 2, an autoregressive latent diffusion model trained on large video dataset. The images are encoded into the latent space and passed to a causal transformer dynamics model. The decoder renders the images from the predicted latent codes.

December 5, 2024 at 3:24 AM

Harris Chan

@harrischan.bsky.social

LMs see, can LMs do?

LMAct benchmarks current SOTA foundation models' ability to act in text/visual environments using text as low-level actions in many domains using in-context expert (multimodal) demonstrations. We're excited to see how this benchmark drives further progress!

anianruoss.bsky.social @anianruoss.bsky.social · Dec 3

Ever wonder how well frontier models (Claude 3.5 Sonnet, Gemini 1.5 Flash & Pro, GPT-4o, o1-mini & o1-preview) play Atari, chess, or tic-tac-toe?

We present LMAct, an in-context imitation learning benchmark with long multimodal demonstrations (arxiv.org/abs/2412.01441).

🧵 1/N

December 5, 2024 at 3:07 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news