Oussama Zekri
@ozekri.bsky.social
ENS Saclay maths dept. + UW Research Intern.

Website : https://oussamazekri.fr
Blog : https://logb-research.github.io/
Pinned
🚀 Did you know you can use the in-context learning abilities of an LLM to estimate the transition probabilities of a Markov chain?

The results are pretty exciting! 😄
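A minimal sketch of the idea, as an illustration rather than the paper's exact pipeline: feed a comma-separated trajectory to a causal LM, read the next-token logits at the last position, and renormalize over the tokens that encode the states. The model choice ("gpt2"), the prompt format, and the single-token-per-state assumption are all placeholders.

```python
# Sketch: estimate a Markov chain's transition row from an LLM's next-token
# distribution after an in-context trajectory (assumes each state is one token).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder model; any causal LM works in principle
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).eval()

states = ["0", "1", "2"]
state_ids = [tok.encode(s)[0] for s in states]  # one token per state (assumption)

def estimated_transition_row(trajectory, current_state):
    """P(next state | current state), read off the LLM's next-token logits."""
    prompt = ",".join(trajectory + [current_state]) + ","
    ids = tok(prompt, return_tensors="pt").input_ids
    with torch.no_grad():
        logits = model(ids).logits[0, -1]            # logits for the next token
    probs = torch.softmax(logits[state_ids], dim=0)  # renormalize over state tokens
    return {s: p.item() for s, p in zip(states, probs)}

# Example: a trajectory sampled from some unknown chain
traj = ["0", "1", "1", "2", "0", "1", "2", "2", "0"]
print(estimated_transition_row(traj, "1"))
```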
Reposted by Oussama Zekri
🚨 New paper on regression and classification!

Adding to the discussion on using least-squares or cross-entropy, i.e. regression or classification formulations of supervised problems!

A thread on how to bridge these problems:
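To make the bridge concrete, here is a toy sketch (my own illustration, not necessarily the paper's construction): discretize a scalar regression target into bins, train with cross-entropy over the bins, and recover the regression estimate as the expected bin centre under the predicted distribution.

```python
# Toy sketch: a regression problem recast as classification over bins.
import torch
import torch.nn as nn

n_bins = 32
edges = torch.linspace(-1.0, 1.0, n_bins + 1)
centres = 0.5 * (edges[:-1] + edges[1:])

model = nn.Sequential(nn.Linear(1, 64), nn.ReLU(), nn.Linear(64, n_bins))
opt = torch.optim.Adam(model.parameters(), lr=1e-2)
ce = nn.CrossEntropyLoss()

x = torch.rand(512, 1) * 2 - 1
y = torch.sin(3 * x).squeeze(1)           # targets in [-1, 1]
labels = torch.bucketize(y, edges[1:-1])  # bin index of each target

for _ in range(200):
    loss = ce(model(x), labels)           # cross-entropy, not least-squares
    opt.zero_grad()
    loss.backward()
    opt.step()

probs = torch.softmax(model(x), dim=-1)
y_hat = probs @ centres                   # regression estimate = E[bin centre]
print(float((y_hat - y).abs().mean()))
```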
February 10, 2025 at 12:00 PM
🚀 Policy gradient methods like DeepSeek’s GRPO are great for finetuning LLMs via RLHF.

But what happens when we swap autoregressive generation for discrete diffusion, a rising architecture promising faster & more controllable LLMs?

Introducing SEPO !

📑 arxiv.org/pdf/2502.01384

🧵👇
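For intuition, here is a minimal GRPO-style policy-gradient step written to be generator-agnostic; it is a sketch, not SEPO itself, and `sample_sequences`, `sequence_log_prob`, and `reward_fn` are hypothetical placeholders. For an autoregressive LLM the sequence log-probability is a sum of next-token log-probs; for a discrete diffusion model it would instead come from the model's (approximate) sequence likelihood.

```python
# Sketch of a group-relative policy-gradient update (GRPO-style), generator-agnostic.
import torch

def grpo_step(model, optimizer, prompts, sample_sequences, sequence_log_prob,
              reward_fn, group_size=8):
    optimizer.zero_grad()
    for prompt in prompts:
        # Sample a group of completions and score them.
        seqs = [sample_sequences(model, prompt) for _ in range(group_size)]
        rewards = torch.tensor([reward_fn(prompt, s) for s in seqs])
        # Group-relative advantage: centre (and scale) rewards within the group.
        adv = (rewards - rewards.mean()) / (rewards.std() + 1e-6)
        for seq, a in zip(seqs, adv):
            logp = sequence_log_prob(model, prompt, seq)  # differentiable w.r.t. model
            loss = -(a.detach() * logp) / (group_size * len(prompts))
            loss.backward()                               # accumulate gradients
    optimizer.step()
```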
February 4, 2025 at 3:42 PM
Beautiful work!!
🚀Proud to share our work on the training dynamics in Transformers with Wassim Bouaziz & @viviencabannes.bsky.social @Inria @MetaAI

📝Easing Optimization Paths arxiv.org/pdf/2501.02362 (accepted @ICASSP 2025 🥳)

📝Clustering Heads 🔥 https://arxiv.org/pdf/2410.24050

🖥️ github.com/facebookrese...

1/🧵
February 4, 2025 at 11:59 AM
Reposted by Oussama Zekri
🚀Proud to share our work on the training dynamics in Transformers with Wassim Bouaziz & @viviencabannes.bsky.social @Inria @MetaAI

📝Easing Optimization Paths arxiv.org/pdf/2501.02362 (accepted @ICASSP 2025 🥳)

📝Clustering Heads 🔥 https://arxiv.org/pdf/2410.24050

🖥️ github.com/facebookrese...

1/🧵
February 4, 2025 at 11:56 AM
Reposted by Oussama Zekri
Happy to see Disentangled In-Context Learning accepted at ICLR 2025 🥳

Make zero-shot reinforcement learning with LLMs go brrr 🚀

🖥️ github.com/abenechehab/...

📜 arxiv.org/pdf/2410.11711

Congrats to Abdelhakim (abenechehab.github.io) for leading it, it's always fun working with nice and strong people 🤗
GitHub - abenechehab/dicl: Official implementation of DICL (Disentangled In-Context Learning), featured in the paper Zero-shot Model-based Reinforcement Learning using Large Language Models.
github.com
January 25, 2025 at 1:10 PM
Reposted by Oussama Zekri
For the French-speaking audience, S. Mallat's courses at the Collège de France on data generation in AI by transport and denoising have just started. I highly recommend them, as I've learned a lot from the overall vision of his courses.

Recordings are also available: www.youtube.com/watch?v=5zFh...
Génération de données en IA par transport et débruitage (1) - Stéphane Mallat (2024-2025)
YouTube video by Mathématiques et informatique - Collège de France
www.youtube.com
January 20, 2025 at 5:49 PM
Reposted by Oussama Zekri
Speculative sampling accelerates inference in LLMs by drafting future tokens which are verified in parallel. With @vdebortoli.bsky.social, A. Galashov & @arthurgretton.bsky.social, we extend this approach to (continuous-space) diffusion models: arxiv.org/abs/2501.05370
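For reference, a minimal sketch of the classic token-level accept/reject rule that speculative sampling uses for autoregressive models (the continuous-space diffusion extension in the paper is not reproduced here); `draft_probs` and `target_probs` are hypothetical callables returning next-token distributions.

```python
# Sketch: draft k tokens with a cheap model, verify them with the target model.
import torch

def speculative_step(prefix, draft_probs, target_probs, k=4):
    drafted, ctx = [], list(prefix)
    for _ in range(k):
        q = draft_probs(ctx)                    # draft model's next-token distribution
        t = torch.multinomial(q, 1).item()
        drafted.append((t, q))
        ctx.append(t)
    out = list(prefix)
    for t, q in drafted:
        p = target_probs(out)                   # target model's distribution at this position
        if torch.rand(1).item() < min(1.0, (p[t] / q[t]).item()):
            out.append(t)                       # accept the drafted token
        else:
            resid = torch.clamp(p - q, min=0)   # reject: resample from the residual
            out.append(torch.multinomial(resid / resid.sum(), 1).item())
            break
    # (The usual bonus token sampled when all drafts are accepted is omitted for brevity.)
    return out
```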
January 10, 2025 at 4:30 PM
Reposted by Oussama Zekri
The idea that one needs to know a lot of advanced math to start doing research in ML seems so wrong to me. Instead of reading books for weeks and forgetting most of them a year later, I think it's much better to try to do things, see what knowledge gaps prevent you from doing them, and only then read.
December 6, 2024 at 2:26 PM
Reposted by Oussama Zekri
🚨So, you want to predict your model's performance at test time?🚨

💡Our NeurIPS 2024 paper proposes 𝐌𝐚𝐍𝐨, a training-free and SOTA approach!

📑 arxiv.org/pdf/2405.18979
🖥️ https://github.com/Renchunzi-Xie/MaNo

1/🧵(A surprise at the end!)
December 3, 2024 at 4:58 PM
Reposted by Oussama Zekri
I wrote a summary of the main ingredients of the neat proof by Hugo Lavenant that diffusion models do not generally define optimal transport. github.com/mathematical...
November 30, 2024 at 8:35 AM
🚀 Did you know you can use the in-context learning abilities of an LLM to estimate the transition probabilities of a Markov chain?

The results are pretty exciting! 😄
November 26, 2024 at 2:52 PM