Lightnews — Scholar-powered news

Eran Malach

@emalach.bsky.social

500 followers 83 following 2 posts

Research Fellow @ Kempner Institute, Harvard University
Theory of Deep Learning / Learning of Deep Theory

Reposted by Eran Malach

David Alvarez-Melis

@dmelis.bsky.social

In our newest work (led by the amazing @sunnytqin.bsky.social, w/ @emalach.bsky.social, Samy Jelassi), we investigate a core question for LLMs: "𝑡𝑜 𝑏𝑎𝑐𝑘𝑡𝑟𝑎𝑐𝑘 𝑜𝑟 𝑛𝑜𝑡 𝑡𝑜 𝑏𝑎𝑐𝑘𝑡𝑟𝑎𝑐𝑘" in two prototypical logic-heavy puzzles: CountDown and Sudoku.

April 11, 2025 at 4:29 PM

Eran Malach

@emalach.bsky.social

Will be presenting this work at #NeurIPS2024, today 11am, poster #2311. Come visit us!

Naomi Saphra @nsaphra.bsky.social · Jun 18

Modern generative models are trained to imitate human experts, but can they actually beat those experts? Our new paper uses imitative chess agents to explore when a model can "transcend" its training distribution and outperform every human it's trained on. arxiv.org/abs/2406.11741

December 12, 2024 at 4:45 PM

Eran Malach

@emalach.bsky.social

Heading to NeurIPS tomorrow ✈️
Will be presenting a few papers during the week. Ping me if you want to chat!

December 9, 2024 at 2:55 PM

Reposted by Eran Malach

Ben Edelman

@benedelman.bsky.social

I defended my PhD dissertation back in May. I didn't have time to share it widely then (newborn baby), but I think some of you might enjoy it, especially the opening chapters: benjaminedelman.com/assets/disse...

December 2, 2024 at 12:21 AM

Reposted by Eran Malach

Quanquan Gu

@quanquangu.bsky.social

Just put together a starter pack for Deep Learning Theory. Let me know if you'd like to be included or suggest someone to add to the list!

go.bsky.app/2qnppia

November 22, 2024 at 9:35 PM

Reposted by Eran Malach

David Brandfonbrener

@brandfonbrener.bsky.social

How does test loss change as we change the training data? And how does this interact with scaling laws?

We propose a methodology to approach these questions by showing that we can predict the performance across datasets and losses with simple shifted power law fits.

November 21, 2024 at 3:11 PM
