Primoz Ravbar
@primozravbar.bsky.social
Researcher @UCSB
Neuroscience, ethology, computational biology, theoretical biology, data science, ML, AI, artificial life
Reposted by Primoz Ravbar
This paper looks interesting - it argues that you don’t need adaptive optimizers like Adam to get good gradient-based training; instead, you can just set a learning rate for each group of units based on statistics computed at initialization (a rough sketch of the idea follows the post):

arxiv.org/abs/2412.11768

#MLSky #NeuroAI
No More Adam: Learning Rate Scaling at Initialization is All You Need
In this work, we question the necessity of adaptive gradient methods for training deep neural networks. SGD-SaI is a simple yet effective enhancement to stochastic gradient descent with momentum (SGDM)...
arxiv.org
December 20, 2024 at 7:00 PM
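
Since the post only describes the idea at a high level, here is a minimal, hypothetical sketch of what "one learning rate per parameter group, fixed at initialization, then plain SGD with momentum" can look like in PyTorch. This is not the paper's SGD-SaI implementation: the per-tensor grouping and the gradient signal-to-noise proxy are illustrative assumptions; only the overall shape (probe gradients once, freeze per-group rates, train without Adam-style adaptivity) follows the post and the abstract.

```python
import torch
import torch.nn as nn

def per_group_lr_at_init(model, loss_fn, probe_batch, base_lr=0.01, eps=1e-8):
    """Probe gradients once at initialization and return one parameter group
    per tensor, each with its own learning rate that is then left frozen."""
    inputs, targets = probe_batch
    model.zero_grad()
    loss_fn(model(inputs), targets).backward()

    groups = []
    for p in model.parameters():
        if p.grad is None:
            continue
        g = p.grad.detach()
        # Illustrative signal-to-noise proxy: mean |gradient| over its spread.
        snr = g.abs().mean() / (g.std() + eps)
        groups.append({"params": [p], "lr": base_lr * float(snr)})
    model.zero_grad()
    return groups

# Usage: compute the per-group rates once, then train with ordinary SGD with
# momentum; no per-step adaptive rescaling happens after this point.
model = nn.Sequential(nn.Linear(32, 64), nn.ReLU(), nn.Linear(64, 10))
x, y = torch.randn(128, 32), torch.randint(0, 10, (128,))
param_groups = per_group_lr_at_init(model, nn.CrossEntropyLoss(), (x, y))
optimizer = torch.optim.SGD(param_groups, lr=0.01, momentum=0.9)
```

The point of the sketch is that all the scaling is decided before training starts, so the optimizer carries only momentum buffers rather than Adam's per-parameter second-moment state.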