DeepSeek has made some incredible innovations in model efficiency. But the order-of-magnitude gains are primarily due to their MoE architecture, which scales more favorably than a dense model in both training and inference, as sketched below.
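The intuition behind that scaling advantage is that an MoE layer holds many expert MLPs but routes each token through only a few of them, so per-token compute tracks the number of *active* experts rather than the total parameter count. Below is a minimal, illustrative sketch of a generic top-k MoE feed-forward layer; it is not DeepSeek's actual implementation, and the class name, dimensions, and expert count are assumptions chosen only to make the point concrete.

```python
# Illustrative top-k MoE layer (assumed sizes; not DeepSeek's real code).
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopKMoE(nn.Module):
    def __init__(self, d_model=512, d_ff=2048, num_experts=8, k=2):
        super().__init__()
        self.k = k
        self.router = nn.Linear(d_model, num_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(num_experts)
        )

    def forward(self, x):  # x: (tokens, d_model)
        scores = self.router(x)                         # (tokens, num_experts)
        topk_scores, topk_idx = scores.topk(self.k, dim=-1)
        weights = F.softmax(topk_scores, dim=-1)        # normalize over the chosen experts
        out = torch.zeros_like(x)
        # Each token is processed by only k of num_experts expert MLPs,
        # so active FLOPs per token scale with k, not with num_experts.
        for e, expert in enumerate(self.experts):
            token_idx, slot = (topk_idx == e).nonzero(as_tuple=True)
            if token_idx.numel() == 0:
                continue
            out[token_idx] += weights[token_idx, slot].unsqueeze(-1) * expert(x[token_idx])
        return out

# A layer with 8 experts stores roughly 8x the parameters of a single dense FFN,
# but with k=2 each token only pays for about 2 experts' worth of compute.
layer = TopKMoE()
tokens = torch.randn(16, 512)
print(layer(tokens).shape)  # torch.Size([16, 512])
```

In a dense model, every parameter participates in every token's forward pass, so capacity and compute grow together; in the sparse layer above, capacity grows with `num_experts` while per-token compute stays pinned to `k`, which is where the efficiency gap comes from.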