Michael Poli
@michaelpoli.bsky.social
AI, numerics and systems.
Founding Scientist at Liquid AI
[6] It's now undeniable that, with a little bit of creativity, improving scaling is not only approachable but also particularly rewarding. And while I'm obviously excited by convolution-attention Hyena hybrids for their balance of efficiency and quality across domains, there's a lot more to do!
November 14, 2024 at 8:33 PM
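For readers unfamiliar with the term, a convolution-attention hybrid simply interleaves a long-convolution sequence mixer with standard attention layers. The sketch below is a toy illustration only, not the Evo or StripedHyena code: a gated depthwise long convolution stands in for a Hyena-style operator, and every module name, size, and hyperparameter here is made up for the example.

```python
# Toy convolution-attention hybrid block (illustrative only; not the Evo/StripedHyena implementation).
import torch
import torch.nn as nn


class GatedLongConvMixer(nn.Module):
    """Causal depthwise long convolution with a multiplicative gate (Hyena-flavored stand-in)."""

    def __init__(self, d_model: int, kernel_size: int = 128):
        super().__init__()
        self.in_proj = nn.Linear(d_model, 2 * d_model)
        # Depthwise conv over the sequence dimension; left padding keeps it causal.
        self.conv = nn.Conv1d(
            d_model, d_model, kernel_size,
            padding=kernel_size - 1, groups=d_model,
        )
        self.out_proj = nn.Linear(d_model, d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:  # x: (batch, seq, d_model)
        u, gate = self.in_proj(x).chunk(2, dim=-1)
        u = u.transpose(1, 2)                      # (batch, d_model, seq)
        u = self.conv(u)[..., : x.shape[1]]        # trim the causal padding back to seq length
        u = u.transpose(1, 2)                      # (batch, seq, d_model)
        return self.out_proj(u * torch.sigmoid(gate))


class HybridBlock(nn.Module):
    """One convolution mixer followed by one attention layer, each with a pre-norm residual."""

    def __init__(self, d_model: int = 256, n_heads: int = 8):
        super().__init__()
        self.norm1 = nn.LayerNorm(d_model)
        self.conv_mixer = GatedLongConvMixer(d_model)
        self.norm2 = nn.LayerNorm(d_model)
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = x + self.conv_mixer(self.norm1(x))
        h = self.norm2(x)
        x = x + self.attn(h, h, h, need_weights=False)[0]
        return x


x = torch.randn(2, 1024, 256)          # (batch, seq, d_model)
print(HybridBlock()(x).shape)          # torch.Size([2, 1024, 256])
```

The rough intuition behind interleaving the two operator types, as I understand it, is that the convolutional layers keep most of the stack cheap over long sequences, while the occasional attention layer supplies the precise in-context recall that pure convolution models tend to lack.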
[5] We have seen time and again that different classes of computational units outperform others in different modalities, on different tasks, in different regimes. We've seen this in scaling laws, on synthetics, and at inference.
November 14, 2024 at 8:32 PM
[4] There has been a flurry of work over the last couple of years (from the great people at HazyResearch and elsewhere) on developing bespoke model designs as "proofs of existence" to challenge the Transformer orthodoxy, at a time when model design was considered "partially solved."
November 14, 2024 at 8:32 PM
[3] We continue to push the scale of what's possible with "beyond Transformer" models applied to biology, in what could be among the most computationally intensive, fully open (weights, data, pretraining infrastructure) sets of pretrained models in AI as a whole.
November 14, 2024 at 8:31 PM
[2] A lot has happened since the first release of Evo. We have made public the original pretraining dataset (OpenGenome)—links below—and will soon release the entire pretraining infrastructure and model code.
November 14, 2024 at 8:31 PM