Felix Sarnthein
flxsa.bsky.social
PhD student in machine learning at ELLIS Institute Tübingen, MPI-IS and ETH. Prev: MSc in CS at ETH
Reposted by Felix Sarnthein
Just a heads up to everyone: @deep-mind.bsky.social is unfortunately a fake account and has been reported. Please do not follow it nor repost anything from it.
November 25, 2024 at 11:24 PM
Reposted by Felix Sarnthein
🍏 New preprint alert! 🍏
PoM: Efficient Image and Video Generation with the Polynomial Mixer
arxiv.org/abs/2411.12663
This is my latest "summer project" and it was so big I had to call in reinforcements (Thanks @nicolasdufour.bsky.social)

TL;DR Transformers are for boomers, welcome to the future
🧵👇
Diffusion models based on Multi-Head Attention (MHA) have become ubiquitous to generate high quality images and videos. However, encoding an image or a video as a sequence of patches results in costly...
November 20, 2024 at 8:08 AM