Miguel Angel Bautista
itsbautistam.bsky.social
Miguel Angel Bautista
@itsbautistam.bsky.social
I am a research scientist @ Apple MLR, seeking a grand unification of generative modeling 🇪🇸🇺🇸
One thing that I am excited about is unlocking the power of function space models for domain-agnostic learning. I am positive that a single architecture can achieve SOTA generation results for images, videos, 3D pointclouds and graphs.
December 6, 2024 at 7:55 PM
When we started getting this results on ImageNet-256 I was impressed that a model that predicts each pixel independently (through a cross-attention block), can generate these high-frequency details.
December 6, 2024 at 7:55 PM