Gerard I. Gállego
geiongallego.bsky.social
Gerard I. Gállego
@geiongallego.bsky.social
Reposted by Gerard I. Gállego
Roger Moore reminds Interspeech audience that speech is not audible text. Text is a technology.
August 18, 2025 at 8:18 AM
A quick (and slightly late) career update: I joined the Barcelona Supercomputing Center (BSC) in January 2025! I'm now in a full-time role, back to Speech Translation after a few years of internships and detours. 🧵
June 3, 2025 at 8:53 PM
As we welcome 2025, we're excited to share that our paper, "Single-stage TTS with Masked Audio Token Modeling and Semantic Knowledge Distillation", has been accepted to #ICASSP2025!
This work advances single-stage Non-Autoregressive TTS based on audio token modeling.
🧵
December 31, 2024 at 7:48 PM
Reposted by Gerard I. Gállego
We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute 🔥

How? By combining step-wise reward models with tree search algorithms :)

We're open sourcing the full recipe and sharing a detailed blog post 👇
December 16, 2024 at 5:08 PM
Reposted by Gerard I. Gállego
congratulations, @ian-goodfellow.bsky.social, for the test-of-time award at @neuripsconf.bsky.social!

this award reminds me of how GAN started with this one email ian sent to the Mila (then Lisa) lab mailing list in May 2014. super insightful and amazing execution!
November 27, 2024 at 6:31 PM
Reposted by Gerard I. Gállego
Arxiv sharing reminder

pdf ❌
abs ✅
November 26, 2024 at 8:42 AM