Lightnews — Scholar-powered news

Tzu-Heng (Brian) Huang

@zihengh1.bsky.social

CS Ph.D. Student @UWMadison. Research Intern @Apple AIML. Focusing on multimodal models, data curation, and data-centric AI.

zihengh1.github.io

Posts Replies Media Videos

Reposted by Tzu-Heng (Brian) Huang

albertge.bsky.social

@albertge.bsky.social

Online data mixing reduces training costs for foundation models, but faces challenges:
⚠️ Human-defined domains miss semantic nuances
⚠️ Limited eval accessibility
⚠️ Poor scalability

Introducing 🎵R&B: first regroup data, then dynamically reweight domains during training!

May 8, 2025 at 5:01 PM

Reposted by Tzu-Heng (Brian) Huang

changho.bsky.social

@changho.bsky.social

What enables a strong model to surpass its weaker teacher?

🚀 Excited to share our ICLR 2025 paper: "Weak-to-Strong Generalization Through the Data-Centric Lens"! 🧵

February 5, 2025 at 6:22 PM

Tzu-Heng (Brian) Huang

@zihengh1.bsky.social

Tons of model weights available, but what else can we do besides prediction? 🤔 Introducing Grad-Mimic! A new data selection framework using well-trained model’s weights to find high-value samples for foundation models. Boost data curation & data efficiency!

February 9, 2025 at 9:08 PM

Reposted by Tzu-Heng (Brian) Huang

Fred Sala

@fredsala.bsky.social

First up at #NeurIPS2024 from our group, our work on labeling via programmatic distillation (a spotlight!). Label your data orders of magnitude faster and cheaper — come join us today at Poster Session 2 East for a demo!

December 11, 2024 at 11:15 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news