Tzu-Heng (Brian) Huang
banner
zihengh1.bsky.social
Tzu-Heng (Brian) Huang
@zihengh1.bsky.social
CS Ph.D. Student @UWMadison. Research Intern @Apple AIML. Focusing on multimodal models, data curation, and data-centric AI.

zihengh1.github.io
Reposted by Tzu-Heng (Brian) Huang
Online data mixing reduces training costs for foundation models, but faces challenges:
⚠️ Human-defined domains miss semantic nuances
⚠️ Limited eval accessibility
⚠️ Poor scalability

Introducing 🎵R&B: first regroup data, then dynamically reweight domains during training!
May 8, 2025 at 5:01 PM
Reposted by Tzu-Heng (Brian) Huang
What enables a strong model to surpass its weaker teacher?

🚀 Excited to share our ICLR 2025 paper: "Weak-to-Strong Generalization Through the Data-Centric Lens"! 🧵
February 5, 2025 at 6:22 PM
Tons of model weights available, but what else can we do besides prediction? 🤔 Introducing Grad-Mimic! A new data selection framework using well-trained model’s weights to find high-value samples for foundation models. Boost data curation & data efficiency!
February 9, 2025 at 9:08 PM
Reposted by Tzu-Heng (Brian) Huang
First up at #NeurIPS2024 from our group, our work on labeling via programmatic distillation (a spotlight!). Label your data orders of magnitude faster and cheaper — come join us today at Poster Session 2 East for a demo!
December 11, 2024 at 11:15 PM