zihengh1.github.io
arxiv.org/abs/2501.06708.
If you found this interesting, feel free to spread the word!
arxiv.org/abs/2501.06708.
If you found this interesting, feel free to spread the word!
✔️ Pretrained model weights are reliable guides for data selection.
✔️ Grad-Mimic identifies noisy samples and estimates training dataset quality.
✔️ It even complements other filtering methods, boosting CLIP performance with less data!
✔️ Pretrained model weights are reliable guides for data selection.
✔️ Grad-Mimic identifies noisy samples and estimates training dataset quality.
✔️ It even complements other filtering methods, boosting CLIP performance with less data!
1️⃣ Training Phase: Prioritizes which samples to learn, boosting data efficiency.
2️⃣ Post-Training Phase: Evaluates sample utility across training steps, creating an ensemble filter using weak supervision.
1️⃣ Training Phase: Prioritizes which samples to learn, boosting data efficiency.
2️⃣ Post-Training Phase: Evaluates sample utility across training steps, creating an ensemble filter using weak supervision.
⚠️ Existing methods have limitations: Model-free approaches are hard to design, while model-based ones can be computationally expensive.
✅ Grad-Mimic offers a better way using existing model weights!
arxiv.org/abs/2501.06708
⚠️ Existing methods have limitations: Model-free approaches are hard to design, while model-based ones can be computationally expensive.
✅ Grad-Mimic offers a better way using existing model weights!
arxiv.org/abs/2501.06708