We’ve released a major update to our ProtRL repo:
✅ GRPO via Hugging Face Trainer
✅ New support for weighted DPO
Built for flexible, scalable RL with HF trainer base!
Check here: github.com/AI4PDLab/Pro...
We’ve released a major update to our ProtRL repo:
✅ GRPO via Hugging Face Trainer
✅ New support for weighted DPO
Built for flexible, scalable RL with HF trainer base!
Check here: github.com/AI4PDLab/Pro...