Harshit Sikchi
harshitsikchi.bsky.social
Harshit Sikchi
@harshitsikchi.bsky.social
Research @OpenAI. I study Reinforcement Learning. PhD from UT Austin. Previously FAIR Paris, Meta US, NVIDIA, CMU, and IIT Kharagpur.
Website: https://hari-sikchi.github.io/
(6/n) With RLZero, you can just pass in a YouTube video and ask an agent to mimic the behavior instantly. This brings us closer to true zero-shot cross-embodiment transfer.
December 11, 2024 at 7:11 AM
(5/n) RLZero’s Prompt to Policy: Asking a humanoid agent to perform a headstand.
December 11, 2024 at 7:11 AM
(4/n) Reward is an inconvenient and easily hackable form of task specification. Now, we can prompt and obtain behaviors zero-shot with language. Example: Asking a walker agent to perform a cartwheel.
December 11, 2024 at 7:11 AM
🤖 Introducing RL Zero 🤖: a new approach to transform language into behavior zero-shot for embodied agents without labeled datasets!
December 11, 2024 at 7:11 AM