Carlo Sferrazza
@carlosferrazza.bsky.social
Postdoc at Berkeley AI Research. PhD from ETH Zurich.
Robotics, Artificial Intelligence, Humanoids, Tactile Sensing.
https://sferrazza.cc
We just released FastTD3: a simple, fast, off-policy RL algorithm to train humanoid policies that transfer seamlessly from simulation to the real world.
younggyo.me/fast_td3
May 29, 2025 at 5:49 PM
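For context, a minimal sketch of the TD3 core that FastTD3 builds on: twin critics with clipped double-Q targets, target-policy smoothing, and delayed actor updates. This is not the FastTD3 release itself; the engineering that makes it fast (e.g., large-batch training on parallel simulation) is omitted, and all names are illustrative.

```python
import torch
import torch.nn.functional as F

def td3_update(batch, actor, actor_tgt, q1, q2, q1_tgt, q2_tgt,
               actor_opt, critic_opt, step, gamma=0.99, tau=0.005,
               policy_noise=0.2, noise_clip=0.5, policy_delay=2):
    obs, act, rew, next_obs, done = batch

    with torch.no_grad():
        # Target-policy smoothing: perturb the target action with clipped noise.
        noise = (torch.randn_like(act) * policy_noise).clamp(-noise_clip, noise_clip)
        next_act = (actor_tgt(next_obs) + noise).clamp(-1.0, 1.0)
        # Clipped double-Q: the min over twin target critics curbs overestimation.
        target_q = torch.min(q1_tgt(next_obs, next_act), q2_tgt(next_obs, next_act))
        target = rew + gamma * (1.0 - done) * target_q

    # Both critics regress toward the shared TD target.
    critic_loss = F.mse_loss(q1(obs, act), target) + F.mse_loss(q2(obs, act), target)
    critic_opt.zero_grad(); critic_loss.backward(); critic_opt.step()

    if step % policy_delay == 0:
        # Delayed actor update: ascend Q1 under the current policy.
        actor_loss = -q1(obs, actor(obs)).mean()
        actor_opt.zero_grad(); actor_loss.backward(); actor_opt.step()
        # Polyak-average all target networks.
        with torch.no_grad():
            for net, tgt in ((actor, actor_tgt), (q1, q1_tgt), (q2, q2_tgt)):
                for p, tp in zip(net.parameters(), tgt.parameters()):
                    tp.lerp_(p, tau)
```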
FuSe policies reason jointly over vision, touch, and sound, enabling tasks such as multimodal disambiguation, generation of object descriptions upon interaction, and compositional cross-modal prompting (e.g., “press the button with the same color as the soft object”).
January 13, 2025 at 6:51 PM
Ever wondered what robots 🤖 could achieve if they could not just see – but also feel and hear?
Introducing FuSe: a recipe for finetuning large vision-language-action (VLA) models with heterogeneous sensory data, such as vision, touch, sound, and more.
Details in the thread 👇
January 13, 2025 at 6:51 PM
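As a rough illustration (not the released FuSe code), one way such a finetuning recipe can be shaped: per-modality encoders map vision, touch, and audio into a shared token space, the tokens join the language prompt in the VLA backbone's input sequence, and the model is finetuned end to end with the action loss. Module names are hypothetical, and the language-generation head implied by the object-description capability above is omitted.

```python
import torch
import torch.nn as nn

class MultimodalVLA(nn.Module):
    """Hypothetical wrapper: heterogeneous sensors feeding a pretrained VLA."""
    def __init__(self, backbone, vision_enc, touch_enc, audio_enc, action_head):
        super().__init__()
        self.backbone = backbone  # pretrained VLA transformer
        self.encoders = nn.ModuleDict({
            "vision": vision_enc, "touch": touch_enc, "audio": audio_enc,
        })
        self.action_head = action_head

    def forward(self, inputs, lang_tokens):
        # Each encoder maps raw sensor data into the backbone's token space.
        tokens = [self.encoders[m](x) for m, x in inputs.items()]
        # Language instruction and sensory tokens share one input sequence.
        seq = torch.cat([lang_tokens, *tokens], dim=1)  # [B, T, d_model]
        feats = self.backbone(seq)
        return self.action_head(feats)
```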
Combined with DrQv2 and DrM, MaxInfoRL achieves state-of-the-art model-free performance on hard visual control tasks such as the DMControl humanoid and dog tasks, improving both sample efficiency and steady-state performance.
December 17, 2024 at 5:47 PM
While standard Boltzmann exploration (e.g., SAC) focuses only on action entropy, MaxInfoRL maximizes entropy over both states and actions! This proves crucial in hard exploration settings.
December 17, 2024 at 5:47 PM
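A hedged sketch of the exploration idea: augment the extrinsic reward with an intrinsic information-gain bonus so the policy is steered toward informative transitions, while the usual SAC action-entropy term still handles the action space. The ensemble-disagreement proxy and all names below are illustrative assumptions, not the paper's exact estimator.

```python
import torch

def intrinsic_bonus(ensemble, obs, act):
    """Epistemic-uncertainty proxy: disagreement of an ensemble of
    learned dynamics models about the next state."""
    preds = torch.stack([m(obs, act) for m in ensemble])  # [E, B, obs_dim]
    return preds.var(dim=0).mean(dim=-1)                  # [B]

def augmented_reward(rew, ensemble, obs, act, beta=1.0):
    # Extrinsic reward plus an exploration bonus for informative
    # transitions; the SAC-style action-entropy term is added in the
    # actor loss as usual (not shown).
    return rew + beta * intrinsic_bonus(ensemble, obs, act)
```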
🚨 New reinforcement learning algorithms 🚨
Excited to announce MaxInfoRL, a class of model-free RL algorithms that solves complex continuous control tasks (including vision-based!) by steering exploration towards informative transitions.
Details in the thread 👇
December 17, 2024 at 5:47 PM