Lightnews — Scholar-powered news

Onno Eberhard

@onnoeberhard.com

540 followers 280 following 50 posts

PhD Student in Tübingen (MPI-IS & Uni Tü), interested in reinforcement learning. Freedom is a pure idea. https://onnoeberhard.com/

Posts Replies Media Videos

Onno Eberhard

@onnoeberhard.com

A cute little animation: a critically damped harmonic oscillator becomes unstable with integral control if the gain is too high. Here, at K_i = 2, a Hopf bifurcation occurs: two poles of the transfer function enter the right-hand s-plane and the closed-loop system becomes unstable.

September 9, 2025 at 2:34 PM

Onno Eberhard

@onnoeberhard.com

Without forgetting, the learning is intractable: it is equivalent to keeping the complete history. However, to distinguish histories that differ only far in the past, we need to "zoom in" a lot, as shown here.

July 16, 2025 at 1:35 AM

Onno Eberhard

@onnoeberhard.com

What about memory traces? Here, I am visualizing the space 𝒵 of all possible memory traces for the case where there are only 3 possible (one-hot) observations, 𝒴 = {a, b, c}. We can show that, if 𝜆 < 1/2, then memory traces preserve all information of the complete history! Nothing is forgotten!

July 16, 2025 at 1:35 AM

Onno Eberhard

@onnoeberhard.com

With this increased sample efficiency, the algorithm can even tackle high-dimensional, non-smooth, and stochastic MuJoCo environments, as shown here.

June 4, 2025 at 8:06 AM

Onno Eberhard

@onnoeberhard.com

This strategy manages to learn highly effective open-loop controllers, like this one that swings up an inverted pendulum.

June 4, 2025 at 8:06 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news