Lightnews — Scholar-powered news

Harley Wiltzer

@harwiltz.bsky.social

PhD student at Mila / McGill. Studying distributional RL for transfer across risk-sensitive utilities, and for long-horizon high-frequency decision-making.

Posts Replies Media Videos

Reposted by Harley Wiltzer

Robin Ranjit Singh Chauhan

@robinchauhan.bsky.social

E65: NeurIPS 2024 – Posters and Hallways 3

- Claire Bizon Monroc of Inria : WFCRL for Wind Farm Control
Andrew Wagenmaker of @ucberkeleyofficial.bsky.social : Leveraging Simulation to Bridge Sim-to-Real Gap
- @harwiltz.bsky.social of @mila-quebec.bsky.social : Multivariate Distributional RL
(cont)

March 10, 2025 at 5:21 PM

Harley Wiltzer

@harwiltz.bsky.social

How can you 0-shot transfer predictions of long-term performance across reward functions *and* risk-sensitive utilities?

We can do this via Distributional Successor Features. Our recent work introduces the 1st tractable & provably convergent algos for learning DSFs.

#NeurIPS2024 #6704
12 Dec, 11-2

December 9, 2024 at 3:30 PM

Harley Wiltzer

@harwiltz.bsky.social

In value-based RL, when decisions are made at high frequency, all hell breaks loose.

Our paper "Action Gaps & Advantages in Continuous-Time Distributional RL" shows how Distributional RL sheds light on this, enabling high-frequency model-free risk-sensitive RL.

#NeurIPS2024 #6410
13 Dec, 11-2

December 9, 2024 at 2:46 PM

Reposted by Harley Wiltzer

Arthur Gretton

@arthurgretton.bsky.social

Distributional SFs: enable 0-shot generalization of return *distribution* functions across a finite-dimensional reward function class

"Foundations of Multivariate Distributional Reinforcement Learning"

#NeurIPS2024 #6704
12 Dec 11am-2pm
neurips.cc/virtual/2024...

Wiltzer Farebrother Rowland

December 8, 2024 at 10:43 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news