Chunks learned in one environment improve exploration and sampling efficiency in unseen settings, suggesting the method abstracts high order general principles that are robust & adaptable to new envs!
Chunks learned in one environment improve exploration and sampling efficiency in unseen settings, suggesting the method abstracts high order general principles that are robust & adaptable to new envs!
For example, in FractalGrid, vanilla GFlowNets get stuck in a single mode, but armed with ActionPiece, it unlocks new exploration paths!
For example, in FractalGrid, vanilla GFlowNets get stuck in a single mode, but armed with ActionPiece, it unlocks new exploration paths!
Across synthetic and real-world tasks (e.g., RNA sequence generation, bit sequences, and FractalGrid), our approach improves especially for GFlowNets:
✅ Mode discovery
✅ Exploration
✅ Density estimation
✅ Interpretability
Across synthetic and real-world tasks (e.g., RNA sequence generation, bit sequences, and FractalGrid), our approach improves especially for GFlowNets:
✅ Mode discovery
✅ Exploration
✅ Density estimation
✅ Interpretability
We address the credit assignment challenge under long trajectories in RL or GFlowNets by constructing high order actions, or “chunks”, effectively compressing trajectory lengths!
We address the credit assignment challenge under long trajectories in RL or GFlowNets by constructing high order actions, or “chunks”, effectively compressing trajectory lengths!