Check out the full paper here: 📄 arxiv.org/abs/2505.03176
💻 Code coming soon!
📬 DM me if you’d like to chat or discuss the paper!
(10/10)
(9/10)
(8/10)
(7/10)
(6/10)
An action-invariant aggregate representation
Action-equivariant individual-view representations
💡 No explicit equivariance loss or dual predictor required!
(5/10)
➡️ A transformer encoder aggregates these action-conditioned view representations to predict an as-yet-unseen view.
(4/10)
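For readers who want a concrete picture of the aggregation step, here is a rough sketch in plain Python. It is not the paper's code: the additive action conditioning, the single attention head, the mean pooling, and every name in it (`predict_unseen_view`, `Wa`, `Wq`, `Wk`, `Wv`) are assumptions made purely for illustration.

```python
import math
import random

def matvec(W, x):
    # Multiply a matrix (list of rows) by a vector.
    return [sum(w * v for w, v in zip(row, x)) for row in W]

def rand_matrix(rows, cols, rng):
    # Small random weights; a stand-in for learned parameters.
    return [[rng.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(tokens, Wq, Wk, Wv):
    # Single-head scaled dot-product self-attention over the view tokens.
    q = [matvec(Wq, t) for t in tokens]
    k = [matvec(Wk, t) for t in tokens]
    v = [matvec(Wv, t) for t in tokens]
    scale = math.sqrt(len(tokens[0]))
    out = []
    for qi in q:
        scores = [sum(a * b for a, b in zip(qi, kj)) / scale for kj in k]
        weights = softmax(scores)
        out.append([sum(w * vj[c] for w, vj in zip(weights, v))
                    for c in range(len(v[0]))])
    return out

def predict_unseen_view(view_embs, actions, params):
    # Assumed conditioning: add a linear projection of each action to its view.
    tokens = [[z + p for z, p in zip(emb, matvec(params["Wa"], act))]
              for emb, act in zip(view_embs, actions)]
    attended = self_attention(tokens, params["Wq"], params["Wk"], params["Wv"])
    # Assumed aggregation: mean-pool the attended tokens into one vector,
    # used here as the predicted embedding of the unseen view.
    n = len(attended)
    return [sum(tok[c] for tok in attended) / n
            for c in range(len(attended[0]))]

# Toy usage: 3 context views, embedding size 4, action size 3.
rng = random.Random(0)
d, k = 4, 3
params = {"Wa": rand_matrix(d, k, rng), "Wq": rand_matrix(d, d, rng),
          "Wk": rand_matrix(d, d, rng), "Wv": rand_matrix(d, d, rng)}
views = [[rng.gauss(0.0, 1.0) for _ in range(d)] for _ in range(3)]
actions = [[rng.gauss(0.0, 1.0) for _ in range(k)] for _ in range(3)]
pred = predict_unseen_view(views, actions, params)
```

Because this toy version uses no positional encoding and pools by averaging, its output does not depend on the order in which the context views are listed. Whether the actual model shares that property is not something this sketch can tell you.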
(3/10)
(2/10)