📅Wednesday, July 16th
🕓4:30-7:00 PM
📈#E-2802
📅Wednesday, July 16th
🕓4:30-7:00 PM
📈#E-2802
📄paper: arxiv.org/abs/2506.05971
💻 code: github.com/BenGutteridge/…
🙌 With Ben Gutteridge, Scott le Roux, @mmbronstein.bsky.social,
Xiaowen Dong#ICML20252#GNNN#AIAI
📄paper: arxiv.org/abs/2506.05971
💻 code: github.com/BenGutteridge/…
🙌 With Ben Gutteridge, Scott le Roux, @mmbronstein.bsky.social,
Xiaowen Dong#ICML20252#GNNN#AIAI
✅ Long-range can be formalized & measured
✅ Reveals new insights into models & datasets
🚀 Time to rethink evaluation: not just accuracy, but how models solve tasks
✅ Long-range can be formalized & measured
✅ Reveals new insights into models & datasets
🚀 Time to rethink evaluation: not just accuracy, but how models solve tasks
"Long-range" is often just a dataset intuition or model label.
We offer a measurable way to:
💡Understand models
🧪Test benchmarks
🦮Guide model design
🚀Go beyond performance gaps
"Long-range" is often just a dataset intuition or model label.
We offer a measurable way to:
💡Understand models
🧪Test benchmarks
🦮Guide model design
🚀Go beyond performance gaps
Surprisingly:
❌ Peptides-func: negative correlation, suggests not long-range
✅ VOC: positive correlation, suggests long-range
Surprisingly:
❌ Peptides-func: negative correlation, suggests not long-range
✅ VOC: positive correlation, suggests long-range
👷Construct synthetic tasks with analytically-known range
💯Show trained GNNs can approximate the true task range
🔬Use range as a proxy to evaluate real benchmarks
👷Construct synthetic tasks with analytically-known range
💯Show trained GNNs can approximate the true task range
🔬Use range as a proxy to evaluate real benchmarks
This measure applies to both node- and graph-level tasks, and across architectures.
This measure applies to both node- and graph-level tasks, and across architectures.
What makes a task long-range? How can we tell if a model actually captures long-range interactions?
What makes a task long-range? How can we tell if a model actually captures long-range interactions?