Project Website: wujunjie1998.github.io/araoc-benchm...
(1/4)
Project Website: wujunjie1998.github.io/araoc-benchm...
(1/4)
📚Link: physico-benchmark.github.io
While models like o3 have made impressive strides on ARC-AGI, how well do LLMs truly grasp the abstract patterns in ARC-style tasks?
(1/5)
📚Link: physico-benchmark.github.io
While models like o3 have made impressive strides on ARC-AGI, how well do LLMs truly grasp the abstract patterns in ARC-style tasks?
(1/5)