Lightnews — Scholar-powered news

Junjie Wu

@junjie116.bsky.social

15 followers 110 following 9 posts

NLP PhD candidate @HKUST | Visiting PhD student @YaleNLP

Junjie Wu

@junjie116.bsky.social

🚀 Can LLMs think beyond memorization? Our NAACL 2025 main conference paper on fluid intelligence shows why models like GPT-4o struggle with truly novel problem-solving on ARC-AGI. 📷

Project Website: wujunjie1998.github.io/araoc-benchm...

(1/4)

February 15, 2025 at 4:15 AM

Junjie Wu

@junjie116.bsky.social

🚀 Introducing PhysiCo: A New Benchmark for Evaluating Abstract Understanding in LLMs! 🚀

📚Link: physico-benchmark.github.io

While models like o3 have made impressive strides on ARC-AGI, how well do LLMs truly grasp the abstract patterns in ARC-style tasks?

(1/5)

February 15, 2025 at 4:09 AM
