Lightnews — Scholar-powered news

Alex Heyman

@alexheyman.bsky.social

PhD candidate AI/ML researcher at York University, ON, CA | they/them

OpenAI’s and DeepSeek’s reasoning LLMs have scored impressively on benchmarks that challenge humans, but how robust are their fundamentals? We test o1-mini and R1 on small-scale graph coloring problems and find limited reliability and signs of issues with non-linear reasoning.
(1/10)

February 13, 2025 at 6:14 PM
