Kamilė Stankevičiūtė
@kamile.st
Machine Learning PhD student at Cambridge University, visiting Cornell. Previously at Oxford and Google. Principled ML & applications in medicine.
PhantomWiki has just been accepted at the #ICLR2025 DATA-FM workshop! 🎉
🎉 PhantomWiki is accepted to the @iclr-conf.bsky.social DATA-FM workshop! Come chat with us in Singapore 🦁

🧠 The reasoning + retrieval benchmark comes right on the heels of the new @realaaai.bsky.social presidential report: AI reasoning and agents research is front and center!
🚀 📢 Releasing PhantomWiki, a reasoning + retrieval benchmark for LLM agents!

If I asked you "Who is the friend of the father of the mother of Tom?", you'd simply look up Tom -> mother -> father -> friend and answer.

🤯 SOTA LLMs, even DeepSeek-R1, struggle with such simple reasoning!
March 6, 2025 at 6:19 PM
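The chained lookup described in the post above is just a short walk over a relation graph. Here is a minimal sketch of that Tom -> mother -> father -> friend traversal over a toy fact table; the names and the `resolve` helper are invented for illustration and are not PhantomWiki's actual interface.

```python
# Toy knowledge base for the example question; all facts are made up.
facts = {
    ("Tom", "mother"): "Alice",
    ("Alice", "father"): "Bob",
    ("Bob", "friend"): "Carol",
}

def resolve(entity, relations):
    """Follow a chain of relations starting from `entity`."""
    for relation in relations:
        entity = facts[(entity, relation)]
    return entity

# "Who is the friend of the father of the mother of Tom?"
# reads inside-out as Tom -> mother -> father -> friend.
print(resolve("Tom", ["mother", "father", "friend"]))  # Carol
```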
Excited to announce our new work on reasoning and retrieval evaluation in LLMs! 😊

Check it out!
🚀 📢 Releasing PhantomWiki, a reasoning + retrieval benchmark for LLM agents!

If I asked you "Who is the friend of the father of the mother of Tom?", you'd simply look up Tom -> mother -> father -> friend and answer.

🤯 SOTA LLMs, even DeepSeek-R1, struggle with such simple reasoning!
March 6, 2025 at 3:05 AM
Reposted by Kamilė Stankevičiūtė
New preprint!

This is a hardcore technical paper on Thompson sampling as a strategy for the so-called online learning game.

In the long term, I think it's one of the most important things I have ever worked on, because of what it makes possible.

That needs explaining: thread below!

arxiv.org/abs/2502.14790
An Adversarial Analysis of Thompson Sampling for Full-information Online Learning: from Finite to Infinite Action Spaces
We develop an analysis of Thompson sampling for online learning under full feedback - also known as prediction with expert advice - where the learner's prior is defined over the space of an adversary'...
arxiv.org
February 21, 2025 at 8:57 PM
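For context on the setting: in prediction with expert advice, the learner commits to one of K experts each round and then observes every expert's loss (full information). A Thompson-sampling-style learner keeps a posterior belief about each expert and follows a posterior sample. The sketch below uses Beta-Bernoulli posteriors against a toy stochastic environment; it is only an illustration of the general idea, not the prior construction or adversarial analysis in the paper.

```python
import numpy as np

rng = np.random.default_rng(0)
K, T = 5, 2000                # number of experts, number of rounds

# Beta(1, 1) prior on each expert's mean loss; losses are assumed to be 0/1.
alpha = np.ones(K)
beta = np.ones(K)

# Toy environment (stand-in for the adversary): expert k has Bernoulli(mu[k]) losses.
mu = rng.uniform(0.3, 0.8, size=K)
mu[0] = 0.1                   # one clearly good expert

learner_loss = 0.0
cumulative = np.zeros(K)      # realized loss of every expert
for _ in range(T):
    sampled = rng.beta(alpha, beta)          # one posterior sample per expert
    choice = int(np.argmin(sampled))         # follow the expert that looks best
    losses = (rng.random(K) < mu).astype(float)
    learner_loss += losses[choice]
    cumulative += losses
    alpha += losses                          # full information: update every expert,
    beta += 1.0 - losses                     # not only the one we followed

print(f"learner loss: {learner_loss:.0f}, best expert: {cumulative.min():.0f}, "
      f"regret: {learner_loss - cumulative.min():.0f}")
```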