Adi Simhi
@adisimhi.bsky.social
Ph.D. student at the Technion, working on NLProc and machine learning.
🔍Check out our paper "Trust Me, I’m Wrong: High-Certainty Hallucinations in LLMs" at arxiv.org/pdf/2502.12964, with code at github.com/technion-cs-...
February 19, 2025 at 3:50 PM
What do you think? 🤔
Could high-certainty hallucinations be a major roadblock to safe AI deployment? Let’s discuss! 👇
February 19, 2025 at 3:50 PM
🔮 Takeaway:
We need new approaches to understand hallucinations so we can mitigate them better.
This research moves us toward deeper insights into why LLMs hallucinate and how we can build more trustworthy AI.
February 19, 2025 at 3:50 PM
💡Why does this matter?
- Not all hallucinations stem from uncertainty or lack of knowledge.
- High-certainty hallucinations appear systematically across models & datasets.
- This challenges existing hallucination detection & mitigation strategies that rely on uncertainty signals.
February 19, 2025 at 3:50 PM
🛠️How did we test this?
We used knowledge detection & uncertainty measurement methods to analyze when and how hallucinations occur.
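Not the paper's actual pipeline, but here is a minimal sketch of what those two measurements can look like in practice, assuming a HuggingFace causal LM (GPT-2 as a stand-in) and a toy prompt: a knowledge check via the probability the model assigns to the correct answer, and a certainty score from the probabilities of the tokens it actually generates. See the linked repo for the real implementation.

```python
# Minimal sketch (assumptions: GPT-2 as a stand-in model, a toy fact;
# NOT the paper's code) of two ingredients mentioned above:
# (1) knowledge detection - does the model rank the correct answer first?
# (2) certainty measurement - how much probability does it put on the
#     answer it actually generates?
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder; the paper studies larger LLMs
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).eval()

prompt = "The capital of France is"
correct_answer = " Paris"  # toy example fact

inputs = tok(prompt, return_tensors="pt")

with torch.no_grad():
    # (1) Knowledge detection: probability of the correct next token
    # under the prompt, and whether it is the model's top choice.
    logits = model(**inputs).logits[0, -1]
    probs = torch.softmax(logits, dim=-1)
    correct_id = tok(correct_answer, add_special_tokens=False)["input_ids"][0]
    knows_it = probs.argmax().item() == correct_id
    p_correct = probs[correct_id].item()

    # (2) Certainty measurement: greedy-decode an answer and take the
    # product of per-token probabilities as a simple certainty score.
    out = model.generate(
        **inputs,
        max_new_tokens=3,
        do_sample=False,
        return_dict_in_generate=True,
        output_scores=True,
        pad_token_id=tok.eos_token_id,
    )
    gen = out.sequences[0, inputs["input_ids"].shape[1]:]
    step_probs = [
        torch.softmax(score[0], dim=-1)[t].item()
        for score, t in zip(out.scores, gen)
    ]
    certainty = float(torch.tensor(step_probs).prod())

print(f"knows the answer: {knows_it} (p={p_correct:.3f})")
print(f"generated: {tok.decode(gen)!r}, certainty ≈ {certainty:.3f}")
```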
February 19, 2025 at 3:50 PM
🚨Key finding:
LLMs can produce hallucinations with high certainty—even when they possess the correct knowledge!
February 19, 2025 at 3:50 PM
🔍The problem:
LLMs sometimes generate hallucinations - factually incorrect outputs. A common assumption is that if the model is certain and does not lack knowledge, its output must be correct.
February 19, 2025 at 3:50 PM