Antonin Poché @ ACL
banner
antoninpoche.bsky.social
Antonin Poché @ ACL
@antoninpoche.bsky.social
PhD Student doing XAI for NLP at @ANITI_Toulouse, IRIT, and IRT Saint Exupery.

🛠️ Xplique library development team member.
Pinned
🔥ConSim has been accepted to the #ACL2025 main conference!

🙏 Thanks again to my amazing co-authors: @alon_jacovi, Agustin Picard, @VictorBoutin, and @Fannyjrd_.

Work done in DEEL and FOR from IRT St Exupéry and @ANITI_Toulouse.

See you in Vienna 📅

For more information, check out my last post:
🚀 Thrilled to share our new paper (the first of my PhD)!

How can we compare concept-based #XAI methods in #NLProc?

ConSim (arxiv.org/abs/2501.05855) provides the answer.

Read the thread to find out which method is the most interpretable! 🧵1/7
Reposted by Antonin Poché @ ACL
If you use GMail, AI (Gemini) was turned on yesterday by default and now scans all of your content for machine learning. To turn off, go to Settings>General and scroll down. Uncheck the box for "Smart features."

There's other "Smart" add-ons as well, but that's the one that reads your content.
November 20, 2025 at 5:32 PM
Reposted by Antonin Poché @ ACL
🕳️🐇 𝙄𝙣𝙩𝙤 𝙩𝙝𝙚 𝙍𝙖𝙗𝙗𝙞𝙩 𝙃𝙪𝙡𝙡 – 𝙋𝙖𝙧𝙩 𝙄 (𝑃𝑎𝑟𝑡 𝐼𝐼 𝑡𝑜𝑚𝑜𝑟𝑟𝑜𝑤)

𝗔𝗻 𝗶𝗻𝘁𝗲𝗿𝗽𝗿𝗲𝘁𝗮𝗯𝗶𝗹𝗶𝘁𝘆 𝗱𝗲𝗲𝗽 𝗱𝗶𝘃𝗲 𝗶𝗻𝘁𝗼 𝗗𝗜𝗡𝗢𝘃𝟮, one of vision’s most important foundation models.

And today is Part I, buckle up, we're exploring some of its most charming features. :)
October 14, 2025 at 9:00 PM
Reposted by Antonin Poché @ ACL
expressing appreciation for this scientific diagram
October 5, 2025 at 8:55 PM
🔥 I am super excited to be presenting a poster at #ACL2025 in Vienna next week! 🌏

This is my first big conference!

📅 Tuesday morning, 10:30–12:00, during Poster Session 2.

💬 If you're around, feel free to message me. I would be happy to connect, chat, or have a drink!
July 25, 2025 at 3:37 PM
Reposted by Antonin Poché @ ACL
🚨 New preprint! 🚨

Everyone loves causal interp. It’s coherently defined! It makes testable predictions about mechanistic interventions! But what if we had a different objective: predicting model behavior not under mechanistic interventions, but on unseen input data?
July 10, 2025 at 2:31 PM
🔥ConSim has been accepted to the #ACL2025 main conference!

🙏 Thanks again to my amazing co-authors: @alon_jacovi, Agustin Picard, @VictorBoutin, and @Fannyjrd_.

Work done in DEEL and FOR from IRT St Exupéry and @ANITI_Toulouse.

See you in Vienna 📅

For more information, check out my last post:
🚀 Thrilled to share our new paper (the first of my PhD)!

How can we compare concept-based #XAI methods in #NLProc?

ConSim (arxiv.org/abs/2501.05855) provides the answer.

Read the thread to find out which method is the most interpretable! 🧵1/7
May 16, 2025 at 8:45 AM
Reposted by Antonin Poché @ ACL
BlackboxNLP is back! 💥

Happy to be part of the organizing team for this year, and super excited for our new shared task using the excellent MIB Benchmark, check it out! blackboxnlp.github.io/2025/task/
BlackboxNLP, the leading workshop on interpretability and analysis of language models, will be co-located with EMNLP 2025 in Suzhou this November! 📆

This edition will feature a new shared task on circuits/causal variable localization in LMs, details here: blackboxnlp.github.io/2025/task
May 15, 2025 at 8:24 AM
Reposted by Antonin Poché @ ACL
🎉 Our Actionable Interpretability workshop has been accepted to #ICML2025! 🎉
> Follow @actinterp.bsky.social
> Website actionable-interpretability.github.io

@talhaklay.bsky.social @anja.re @mariusmosbach.bsky.social @sarah-nlp.bsky.social @iftenney.bsky.social

Paper submission deadline: May 9th!
March 31, 2025 at 4:59 PM
Reposted by Antonin Poché @ ACL
Hundreds of international students have just received an email telling them their visas have been revoked.

The ‘justification’ is campus activism or social media posts.

timesofindia.indiatimes.com/world/us/hun...
March 29, 2025 at 2:11 PM
Reposted by Antonin Poché @ ACL
Can we understand the mechanisms of a frontier AI model?

📝 Blog post: www.anthropic.com/research/tra...
🧪 "Biology" paper: transformer-circuits.pub/2025/attribu...
⚙️ Methods paper: transformer-circuits.pub/2025/attribu...

Featuring basic multi-step reasoning, planning, introspection and more!
On the Biology of a Large Language Model
transformer-circuits.pub
March 27, 2025 at 6:18 PM
Reposted by Antonin Poché @ ACL
Jawdropping.

You would expect this in a dictatorship, not the United States.

This country is unrecognizable.
March 20, 2025 at 2:11 AM
Reposted by Antonin Poché @ ACL
What will be the linchpin for AI dominance?

Read our NSF/OSTP recommendations written with Goodfire's Tom McGrath tommcgrath.github.io, Transluce's Sarah Schwettmann cogconfluence.com, MIT's Dylan Hadfield-Menell @dhadfieldmenell.bsky.social

TLDR; Dominance comes from **interpretability** 🧵 ↘️
March 16, 2025 at 1:57 PM
Reposted by Antonin Poché @ ACL
An assembly of 18 European companies, labs, and universities have banded together to launch 🇪🇺 EuroBERT!

It's a state-of-the-art multilingual encoder for 15 European languages, designed to be finetuned for retrieval, classification, etc.

Details in 🧵
March 10, 2025 at 9:43 AM
Super excited to welcome @gsarti.com in Toulouse with @fannyjrd.bsky.social and Thomas Mullor !

We will be working on a new library for interpretability 😀
Finally in Toulouse 🇫🇷 where I'll collaborate with @fannyjrd.bsky.social @antoninpoche.bsky.social and the DEEL/FOR teams at IRT & ANITI on an exciting interpretability project. Stay tuned! 🔍
February 25, 2025 at 5:52 PM
🚀 Thrilled to share our new paper (the first of my PhD)!

How can we compare concept-based #XAI methods in #NLProc?

ConSim (arxiv.org/abs/2501.05855) provides the answer.

Read the thread to find out which method is the most interpretable! 🧵1/7
January 31, 2025 at 2:51 PM
🤩 Thrilled to announce that I've started my PhD in
#XAI for #NLProc under the supervision of Pr. Nicholas Asher,
@philmuller.bsky.social, and @fannyjrd.bsky.social!

My project? Improve the transparency of LLMs through interactive explanations and user-tailored explanations. 🚀
January 24, 2025 at 4:10 PM