André Panisson
panisson.bsky.social
Principal Researcher @ CENTAI.eu | Leading the Responsible AI Team. Building Responsible AI through Explainable AI, Fairness, and Transparency. Researching Graph Machine Learning, Data Science, and Complex Systems to understand collective human behavior.
Anthropic dropped some insights into how AI brains work with their circuit tracing method. Turns out LLMs do mental math partly by eyeballing it (“36+59? Eh, 40ish+60ish, so somewhere near 95?”) while a parallel pathway nails down the last digit. It means we’re one step closer to understanding the inner workings of LLMs.
#LLMs #AI #Interpretability
Tracing the thoughts of a large language model
Anthropic's latest interpretability research: a new microscope to understand Claude's internal mechanisms
www.anthropic.com
March 31, 2025 at 7:26 AM
Reposted by André Panisson
*Automatically Interpreting Millions of Features in LLMs*
by @norabelrose.bsky.social et al.

An open-source pipeline for finding interpretable features in LLMs with sparse autoencoders and automated explainability methods from @eleutherai.bsky.social.

arxiv.org/abs/2410.13928
November 27, 2024 at 2:58 PM
Reposted by André Panisson
Explainable and Interpretable Multimodal Large Language Models: A Comprehensive Survey

Presents a framework categorizing MLLM explainability across data, model, and training perspectives to enhance transparency and trustworthiness.

📝 arxiv.org/abs/2412.02104
The rapid development of Artificial Intelligence (AI) has revolutionized numerous fields, with large language models (LLMs) and computer vision (CV) systems driving advancements in natural language un...
arxiv.org
December 4, 2024 at 5:54 AM
Reposted by André Panisson
I am extremely honoured to receive the @ERC_Research #ERCCoG award for #RUNES. For the next five years, I will be working on the mathematical, computational, and experimental (!!) sides to understand how higher-order interactions change how we think and coordinate.
ALT: a bald man with a beard is smiling in front of a group of people
media.tenor.com
December 3, 2024 at 3:58 PM
Reposted by André Panisson
Latest one out! 👇👇👇👇👇
November 30, 2024 at 10:19 AM
Check out our poster at #LoG2024, based on our #TMLR paper:
📍 “A True-to-the-Model Axiomatic Benchmark for Graph-based Explainers”
🗓️ Tuesday 4–6 PM CET
📌 Poster Session 2, GatherTown
Join us to discuss graph ML explainability and benchmarks
#ExplainableAI #GraphML
openreview.net/forum?id=HSQTv3R8Iz
November 26, 2024 at 5:27 PM
Reposted by André Panisson
🌟🤖📝 **Boosting human competences with interpretable and explainable artificial intelligence**

How can AI *boost* human decision-making instead of replacing it? We talk about this in our new paper.

doi.org/10.1037/dec0...

#AI #XAI #InterpretableAI #IAI #boosting #competences
🧵👇
November 20, 2024 at 12:25 PM
Reposted by André Panisson
NeurIPS Conference is now Live on Bluesky!

-NeurIPS2024 Communication Chairs
November 22, 2024 at 1:33 AM
Reposted by André Panisson
Even as an interpretable ML researcher, I wasn't sure what to make of Mechanistic Interpretability, which seemed to come out of nowhere not too long ago.

But then I found the paper "Mechanistic?" by @nsaphra.bsky.social and @sarah-nlp.bsky.social, which clarified things.
November 20, 2024 at 8:00 AM
Reposted by André Panisson
18M + 1.
💙, Mar🐫
bsky.app Bluesky @bsky.app · Nov 16
Another day, another million new people have joined Bluesky!

18M users? 🙂‍↔️ 18M friends 🙂‍↕️
November 17, 2024 at 2:58 AM
ICLR is a top AI conference, and while the 2025 papers aren’t officially out yet, reviews are open. I’m diving into the highest-rated submissions in Interpretability and Explainable AI. Interestingly, the top ones focus on Mechanistic Interpretability, a promising topic that our team is starting to explore.
November 16, 2024 at 5:33 PM
Reposted by André Panisson
For Science Magazine, I wrote about "The Metaphors of Artificial Intelligence".

The way you conceptualize AI systems affects how you interact with them, do science on them, and create policy and apply laws to them.

Hope you will check it out!

www.science.org/doi/full/10....
The metaphors of artificial intelligence
A few months after ChatGPT was released, the neural network pioneer Terrence Sejnowski wrote about coming to grips with the shock of what large language models (LLMs) could do: “Something is beginning...
www.science.org
November 14, 2024 at 10:56 PM
Bluesky feels like traveling back to the golden age of Twitter: when the follow button meant something, and your feed wasn’t a dystopian lineup of blue-tagged bots. It’s refreshing to be somewhere I don’t need an AI to explain why I’m seeing a post. Let’s hope we don’t ruin it this time!
November 16, 2024 at 11:22 AM