https://web.ml.tu-berlin.de/author/laura-kopf/
We introduce PRISM, a framework for extracting multi-concept feature descriptions to better understand polysemanticity.
📄 Capturing Polysemanticity with PRISM: A Multi-Concept Feature Description Framework
arxiv.org/abs/2506.15538
🧵 (1/7)
aclanthology.org/2025.blackbo...
#EMNLP #BlackboxNLP #XAI #Interpretability
🗞️ aclanthology.org/2025.blackbo...
bsky.app/profile/nfel...
Andreas Lutz @eberleoliver.bsky.social Manuel Welte @lorenzlinhardt.bsky.social @lkopf.bsky.social
#AI #XAI #Interpretability
Let's connect!
#XAI #ExplainableAI #MechInterp #MachineLearning #Interpretability
We provide the first survey of concept description generation and evaluation methods.
Joint effort w/ @lkopf.bsky.social
📄 arxiv.org/abs/2510.01048
In this work, we introduce a multi-concept feature description framework that can identify and score polysemantic features.
📄 Paper: arxiv.org/abs/2506.15538
#NeurIPS #MechInterp #XAI
Join me this Thursday, December 12th, in East Exhibit Hall A-C, Poster #3107, from 11 a.m. PST to 2 p.m. PST. I'll be discussing our paper “CoSy: Evaluating Textual Explanations of Neurons.”