Nils Feldhus
@nfel.bsky.social
Post-doctoral Researcher at BIFOLD / TU Berlin interested in interpretability and analysis of language models. Guest researcher at DFKI Berlin. https://nfelnlp.github.io/
It was a real pleasure to visit the Health NLP Lab in Tübingen and present my research at BIFOLD and TU Berlin in collaboration with Charité and University of Augsburg among others. We had some exciting discussions. Thanks for having me!
Last week, Dr. Nils Feldhus @nfel.bsky.social, postdoctoral researcher at @tuberlin.bsky.social and @bifold.berlin, visited our lab and presented his research during our weekly lab meeting.
January 21, 2026 at 2:42 PM
Sharing my favorite papers I read in 2025 from human-centric XAI, mechanistic interpretability, NLG evaluation, and related fields, covering conferences I've attended (ACL in Austria, EMNLP in China), but also journals, ML and HCI conferences:

nfelnlp.github.io/recommended/...
January 2, 2026 at 9:16 AM
Reposted by Nils Feldhus
I’m at #NeurIPS in San Diego this week! Come see our poster on feature interpretability. Find @eberleoliver.bsky.social and me at:

🪧Poster Session 1 @ Exhibit Hall C,D,E #1015
Wed 3 Dec, 11 am - 2 pm
🪧Poster @ Mech Interp Workshop
Upper Level Room 30A-E
Sun 7 Dec, 8 am - 5 pm
December 2, 2025 at 6:56 PM
Reposted by Nils Feldhus
*Urgently* looking for emergency reviewers for the ARR October Interpretability track 🙏🙏

ReSkies much appreciated
November 11, 2025 at 10:29 AM
Reposted by Nils Feldhus
Heading to the EMNLP BlackboxNLP Workshop this Sunday? Don’t miss @nfel.bsky.social and @lkopf.bsky.social's poster on "Interpreting Language Models Through Concept Descriptions: A Survey"
aclanthology.org/2025.blackbo...

#EMNLP #BlackboxNLP #XAI #Interpretability
Nov 9, @blackboxnlp.bsky.social, 11:00-12:00 @ Hall C – Interpreting Language Models Through Concept Descriptions: A Survey (Feldhus & Kopf) @lkopf.bsky.social

🗞️ aclanthology.org/2025.blackbo...

bsky.app/profile/nfel...
November 8, 2025 at 10:55 AM
I'm at #EMNLP2025 in Suzhou🇨🇳 to present these papers in the coming days:

Nov 7, Session 14, 12:30-13:30 @ Hall C – Multilingual Datasets for Custom Input Extraction and Explanation Requests Parsing in Conversational XAI Systems (Wang et al.) @qiaw99.bsky.social

🗞️ aclanthology.org/2025.finding...
November 6, 2025 at 7:00 AM
🔍 Are you curious about uncovering the underlying mechanisms and identifying the roles of model components (neurons, …) and abstractions (SAEs, …)?

We provide the first survey of concept description generation and evaluation methods.

Joint effort w/ @lkopf.bsky.social

📄 arxiv.org/abs/2510.01048
October 2, 2025 at 9:13 AM
Reposted by Nils Feldhus
Happy to share that our PRISM paper has been accepted at #NeurIPS2025 🎉

In this work, we introduce a multi-concept feature description framework that can identify and score polysemantic features.

📄 Paper: arxiv.org/abs/2506.15538

#NeurIPS #MechInterp #XAI
September 19, 2025 at 12:02 PM
The submission deadline of the inaugural Young Researchers workshop at INLG 2025 has been extended by 5 days.
We're excited to receive your 2-page position papers showcasing your NLG-related research until August 31, 2025! @siggen.bsky.social

ynlg-workshop.github.io

bsky.app/profile/nfel...
August 25, 2025 at 11:27 AM
Reposted by Nils Feldhus
Excited to announce the first-ever Workshop for Young Researchers in Natural Language Generation (YNLG), supported by @siggen.bsky.social, taking place on October 29, 2025 in Hanoi, Vietnam, co-located with INLG 2025.
Call for Submissions is out now!

ynlg-workshop.github.io
August 12, 2025 at 7:05 AM
Thoroughly enjoying the range of topics at the first #ACL2025NLP poster session!

Our FitCF poster presentation on counterfactual example generation at #ACL2025 has been moved to Tuesday, July 29, at 16:00-17:30.

bsky.app/profile/nfel...
Qianli Wang ( @qiaw99.bsky.social ) et al.:
"FitCF: A Framework for Automatic Feature Importance-guided Counterfactual Example Generation"
📄 ACL Anthology: aclanthology.org/2025.finding...
🏟️ ACL Findings, July 28 @ Hall 4/5
Poster presentation: 18:00-19:30 (pres. by Qianli and myself)
FitCF: A Framework for Automatic Feature Importance-guided Counterfactual Example Generation
Qianli Wang, Nils Feldhus, Simon Ostermann, Luis Felipe Villa-Arenas, Sebastian Möller, Vera Schmitt. Findings of the Association for Computational Linguistics: ACL 2025. 2025.
July 28, 2025 at 10:12 AM
🚆On my way to Vienna! #ACL2025NLP #ACL2025
Together with my amazing colleagues from TU Berlin, DFKI, Saarland & Potsdam, I will present 4 papers on counterfactuals (Findings), free-text rationales (GEM), fact checking (FEVER oral), and table understanding (TRL oral).
Excited to meet old and new friends!
July 26, 2025 at 9:37 AM
Reposted by Nils Feldhus
Very happy to be at #FAccT2025 in Athens, where I presented our work "Gender Bias in Explainability: Investigating Performance Disparity in Post-hoc Methods"

📄Paper: dl.acm.org/doi/10.1145/...

At #FAccT2025? Let's connect if you're interested in improving the usability of explainability methods!
June 26, 2025 at 5:25 AM
Glad to announce our #FAccT2025 paper about gender bias in feature attribution methods, led by Mahdi Dhaini, will be presented tomorrow in 🇬🇷 Athens as part of the "Evaluating Explainable AI" session from 10:45 AM to 12:15 PM in Amphitheatre Ioannis Despotopoulos: programs.sigchi.org/facct/2025/p...
June 23, 2025 at 12:05 PM
Reposted by Nils Feldhus
🔍 When do neurons encode multiple concepts?

We introduce PRISM, a framework for extracting multi-concept feature descriptions to better understand polysemanticity.

📄 Capturing Polysemanticity with PRISM: A Multi-Concept Feature Description Framework
arxiv.org/abs/2506.15538

🧵 (1/7)
June 19, 2025 at 3:18 PM
Successfully defended my PhD yesterday! 🎓 🎉
Special thanks to my mentor Sebastian Möller and professors Sina Zarrieß @clausebielefeld.bsky.social, Christin Seifert, and @matthiasboehm7.bsky.social for being part of my committee.
Will continue working on XAI & NLP as a post-doc at TU Berlin & BIFOLD.
April 12, 2025 at 4:29 PM
My colleagues at TUB & DFKI are organizing a Shared Task for the next #SMM4H-HeaRD workshop, which will be co-located with AAAI ICWSM 2025 in June:
💊 Detection of adverse drug events in multilingual (DE, FR, RU, EN) and multi-platform social media posts. 💊

healthlanguageprocessing.org/smm4h-2025/
Social Media Mining for Health/Health Real-World Data (#SMM4H-HeaRD) 2025 Workshop and Shared Tasks
The Social Media Mining for Health (#SMM4H) Workshop provides an interdisciplinary forum to present and discuss natural langu…
healthlanguageprocessing.org
February 17, 2025 at 9:23 AM
Our paper Cross-Refine on natural language explanations will be presented as a poster by @qiaw99.bsky.social today at 14:00 local time in the Atrium of #COLING2025.

Proceedings are now available as well: aclanthology.org/2025.coling-...

bsky.app/profile/nfel...
January 21, 2025 at 6:03 AM
Our new COLING 2025-accepted work Cross-Refine, led by @qiaw99.bsky.social, approaches the problem of free-text rationalization with a generator-critic setup: the generator outputs initial explanations and the critic provides feedback on them. #COLING2025

arXiv: arxiv.org/abs/2409.07123
January 3, 2025 at 3:01 PM
I published a 2024 Recap showcasing my favorite papers from this year's major conferences (ACL, EMNLP, COLM, NeurIPS, ICLR, CHI, FAccT, etc.) that influence my current and future research:

nfelnlp.github.io/recommended/...

More gists/summaries to be added...
December 29, 2024 at 5:53 PM
Reposted by Nils Feldhus
Recruiting reviewers + ACs for ACL 2025 in Interpretability and Analysis of NLP Models
- DM me if you are interested in emergency reviewer/AC roles for March 18th to 26th
- Self-nominate for positions here (review period is March 1 through March 20): docs.google.com/forms/d/e/1F...
Volunteer to join ACL 2025 Programme Committee
Use this form to express your interest in joining the ACL 2025 programme committee as a reviewer or area chair (AC). The review period is 1st to 20th of March 2025. ACs need to be available for variou...
docs.google.com
December 10, 2024 at 10:39 PM
Reposted by Nils Feldhus
📣📣 Wanna be an Area Chair or a Reviewer for @aclmeeting.bsky.social or know someone who would?

Nominations and self-nominations go here 👇

docs.google.com/forms/d/e/1F...
December 6, 2024 at 6:01 AM