Sebastian Lapuschkin
banner
slapuschkin.bsky.social
Sebastian Lapuschkin
@slapuschkin.bsky.social
Head of XAI research at Fraunhofer HHI

Google Scholar: https://scholar.google.de/citations?user=wpLQuroAAAAJ
Reposted by Sebastian Lapuschkin
🚀 We’ve released the source code for 𝗔𝗦𝗜𝗗𝗘 (presented as an 𝗢𝗿𝗮𝗹 at the #ICLR2025 BuildTrust workshop)!

🔍 ASIDE boosts prompt injection robustness without safety-tuning: we simply rotate embeddings of marked tokens by 90° during instruction-tuning and inference.

👇 code & docs👇
June 24, 2025 at 1:47 PM
Have had enough of the fake "sources" "cited" by ChatGPT? We have the solution in the form of low-cost causal citations for LLMs.

Go check this out! arxiv.org/abs/2505.15807

Thanks to my amazing co-authors
@pkhdipraja.bsky.social,
@reduanachtibat.bsky.social , Thomas Wiegand and Wojciech Samek!
May 28, 2025 at 2:50 PM