Francesco Ortu
banner
francescortu.bsky.social
Francesco Ortu
@francescortu.bsky.social
NLP & Interpretability | PhD Student @ University of Trieste & Laboratory of Data Engineering of Area Science Park | Prev MPI-IS
Thanks again, @diegodoimo.bsky.social and @albecazzaniga.bsky.social , for the fantastic mentorship and support! 🙏🎉 They are also attending #NeurIPS, so feel free to reach out to them to discuss our results. I’m excited to keep pushing forward on these topics! 🚀
December 10, 2024 at 8:11 PM
Thanks to the amazing team at LADE @areasciencepark: @lvaleriani.bsky.social @lbasile.bsky.social @AlessioAnsuini @diegodoimo.bsky.social @albecazzaniga.bsky.social 🙏
December 10, 2024 at 8:11 PM
It was super fun to take our first step in interpreting multimodal LLMs, working closely with the brilliant @alexpietroserra.bsky.social and @EmanuelePanizon
December 10, 2024 at 8:11 PM
✅ This shows that, starting from the mid-layers, a single token effectively summarizes all 1024 image tokens!

❌ This does not occur in models fine-tuned for visual understanding (such as Pixtral).
December 10, 2024 at 8:11 PM
Additionally, blocking communication from this token significantly disrupts performance on standard benchmarks, while blocking image-text communication does not
December 10, 2024 at 8:11 PM
🎯 Key finding: In these models the hidden representations of images and text form disjoint clusters and the communication between modalities is mediated by the special token <end-of-image>!
December 10, 2024 at 8:11 PM
🌐 Check out our code and data at: ritareasciencepark.github.io/Narrow-gate
ritareasciencepark.github.io
December 10, 2024 at 8:11 PM
Thanks for creating the starter pack! I'd love to be added as well! 😊
November 20, 2024 at 10:41 AM