Davide Testa
davide97ai.bsky.social
NLP and AI enthusiast! 💫🔍🤖
PhD student in AI at @fbk-nlp.bsky.social
#NLProc

https://linktr.ee/davide.testa
Excited to present our work on the MAIA benchmark at EMNLP 2025!

Thanks to my co-authors for this collaboration 🙌🔥

See you in Suzhou next November!!! 🇨🇳🚀
September 2, 2025 at 10:35 AM
Reposted by Davide Testa
Welcome to MAIA! 🚀 Our new benchmark for evaluating multimodal reasoning in Vision-LMs on videos, fully in Italian! 🇮🇹 MAIA tests understanding & generation with fine-grained reasoning categories and a brand-new evaluation metric! 🔎🔥 Discover MAIA here: arxiv.org/abs/2502.16989
#NLProc #evaluation #AI
All-in-one: Understanding and Generation in Multimodal Reasoning with the MAIA Benchmark
We introduce MAIA (Multimodal AI Assessment), a native-Italian benchmark designed for fine-grained investigation of the reasoning abilities of visual language models on videos. MAIA differs from other...
arxiv.org
February 27, 2025 at 9:34 AM
Reposted by Davide Testa
🚀 **Exciting News!** 🎉 Evalita-LLM is here! 🇮🇹 A new benchmark for evaluating LLMs, offering native Italian tasks, generative challenges, and fair multi-prompt evaluations. Now also available in the lm-evaluation-harness by @eleutherai.bsky.social!
ArXiv: arxiv.org/abs/2502.02289
#NLProc #LLM #Evaluation
Evalita-LLM: Benchmarking Large Language Models on Italian
We describe Evalita-LLM, a new benchmark designed to evaluate Large Language Models (LLMs) on Italian tasks. The distinguishing and innovative features of Evalita-LLM are the following: (i) all tasks ...
arxiv.org
February 24, 2025 at 5:07 PM
Reposted by Davide Testa
Our group leader took the stage at the FBK plenary session to showcase our research interests, ongoing projects, challenges and future plans. An exciting moment to share our vision and push the boundaries of NLP even further!
Here’s a glimpse of the event! 📸✨
#NLProc #AI #Research #FBK #Innovation
February 7, 2025 at 2:35 PM
Reposted by Davide Testa
🚀 #CALAMITA has officially kicked off - and we’re on board!
Fondazione Bruno Kessler proudly participated in this 1st #evaluation #campaign for #LLMs with our Andrea Zaninello, together with the @fbk-mt.bsky.social group! 🤖
Here are some pictures of our calamitici! 🧲
See u next year 4 other challenges! 🔥🔍
December 19, 2024 at 1:42 PM
Reposted by Davide Testa
Exciting news! 🎉 If you’re curious about the opening #tutorial given by our group at the CLiC-it 2024 conference on processing #data for #training and #evaluating #LLMs, here’s your chance! 📊 Explore the slides and get inspired!!! 🚀🔍✨
docs.google.com/presentation...
@ailc-nlp.bsky.social
Data-LLM-Tutorial
You Are what You Eat Processing Data for Training and Evaluating LLMs Giovanni Bonetta and Bernardo Magnini Fondazione Bruno Kessler, Trento, Italy {gbonetta|magnini}@fbk.eu Tutorial at CLiC-it 2024, ...
docs.google.com
December 17, 2024 at 4:16 PM
Reposted by Davide Testa
#NLP Research group in action! Our group leader Bernardo Magnini, alongside @tizaino.bsky.social, Sofia Brenna, and Giovanni Bonetta, presenting their #paper "Are you a Good Assistant? Assessing LLM Trustability in Task-oriented Dialogues" at CLiC-it '24 #Pisa 🔍✨
@ailc-nlp.bsky.social
#NLProc #AI
December 16, 2024 at 9:05 AM
Reposted by Davide Testa
We start this new account with the group picture of #clicit2024! Follow us! #NLProc #NLP
December 6, 2024 at 4:57 PM
Reposted by Davide Testa
What a conference! 🎉 CLiC-it in #Pisa was truly special! Returning to where it all began a decade ago! ❤️
Our group was honored to participate, share ideas and connect with such an inspiring community.
Here are some highlights from the conference!!!📸✨
#CLiCit2024 #NLP #AI @ailc-nlp.bsky.social
December 13, 2024 at 8:26 AM