AILC-NLP
@ailc-nlp.bsky.social
Account of the Italian Association of Computational Linguistics (Associazione Italiana di Linguistica Computazionale), http://www.ai-lc.it/
Let’s present the tasks at #EVALITA2026!
Last but not least: FadeIT 🔍💬

Can your system spot fallacies in social media posts?
FadeIT focuses on Italian texts about migration, climate change & public health.

Detect flawed reasoning — where it spreads fastest.
#NLProc
October 24, 2025 at 7:24 AM
Let’s present the tasks at #EVALITA2026!
Tenth up: PFB – Prometeia Financial Benchmark 💶📊

Can LLMs handle finance?
PFB evaluates open & closed models on domain-specific MCQs, with a twist: each question has a complexity score.

2 tasks: Italian and Multilingual QA

From GPT to finance pro.
#NLProc
October 22, 2025 at 8:38 AM
Let’s present the tasks at #EVALITA2026!
Ninth up: Cruciverb-IT 🧩🇮🇹

Ready to crack some Italian crosswords?
Cruciverb-IT offers a challenging playground for NLP systems:
1️⃣ Answer clues from real crosswords
2️⃣ Autonomously solve full crossword grids

Wordplay meets AI.
#NLProc
October 20, 2025 at 8:38 AM
Let’s present the tasks at #EVALITA2026!
Eighth up: SVELA 🧠🧽

Can LLMs forget on purpose?
SVELA tackles Machine Unlearning: design and evaluate metrics to verify if a model forgets specific knowledge while keeping the rest intact.

Selective forgetting, measurable impact.
#NLProc
October 17, 2025 at 11:14 AM
Let’s present the tasks at #EVALITA2026!
Seventh up: MultiPRIDE 🏳️‍🌈🧠

Can a system tell when a slur is reclaimed?
💬 In this multilingual task (IT/ES/EN), classify whether LGBTQ+ terms in context are used with reclamatory intent.

It’s not just about words, it’s about meaning.
#NLProc
October 15, 2025 at 3:53 PM
Let’s present the tasks at #EVALITA2026!
Sixth up: DeSegMa-It 🤖📝

Can you spot the line between human and machine?
DeSegMa-It challenges systems to:
1️⃣ Detect machine-generated texts
2️⃣ Segment where the human ends & the machine begins

Human or AI? Let’s find out.
#NLProc
October 13, 2025 at 8:45 AM
Let’s present the tasks at #EVALITA2026!
Fifth up: Enhanced-VWSD 🖼️📚

Can you pick the right image for a word in context?
Given a sentence and 10 images, choose the one that best captures the meaning of a target word.

A vision-meets-language challenge!
#NLProc
October 10, 2025 at 1:07 PM
Let’s present the tasks at #EVALITA2026!
Fourth up: IMPOLS 🗳️

Can systems detect what’s not said in political speech?
💬 IMPOLS targets implicit, questionable content that sounds true but isn’t explicit.

🔍 Tasks:
1️⃣ Detect implicit contents
2️⃣ Classify them
3️⃣ Classify implicatures

#NLProc
October 8, 2025 at 2:01 PM
Let’s present the tasks at #EVALITA2026!
Third up: ATE-IT 🏷️

Time to extract key concepts automatically, with the first large-scale eval of Automatic Term Extraction for Italian on institutional texts.

Subtasks:
🔹 Term Extraction
🔹 Term Variants Clustering

Let’s make terminology smarter.
#NLProc
October 6, 2025 at 8:18 AM
Let’s present the tasks at #EVALITA2026!
Second up: GSI:detect

Can machines detect gender stereotypes in Italian texts?
🧠 Score sentences for stereotypical content
🏷️ Classify them into stereotype categories

From classification to social awareness.
#NLProc
September 30, 2025 at 7:32 AM
Let’s present the tasks at #EVALITA2026!
First up: EXPLAINITA 🔍

Can you explain what a latent neuron means?
🧠 Describe Sparse Autoencoder latents
📝 Decide if a text activates a latent based on its explanation

From prediction to interpretability 💬
#NLProc
September 26, 2025 at 10:03 AM