Can LLMs with reasoning + web search reliably fact-check political claims?
We evaluated 15 models from OpenAI, Google, Meta, and DeepSeek on 6,000+ PolitiFact claims (2007–2024).
Short answer: Not reliably—unless you give them curated evidence.
arxiv.org/abs/2511.18749
Can LLMs with reasoning + web search reliably fact-check political claims?
We evaluated 15 models from OpenAI, Google, Meta, and DeepSeek on 6,000+ PolitiFact claims (2007–2024).
Short answer: Not reliably—unless you give them curated evidence.
arxiv.org/abs/2511.18749
and based in a low- or middle-income country?
💡 Submit a project idea, get matched with a mentor, present virtually at ICWSM'26, and prepare a submission for Sept 2026!
📢 Call icwsm.org/2026/submit....
🚀 Apply by Jan 15 forms.gle/A9GkJboP7qi3...
and based in a low- or middle-income country?
💡 Submit a project idea, get matched with a mentor, present virtually at ICWSM'26, and prepare a submission for Sept 2026!
📢 Call icwsm.org/2026/submit....
🚀 Apply by Jan 15 forms.gle/A9GkJboP7qi3...
Call for abstracts: tinyurl.com/3tbj2v83
Call for satellites: tinyurl.com/42sru6kz
Call for abstracts: tinyurl.com/3tbj2v83
Call for satellites: tinyurl.com/42sru6kz
📄 Data descriptor: doi.org/10.1038/s415...
📈 Interactive app to explore the data: domaindemo.info
💽 Dataset: doi.org/10.5281/zeno...
📄 Data descriptor: doi.org/10.1038/s415...
📈 Interactive app to explore the data: domaindemo.info
💽 Dataset: doi.org/10.5281/zeno...
Link: doi.org/10.1038/s415...
Link: doi.org/10.1038/s415...
🔗 arxiv.org/abs/2505.09877
1/3
🔗 arxiv.org/abs/2505.09877
1/3
- Adopt the new Responses API.
- Better handling of the structured output.
Link: github.com/yang3kc/llm_...
Please star if you find it useful~
- Adopt the new Responses API.
- Better handling of the structured output.
Link: github.com/yang3kc/llm_...
Please star if you find it useful~
We've also added a "Reports" section with some analyses. For NSF, we see that the STEM education directorate has been absolutely pummeled.
We've also added a "Reports" section with some analyses. For NSF, we see that the STEM education directorate has been absolutely pummeled.
📄 Full paper: arxiv.org/abs/2504.12902
💻 Codes: github.com/osome-iu/ris...
💾 Dataset: zenodo.org/records/1506...
📄 Full paper: arxiv.org/abs/2504.12902
💻 Codes: github.com/osome-iu/ris...
💾 Dataset: zenodo.org/records/1506...
Get your submissions in while you still can and join us at @icwsm.bsky.social!
Get your submissions in while you still can and join us at @icwsm.bsky.social!
📅 Deadline: April 10, AoE
#CyberSocialThreats #AIforSocialGood #ICWSM2025
💡Share your research on generative AI, online safety, harms and threats, or political conflict in online platforms, with the leading minds in the field. We’d love to see your submissions.
📆New Deadline: April 10 AoE
📅 Deadline: April 10, AoE
#CyberSocialThreats #AIforSocialGood #ICWSM2025
💡Share your research on generative AI, online safety, harms and threats, or political conflict in online platforms, with the leading minds in the field. We’d love to see your submissions.
📆New Deadline: April 10 AoE
💡Share your research on generative AI, online safety, harms and threats, or political conflict in online platforms, with the leading minds in the field. We’d love to see your submissions.
📆New Deadline: April 10 AoE
Make sure to get your submissions in!
Make sure to get your submissions in!