Peter Carragher
pcarragher.bsky.social
CS PhD student at CMU

Retrieval, Webgraphs, Source Credibility, Adversarial Adaptation
Reposted by Peter Carragher
Yep, I'm on Substack: to make my research accessible and useful, I'll translate my work into digestible posts that explain how I study the local news market at scale, through text mining, to ask questions about media ownership, news deserts, and democracy: shorturl.at/q3VKI
August 28, 2025 at 12:29 PM
Last week, I presented my recent paper on memorization and hallucination in LVLMs at @l2m2workshop.bsky.social #ACL2025.

TLDR: model memorization may be more prevalent for image sources than for text! Details here: aclanthology.org/2025.l2m2-1....
August 7, 2025 at 6:35 PM
Just wrapped up my first IC2S2 conference with a talk on our ongoing research into how ownership structures of broadcasting companies influence editorial decisions in news outlets.

Check out our News Guessing game at newsguess.petercarragher.com and contribute to our research on media ownership!
News Site Guessing Game
newsguess.petercarragher.com
July 26, 2025 at 12:13 PM
Reposted by Peter Carragher
Excited to share our FAME method for news identification: Fingerprint-to-Article Matching for Events from a DB! We use it to study news coverage of disasters and conflicts (w @brenocon.bsky.social @ethanz.bsky.social). Check out our talk and poster at @icwsm.bsky.social!🧵👇
arxiv.org/abs/2506.12925
June 25, 2025 at 12:45 PM
Reposted by Peter Carragher
Just gave a talk about Dredge Words—queries for which unreliable domains rank highly #icwsm2025

ojs.aaai.org/index.php/IC...
June 26, 2025 at 3:17 PM
Excited that two workshop papers and one main conference paper I've been involved in are being presented at @icwsm.bsky.social! Thanks @kingcatherine.bsky.social and @evanup.bsky.social for letting me tag along. Details below.
June 24, 2025 at 9:10 AM
Preprint! Vision-language models like GPT-4o fail to identify multimodal knowledge conflicts and frequently hallucinate on counterfactual samples. Try out the batsman example below (prompt = "what is he holding?").
June 15, 2025 at 12:43 AM
Reposted by Peter Carragher
Another chapter in AI biting the hand that feeds it: Wikipedia’s bandwidth surged 50% since January thanks to AI crawlers.

Unlike search engines, they send no traffic back so no new users, no new donors. Just rising costs and a shrinking audience.

A raw deal for a cornerstone of the free web.
How crawlers impact the operations of the Wikimedia projects
Since the beginning of 2024, the demand for the content created by the Wikimedia volunteer community – especially for the 144 million images, videos, and other files on Wikimedia Commons – has grow…
diff.wikimedia.org
April 2, 2025 at 12:04 PM
My article with @evanup.bsky.social, "Misinformation Resilient Search Rankings with Webgraph-Based Interventions," was recently featured in a special issue on responsible recommender systems in TIST.

Sharing it here instead of on X for... reasons.

dl.acm.org/doi/full/10....
Misinformation Resilient Search Rankings with Webgraph-Based Interventions | ACM Transactions on Intelligent Systems and Technology
The proliferation of unreliable news domains on the internet has had wide-reaching negative impacts on society. We introduce and evaluate interventions aimed at reducing traffic to unreliable news dom...
dl.acm.org
February 27, 2025 at 8:26 PM
Reposted by Peter Carragher
An article covering our most recent paper on Google’s explicit content moderation :)
NEW: Google used to warn you when you were seeing low-quality search results. Then, in the weeks leading up to the 2024 election, the company quietly turned that warning off www.platformer.news/google-data-...
February 25, 2025 at 2:54 AM