klaudia jaźwińska
@klaudia.bsky.social
researching journalism x tech for @towcenter.bsky.social, writing in @columjournreview.bsky.social

📍 on the amtrak between philly and nyc

klaudiajazwinska.com
damn there's a fake, likely AI-generated quote attributed to me in an article about AI. I've never used the phrase "Faustian bargain" in my life
September 26, 2025 at 2:20 PM
When @aisvarya17.bsky.social and I asked seven different AI models to identify the location, date and source of photos taken by photojournalists, they collectively answered only 14 of the 280 queries completely correctly.
August 26, 2025 at 10:10 PM
As in our past research into AI search tools, we observed instances where the models were confidently wrong, and others where they hedged despite providing correct answers. www.cjr.org/tow_center/w...
August 26, 2025 at 8:46 PM
However, they also struggle to reliably perform fundamental fact-checking tasks, such as identifying whether an image is authentic. Each model we tested misidentified at least one real photo as AI-generated.
August 26, 2025 at 8:46 PM
We observed that errors compounded in the LLMs’ reasoning: if one conclusion was wrong, subsequent steps often built on that error.
August 26, 2025 at 8:46 PM
For example, when we asked Grok Deep Research to identify the provenance of a photo taken during flooding in Valencia, Spain, it fixated on text on a person’s shirt that said “Venice Beach,” leading it to incorrectly conclude that the photo was taken during flooding in Venice, Italy.
August 26, 2025 at 8:46 PM
NEW in @columjournreview.bsky.social: Why AI models are bad at fact-checking photos www.cjr.org/tow_center/w...
August 26, 2025 at 8:12 PM
lol came across this amazing interaction on X related to my + @aisvarya17.bsky.social's research
April 21, 2025 at 12:35 AM
5️⃣ Finally, content licensing deals with news sources provided no guarantee of accurate citation in chatbot responses. For instance, @sfchronicle.com has a deal with OpenAI, but ChatGPT correctly identified only 1 of the 10 excerpts we shared from the publisher.
March 6, 2025 at 2:50 PM
4️⃣ Generative search tools made up links and cited syndicated and copied versions of articles. For example, more than half of responses from Gemini and Grok-3 cited fabricated or broken URLs that led to error pages.
March 6, 2025 at 2:50 PM
3️⃣ Multiple chatbots seemed to bypass robots.txt preferences. For instance, Perplexity correctly identified all 10 excerpts from paywalled articles we shared from National Geographic even though the publisher has disallowed Perplexity's crawler and has no formal relationship with the company.
March 6, 2025 at 2:50 PM
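For context, robots.txt is purely advisory: a crawler has to choose to check and honor it. A minimal Python sketch of that check, using the standard library's urllib.robotparser (the article URL is a placeholder, and PerplexityBot is the crawler user agent Perplexity publicly documents):

```python
from urllib.robotparser import RobotFileParser

# Fetch and parse the publisher's robots.txt file.
rp = RobotFileParser("https://www.nationalgeographic.com/robots.txt")
rp.read()

# Placeholder article URL for illustration.
page = "https://www.nationalgeographic.com/example-article"

# A compliant crawler checks its own user agent against the rules
# and skips the page if it is disallowed; nothing enforces this.
if rp.can_fetch("PerplexityBot", page):
    print("robots.txt permits fetching this page")
else:
    print("robots.txt disallows this crawler")
```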
2️⃣ While premium chatbots like Grok-3 and Perplexity answered more prompts correctly than their corresponding free equivalents, they paradoxically also provided more confidently incorrect answers than their free counterparts.
March 6, 2025 at 2:50 PM
We found that:

1️⃣ Chatbots were generally bad at declining to answer questions they couldn’t answer accurately, offering incorrect or speculative answers instead.
March 6, 2025 at 2:50 PM
We chose 20 publishers with varying stances on AI access, randomly selected 10 articles from each, and manually picked excerpts from those articles to use in our queries. We asked each chatbot to identify the corresponding article’s headline, original publisher, publication date and URL.
March 6, 2025 at 2:50 PM
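To make the setup concrete, here is a minimal Python sketch of that protocol; query_chatbot and the data structures are hypothetical stand-ins, not the study's actual code:

```python
import random

def evaluate_chatbot(publishers, query_chatbot):
    """Score one chatbot on excerpt-attribution queries."""
    results = []
    for pub in publishers:  # 20 publishers with varying stances on AI access
        for article in random.sample(pub["articles"], 10):
            # Excerpts were chosen manually in the study; here we just
            # assume each article record carries one.
            answer = query_chatbot(article["excerpt"])
            results.append({
                "headline_ok": answer.get("headline") == article["headline"],
                "publisher_ok": answer.get("publisher") == pub["name"],
                "date_ok": answer.get("date") == article["date"],
                "url_ok": answer.get("url") == article["url"],
            })
    return results  # 20 publishers x 10 articles = 200 queries per chatbot
```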
Really excellent explainer by @shaynelongpre.bsky.social that clearly lays out what's at stake in the "AI crawler wars"
February 21, 2025 at 4:56 PM
For @columjournreview.bsky.social, @aisvarya17.bsky.social and I tested Grok-3's ability to identify quotes from news publishers. Out of the 200 quotes we tested, 102 returned hallucinated citations, and the original source article was correctly cited only nine times. www.cjr.org/the_media_to...
February 20, 2025 at 2:51 PM
Excellent presentation at #IASEAI25 today by @juliaangwin.com, who stressed the importance of putting pressure on AI companies to prioritize accuracy and truth in their outputs (in this photo she’s citing research by the great @sayash.bsky.social and @randomwalker.bsky.social)
February 7, 2025 at 2:54 PM
"It’s a 'you' problem that TikTok will happily serve up joke videos until the moment of your death."
January 8, 2025 at 9:30 PM
Ultimately, what we found is that ChatGPT search offers publishers the illusion of control. No publisher – regardless of degree of affiliation with OpenAI – was spared from inaccurate representations of its content. (7/9)
November 27, 2024 at 7:36 PM
The search tool's temperature settings likely contributed to inconsistent outputs. When we asked it to identify the same Washington Post quote twice, it answered incorrectly on one occasion and correctly on another. (6/9)
November 27, 2024 at 7:34 PM
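Temperature controls how much randomness goes into sampling each token, so the same prompt can yield different answers on different runs. A minimal sketch of the effect, assuming the OpenAI Python SDK (the model name and prompt are placeholders, not the setup behind ChatGPT search):

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

prompt = "Identify the publisher, date and URL of this quote: <excerpt>"

# With temperature > 0, token sampling is stochastic, so the identical
# request can come back with different (and differently wrong) answers.
for run in range(2):
    response = client.chat.completions.create(
        model="gpt-4o",  # placeholder model name
        messages=[{"role": "user", "content": prompt}],
        temperature=1.0,  # higher values add more randomness
    )
    print(f"run {run + 1}:", response.choices[0].message.content)
```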
In some instances, the search bot attributed the text to syndicated or plagiarized versions of the articles, instead of the original source. (5/9)
November 27, 2024 at 7:33 PM
More than a third of ChatGPT’s responses to our queries included incorrect citations such as the one below, where it attributes a letter to the editor published in the @orlandosentinel.com to TIME. (4/9)
November 27, 2024 at 7:32 PM
Specifically, we were looking to see if it accurately returned the publisher, date and URL of the quotes we shared.
Of the 200 responses to our queries, 153 were either partially or entirely incorrect. But the chatbot expressed a lack of confidence in its answers only 7 times. (3/9)
November 27, 2024 at 7:31 PM
Meanwhile, critics warn that this farsighted movement is drawing attention and resources away from urgent, presently observable harms of AI deployments.
October 30, 2023 at 9:51 PM