Researchers call ChatGPT Search answers ‘confidently wrong’

By Andrew Tarantola Published December 3, 2024

ChatGPT search — OpenAI

ChatGPT was already a threat to Google Search, but ChatGPT Search was supposed to clench its victory, along with being an answer to Perplexity AI. But according to a newly released study by Columbia’s Tow Center for Digital Journalism, ChatGPT Search struggles with providing accurate answers to its users’ queries.

The researchers selected 20 publications from each of three categories: Those partnered with OpenAI to use their content in ChatGPT Search results, those involved in lawsuits against OpenAI, and unaffiliated publishers who have either allowed or blocked ChatGPT’s crawler.

Recommended Videos

“From each publisher, we selected 10 articles and extracted specific quotes,” the researchers wrote. “These quotes were chosen because, when entered into search engines like Google or Bing, they reliably returned the source article among the top three results. We then evaluated whether ChatGPT’s new search tool accurately identified the original source for each quote.”

Forty of the quotes were taken from publications that are currently using OpenAI and have not allowed their content to be scraped. But that didn’t stop ChatGPT Search from confidently hallucinating an answer anyway.

“In total, ChatGPT returned partially or entirely incorrect responses on a hundred and fifty-three occasions, though it only acknowledged an inability to accurately respond to a query seven times,” the study found. “Only in those seven outputs did the chatbot use qualifying words and phrases like ‘appears,’ ‘it’s possible,’ or ‘might,’ or statements like ‘I couldn’t locate the exact article.'”

ChatGPT Search’s cavalier attitude toward telling the truth could harm not just its own reputation but also the reputations of the publishers it cites. In one test during the study, the AI misattributed a Time story as being written by the Orlando Sentinel. In another, the AI didn’t link directly to a New York Times piece, but rather to a third-party website that had copied the news article wholesale.

OpenAI, unsurprisingly, argued that the study’s results were due to Columbia doing the tests wrong.

“Misattribution is hard to address without the data and methodology that the Tow Center withheld,” OpenAI told the Columbia Journalism Review in its defense, “and the study represents an atypical test of our product.”

The company promises to “keep enhancing search results.”

Topics

Andrew Tarantola

Computing Writer

Andrew Tarantola is a journalist with more than a decade reporting on emerging technologies ranging from robotics and machine…

Computing

ChatGPT Plus vs. Pro: Is it worth the upgrade?

ChatGPT and OpenAI logos.

OpenAI unveiled its new ChatGPT Pro subscription tier during the company's inaugural "12 Days of OpenAI" live-stream event. At a cost of $200 per month, the Pro tier costs 10 times as much as a standard, single-user Plus account.

In this guide, we'll discuss how much the various ChatGPT subscription tiers cost, as well as what features and benefits you receive at each level — all to help you decide which, if any, paid tier is right for you.
Pricing

Computing

You can now try out ChatGPT Search for free

The ChatGPT Search icon on the prompt window

As part of its "12 Days of OpenAI" event, OpenAI has yet another update for ChatGPT, this time bringing its Search feature over to the free tier. The Google Search alternative was previously only for paid subscribers in the ChatGPT Plus or Pro tiers.

"We rolled it out for paid users about two months ago," Kevin Weil, OpenAI's chief product officer, said during Monday's livestream. "I can't imagine ChatGPT without Search now. I use it so often. I'm so excited to bring it to all of you for free starting today."

Computing

ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

ChatGPT on a laptop

We're now into the third year of the AI boom, and industry leaders are showing no signs of slowing down, pushing out newer and (presumably) more capable models on a regular basis. ChatGPT, of course, remains the undisputed leader.

But with more than a half-dozen models available from OpenAI alone, figuring out which one to use for your specific project can be a daunting task.
GPT o1