Skip to main content

Here’s how Claude 3.5 Sonnet and GPT-4o stack up in a direct comparison

In the ever-growing large language model (LLMs) landscape, two front-runners stand out from the rest of the race: Anthropic’s Claude 3.5 Sonnet and OpenAI’s GPT-4o (the “o” stands for “Omni”). Both AIs boast impressive capabilities, but which reigns supreme? This guide dives deep into Claude 3.5 Sonnet and GPT-4o, dissecting their strengths and weaknesses across various tasks.

We’ll not only explore their relative accuracy and response speeds, but also unpack their pricing structures and service tiers. By the end of this article, you’ll be armed with the knowledge to make an informed decision about which LLM best suits your specific needs and budget.

Recommended Videos

Pricing and tiers

ChatGPT users can access the GPT-4o model for free, but the number of queries you’re allowed to ask it within a three-hour span is limited. Exceeding that limit will move you down to using GPT-3.5, OpenAI’s older, less capable model, until the cooldown timer has reset.

User’s will need to pay for the $20-per-month ChatGPT Plus subscription, a $30-per-month Teams subscription, or a market rate Enterprise sub if they want to use GPT-4o to their hearts’ content. Paying for access not only drastically increases the rate cap for using the AI model, but it also grants access to additional features like Dall-E image generation.

a screenshot of the Claude pricing tiers
Anthropic

Access to Claude is set up in much the same way. At the free tier, users can converse with the chatbot on either the web or through the iOS app. They cal also upload images and documents to the Anthropic server and query the AI about their contents, as well as enjoy limited use of the new Claude 3.5 Sonnet model.

Paying $20 a month for a Pro account gets you everything from the free tier, as well as higher usage limits, access to both Claude 3 Opus and Haiku, priority bandwidth and availability, and the ability to create Projects with the AI centered on a set of documents or files. Paying $30 a month per person (minimum 5 people) for a Teams account nets you even higher usage limits and the ability to share chats between teammates.

Advantages of using Claude

While Claude may not enjoy the same degree of name recognition as GPT-4o, Anthropic’s latest AI model holds a number of advantages over its rival — and not just in performance benchmarks. For one, Claude boasts a significantly larger context window than ChatGPT, (200,000 characters versus 128,000).

This means it can remember and analyze a much larger chunk of previous conversation or text, leading to more nuanced and relevant responses, especially across longer interactions. Claude is also trained to prioritize accuracy over creativity in its responses.

It may struggle a bit in generating amusing haiku, but Claude excels at summarizing research papers and generating reports based on uploaded data. Claude also has a more recent knowledge cutoff date than GPT-4o: April 2024 versus October 2023.

a screenshot of Claude 3.5 sonnet with the Artifacts side screen
Anthropic

In addition, Anthropic trained its Claude models as Constitutional AIs. This means that the system implements a set of principles to make judgments about its generated outputs, effectively guiding the model toward the sort of ethical behavior dictated by the constitution. That includes avoiding toxic or discriminatory outputs, as well as avoiding describing or advocating for illegal or unethical activities.

Advantages of using GPT-4o

GPT-4o, on the other hand, offers users a wider variety of functions, a larger knowledge base, and more flexible log-in options than Claude.

Unlike Anthropic’s offering, ChatGPT can generate images (thoughthat feature requires a subscription) and access the web to find the most up-to-date information available, not just the data it was trained on. And contrary to the sign-up process with Claude, GPT-4o users won’t have to provide their phone number for verification, or really any credentials at all, in order to start using the model And, thanks to its larger knowledge base, GPT-4o is more adept at generating creative content across images, text, and audio than Claude.

The ChatGPT website also tends to overload with user requests at a far lower rate than Claude’s.

Speed and accuracy comparison

Both models are capable of generating a wide variety of text outputs, from refining user-written prose to creating computer code and spreadsheet algorithms. When asked to generate a “piece of Python code that will measure the area of a circle and generate a graph of the function,” both returned similarly structured snippets. GPT-4o produced the following:

import numpy as np
import matplotlib.pyplot as plt

# Function to calculate the area of a circle
def calculate_area(radius):
return np.pi * radius ** 2

# Generate a range of radius values
radii = np.linspace(0, 10, 100) # radii from 0 to 10 with 100 points

# Calculate the corresponding area values
areas = calculate_area(radii)

# Plotting the function
plt.figure(figsize=(10, 6))
plt.plot(radii, areas, label=’Area = πr²’)
plt.title(‘Area of a Circle as a Function of Radius’)
plt.xlabel(‘Radius’)
plt.ylabel(‘Area’)
plt.legend()
plt.grid(True)
plt.show()

Meanwhile, Claude’s response was a bit more fleshed out:

import math
import numpy as np
import matplotlib.pyplot as plt

def circle_area(radius):
return math.pi * radius**2

# Generate radii from 0 to 10 with 100 points
radii = np.linspace(0, 10, 100)

# Calculate areas for each radius
areas = [circle_area(r) for r in radii]

# Create the plot
plt.figure(figsize=(10, 6))
plt.plot(radii, areas, ‘b-‘, linewidth=2)
plt.title(‘Area of a Circle as a Function of Radius’)
plt.xlabel(‘Radius’)
plt.ylabel(‘Area’)
plt.grid(True)

# Add some interesting points
plt.plot(0, circle_area(0), ‘ro’, label=’r = 0′)
plt.plot(1, circle_area(1), ‘ro’, label=’r = 1′)
plt.plot(5, circle_area(5), ‘ro’, label=’r = 5′)
plt.plot(10, circle_area(10), ‘ro’, label=’r = 10′)

plt.legend()
plt.show()

In terms of image identification, the two models are closely matched. When asked to identify the various aspects of the following image, both GPT-4o and Claude 3.5 were easily able to find and describe each of the 21 items present.

an array of 3D generated images made by Meta 3D Gen
Meta

GPT-4o has the advantage of delivering information about recent events, given its ability to search the web for news that’s more recent than its training data. Asking Claude “what happened in Luxembourg yesterday” only gets you a note reading: “I apologize, but I don’t have access to real-time news or information about specific events that occurred yesterday in Luxembourg. My knowledge cutoff is in April 2024, and I don’t have information about events after that date.” The AI does offer a number of helpful recommendations as to where you might be able to find the information you requested.

I did find comparing the two systems to be a bit of a challenge, mostly because I was only able to make a half-dozen requests of Claude before the system locked me out for three hours. Even on the free tier of ChatGPT, I won’t be frozen out of the system entirely — I’ll just have to converse with a slightly inferior model for a while.

Which is better?

Ultimately, the best choice depends on your specific needs. If you prioritize factual accuracy, long document processing, and have a tight budget, Claude might be a better fit.

However, if you need a wider range of creative functionalities and are willing to pay for additional features, ChatGPT is still the better option.

Andrew Tarantola
Andrew Tarantola is a journalist with more than a decade reporting on emerging technologies ranging from robotics and machine…
Here’s why people are claiming GPT-4 just got way better
A person sits in front of a laptop. On the laptop screen is the home page for OpenAI's ChatGPT artificial intelligence chatbot.

It appears that OpenAI is busy playing cleanup with its GPT language models after accusations that GPT-4 has been getting "lazy," "dumb," and has been experiencing errors outside of the norm for the ChatGPT chatbot circulated social media in late November.

Some are even speculating that GPT-4.5 has secretly been rolled out to some users, based on some responses from ChatGPT itself. Regardless of whether or not that's true, there's definitely been some positive internal changes over the past behind GPT-4.
More GPUs, better performance?
Posts started rolling in as early as last Thursday that noticed the improvement in GPT-4's performance. Wharton Professor Ethan Mollick, who previously commented on the sharp downturn in GPT-4 performance in November, has also noted a revitalization in the model, without seeing any proof of a switch to GPT-4.5 for himself. Consistently using a code interpreter to fix his code, he described the change as "night and day, for both speed and answer quality" after experiencing ChatGPT-4 being "unreliable and a little dull for weeks."

Read more
Here’s why people are saying GPT-4 is getting ‘lazy’
An OpenAI graphic for ChatGPT-4.

OpenAI and its technologies have been in the midst of scandal for most of November. Between the swift firing and rehiring of CEO Sam Altman and the curious case of the halted ChatGPT Plus paid subscriptions, OpenAI has kept the artificial intelligence industry in the news for weeks.

Now, AI enthusiasts have rehashed an issue that has many wondering whether GPT-4 is getting "lazier" as the language model continues to be trained. Many who use it speed up more intensive tasks have taken to X (formerly Twitter) to air their grievances about the perceived changes.

Read more
GPT-4.5 news: Everything we know so far about the next-generation language model
ChatGPT app running on an iPhone.

OpenAI's GPT-4 language model is considered by most to be the most advanced language model used to power modern artificial intelligences (AI). It's used in the ChatGPT chatbot to great effect, and other AIs in similar ways. But that's not the end of its development. As with GPT-3.5, a GPT-4.5 language model may well launch before we see a true next-generation GPT-5.

Here's everything we know about GPT-4.5 so far.

Read more