Skip to main content

GPT-5 to take AI forward in these two important ways

Breaking Down Barriers to AI Innovation with Reid Hoffman & Kevin Scott

We could soon see generative AI systems capable of passing Ph.D. exams thanks to more “durable” memory and more robust reasoning operations, Microsoft CTO Kevin Scott revealed when he took to the stage with Reid Hoffman during a Berggruen Salon in Los Angeles earlier this week.

Recommended Videos

“It’s sort of weird right now that you have these interactions with agents and the memory is entirely episodic,” he lamented. “You have a transaction, you do a thing. It’s useful or not for whatever task you were doing, and then it forgets all about it.” The AI system isn’t learning from or even remembering previous interactions with the user, he continued. “There’s no way for you to refer back to a thing you were trying to get [the AI] to solve in the past.”

However, Scott is optimistic that,”we’re seeing technically all of the things fall in place to have really durable memories with the systems.” With more persistent memory, future AI systems will be able to respond more naturally and more accurately over the span of multiple conversations rather than being limited to the current session.

OpenAI announced in February that it was beginning to test a new persistent memory system, rolling it out to select free and Plus subscription users. Enabling the feature allows the AI to recall user tone, voice, and format preferences between conversations as well as make suggestions in new projects based on details the user mentioned in previous chats.

Scott was also buoyant about improving the “fragility” found in the reasoning of many AI systems today. “It can’t solve very complicated math problems,” he explained. “It has to bail out to other systems to do very complicated things.”

“Reasoning, I think, gets a lot better,” he continued. He compares GPT-4 and the current generation of models to high schoolers passing their AP exams. However, the next generation of AIs “could be the thing that could pass your qualified exam.”

To date, generative AI systems have outperformed their flesh-and-blood counterparts on a variety of exam and task formats. Last November, for example, GPT-4 passed the Multistate Professional Responsibility Exam (MPRE), better known as the bar exam, with 76% correct — that’s six points higher than the nation average for humans.

Scott was quick to point out, however, that training generative AIs to pass Ph.D. exams “probably sounds like a bigger deal than it actually is… the real test will be what we choose to do with it.”

Scott was especially excited to see the barriers to entry falling away so quickly. He noted that when he got into machine learning two decades ago, his work required graduate-level knowledge, stacks upon stacks of “very daunting, complicated, technical papers to figure out how to do what I wanted to do,” and around six months of coding. That same task today, he said, “a high school student could do in a Saturday morning.”

These lowered barriers to entry will likely accelerate the democratization of AI, Scott concluded. Finding solutions to the myriad social, environmental, and technological crises facing humanity are not — and cannot — be the sole responsibility of “just the people at tech companies in Silicon Valley or just people who graduated with Ph.D.s from top-five universities,” he said. “We have 8 billion people in the world who also have some idea about what it is that they want to do with powerful tools, if they just have access to them.”

Andrew Tarantola
Andrew Tarantola is a journalist with more than a decade reporting on emerging technologies ranging from robotics and machine…
There’s a new way to use ChatGPT on your iPhone. Here’s how it works
Someone holding the iPhone 16 Pro with its display on.

There is a new way to access ChatGPT on Apple's iPhone and iPad. As reported by MacRumors, the latest version of the ChatGPT app makes it even easier to access the app's SearchGPT feature.

ChatGPT, a sophisticated AI chatbot developed by OpenAI, utilizes an ever-growing dataset to answer questions, write stories, summarize factual topics, translate languages, and create creative content. It is available on Apple devices through the ChatGPT app, and it is expected to be integrated into Siri in a future version of Apple Intelligence.

Read more
Anthropic Claude: How to use the impressive ChatGPT rival
a screenshot of Claude 3.5 sonnet with the Artifacts side screen

Though it may not capture as many headlines as its rivals from Google, Microsoft, and OpenAI do, Anthropic's Claude is no less powerful than its frontier model peers.

In fact, the latest version, Claude 3.5 Sonnet, has proven more than a match for Gemini and ChatGPT across a number of industry benchmarks. In this guide, you'll learn what Claude is, what it can do best, and how you can get the most out of using this quietly capable chatbot.
What is Claude?
Like Gemini, Copilot, and ChatGPT, Claude is a large language model (LLM) that relies on algorithms to predict the next word in a sentence based on its enormous corpus of training material.

Read more
This massive upgrade to ChatGPT is coming in January — and it’s not GPT-5
ChatGPT on a laptop

OpenAI is set to launch a new AI agent in January, code-named Operator, that will enable ChatGPT to take action on the user's behalf. You may never have to book your own flights ever again.

The company's leadership made the announcement during a staff meeting Wednesday, reports Bloomberg. The company plans to roll out the new feature as a research preview through the company’s developer API.

Read more