
Apple is tackling one of the most frustrating aspects of AI today


As companies like Google, Anthropic, and OpenAI update and upgrade their AI models, the way that those LLMs interact with users is sure to change as well. However, getting used to the new system can become a hassle for users who then have to adjust how they pose their queries in order to get the results they’ve come to expect. An Apple research team has developed a new method to streamline that upgrade transition while reducing inconsistencies between the two versions by as much as 40%.

As part of their study, “MUSCLE: A Model Update Strategy for Compatible LLM Evolution,” published July 15, the researchers argue that when upgrading their models, developers tend to focus more on upping the overall performance, rather than making sure that the transition between models is seamless for the user. That includes making sure that negative flips, wherein the new model predicts the incorrect output for a test sample that was correctly predicted by the older model, are kept to a minimum.
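For a sense of what such a regression metric can look like in practice, here is a minimal sketch of a negative flip rate calculation over a shared test set. The function name, predictions, and labels are illustrative and are not taken from the paper.

```python
# Minimal sketch: the fraction of test samples the old model answered
# correctly that the new model now gets wrong. Names and data are
# illustrative, not the paper's exact formulation.

def negative_flip_rate(old_preds, new_preds, labels):
    """Share of samples correct under the old model but wrong under the new one."""
    assert len(old_preds) == len(new_preds) == len(labels)
    flips = sum(
        1
        for old, new, gold in zip(old_preds, new_preds, labels)
        if old == gold and new != gold
    )
    return flips / len(labels)

# Example: the old model answers two of four queries correctly;
# the new model regresses on one of those, so 25% of samples are negative flips.
old_preds = ["4", "7", "12", "9"]
new_preds = ["4", "7", "13", "9"]
labels    = ["4", "8", "12", "10"]
print(negative_flip_rate(old_preds, new_preds, labels))  # 0.25
```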

This is because, the study authors argue, each user has their own quirks, quibbles, and personalized ways of interacting with chatbots. Having to continually adjust and adapt the manner in which they interact with a model can become an exhausting affair — one that is antithetical to Apple’s desired user experience.

The research team even argues that the AI's incorrect predictions should stay consistent between versions. "There is value in being consistent when both models are incorrect," they wrote. "A user may have developed coping strategies on how to interact with a model when it is incorrect."

Apple presents MUSCLE: A Model Update Strategy for Compatible LLM Evolution

Large Language Models (LLMs) are frequently updated due to data or architecture changes to improve their performance. When updating models, developers often focus on increasing overall performance… pic.twitter.com/ATm2zM4Poc

— AK (@_akhaliq) July 15, 2024

To address this, the researchers first developed metrics to measure the degree of regression between models, then devised a strategy to minimize those regressions. The result is MUSCLE, a strategy that doesn't require developers to retrain the entire base model and instead relies on training adapters. Adapters are small AI modules that can be integrated at different points along the overall LLM.

Developers can then fine-tune these specific modules instead of the entire model. This enables the model as a whole to perform distinct tasks at a fraction of the training cost and with only a small increase in its number of parameters. Adapters are essentially plug-ins for large language models, letting developers fine-tune specific sections of the overall AI instead of the whole thing.
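To illustrate the general idea, here is a rough PyTorch sketch of a bottleneck-style adapter of the sort commonly used for parameter-efficient fine-tuning. It is a generic illustration, not the specific architecture or training recipe Apple's team describes.

```python
# Generic bottleneck adapter sketch (illustrative, not the MUSCLE paper's design).
import torch
import torch.nn as nn

class BottleneckAdapter(nn.Module):
    def __init__(self, hidden_size: int, bottleneck_size: int = 64):
        super().__init__()
        self.down = nn.Linear(hidden_size, bottleneck_size)  # project down
        self.act = nn.GELU()
        self.up = nn.Linear(bottleneck_size, hidden_size)    # project back up

    def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
        # Residual connection: the frozen base model's output passes through
        # unchanged, and the adapter only adds a small learned correction.
        return hidden_states + self.up(self.act(self.down(hidden_states)))

# Typical usage: freeze the base model and train only the adapter weights.
# for param in base_model.parameters():
#     param.requires_grad = False
# Each adapter adds roughly 2 * hidden_size * bottleneck_size parameters,
# a tiny fraction of a full LLM.
```

Because the base model stays frozen and only these small modules are updated, the training cost and parameter overhead stay small, which is the property described above.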

The research team upgraded LLMs including Meta’s Llama and Microsoft’s Phi as part of their study, using specific math queries as samples, and found that negative flips occurred as much as 60% of the time. By incorporating the MUSCLE strategy, the team wasn’t able to fully eliminate negative flips, but they did manage to reduce their occurrence by as much as 40% compared to the control.

Andrew Tarantola