Skip to main content

Qualcomm wants to add these crazy AI tools to your Android phone

Qualcomm Snapdragon 8 Gen 3 media asset.
Qualcomm

At Mobile World Congress 2024, Qualcomm is adding more to its portfolio of AI-on-phone tricks facilitated by the Snapdragon series silicon for Android phones.  The chipmaker has already showcased some impressive AI capabilities for the Snapdragon 8 Gen 3 flagship, such as voice-activated media editing, on-device image generation using Stable Diffusion, and a smarter virtual assistant built atop large language models from the likes of Meta.

Today, the company is adding more grunt to those AI superpowers. The first is the ability to run a Large Language and Vision Assistant (LLaVa) on a smartphone. Think of it as a chatbot like ChatGPT that has been granted Google Lens abilities. As such, Qualcomm’s solution can not only accept text input, but also process images.

Recommended Videos

For example, you can push an image depicting a charcuterie board and ask questions based on it. The AI assistant, based on a large multimodal model (LMM) that can process over 7 billion parameters, will then tell you all the kinds of fruits, cheeses, meats, and nuts on the board depicted in the input image seen below.

Qualcomm’s demo of smarter AI assistant on phone.
Qualcomm

It can also handle follow-on queries, so you can conduct a flowing back-and-forth conversation. Now, the likes of ChatGPT have also gained multiple-modal capabilities, which means OpenAI’s tool can also process image inputs. However, there’s a crucial difference.

Products like ChatGPT and Copilot are still very much tethered to a cloud-based architecture, meaning your data is handled on remote servers. Qualcomm’s push is in the direction of on-device processing. Everything happens on your phone, which means the whole process is faster, and there is little risk of privacy intrusion.

“This LMM runs at a responsive token rate on device, which results in enhanced privacy, reliability, personalization, and costs,” says Qualcomm. Whether Qualcomm’s promised LLaVa-based virtual assistant will arrive as a standalone app or if it will carry a fee is yet to be officially confirmed.

The next announcement from Qualcomm dives into the creative domain of image generation and manipulation. Not too long ago, Qualcomm demoed the world’s fastest text-to-image generation on a phone using Stable Diffusion tech. Today, the company is giving a first glimpse of LoRA-driven image generation.

Qualcomm showcase of AI image generation on phone.
Qualcomm

LoRA takes a different approach to image generation than a regular generative AI tool such as Dall.E. LoRA, short for Low-Rank Adaptation, is a technique developed by Microsoft. Training an AI model can be quite cost-prohibitive, high on latency, and particularly demanding from a hardware perspective.

What LoRA does is it dramatically reduces the model weight, a goal that is achieved by only focusing on specific segments of the model and reducing the number of parameters for training purposes. In doing so, the memory requirements go down, the process becomes faster, and the amount of time and effort it takes to adapt a text-to-image model also drops dramatically.

Over time, the LoRA distillation technique has been applied to the Stable Diffusion model for generating images from text prompts. Owing to the gains in efficiency and the easier adaptability of LoRA-based models, it is seen as a tailor-made route for smartphones. Qualcomm certainly thinks so, and even rival MediaTek has embraced the same solution for generative AI tricks on its flagship Dimensity 9300 chip.

Qualcomm is also showcasing a few other AI tricks at MWC 2024, some of which have already appeared on the Samsung Galaxy S24 Ultra. Among them is the ability to expand the canvas of an image using generative AI fill and AI-powered video generation. The latter is quite ambitious, especially after seeing what OpenAI has accomplished with Sora. It would be interesting to see how Qualcomm manages to port it over to smartphones.

Nadeem Sarwar
Nadeem is a tech journalist who started reading about cool smartphone tech out of curiosity and soon started writing…
This smartphone camera sensor could make blurry photos a thing of the past
Metavision camera sensor tech.

Google Pixel 8 (left) and OnePlus 12 Andy Boxall / Digital Trends

Paris-based Prophesee made waves last year when it showcased its in-house event-based Metavision sensor tech for smartphone cameras. The core idea behind the stack was to make blurry images a thing of the past, demonstrating some impressive results during the development phase.

Read more
Your next phone could get a huge 5G upgrade, thanks to AI
Qualcomm Snapdragon X80 Modem-RF chip.

It’s that time of year again when Qualcomm ushers in its next generation of 5G modem technology. Announced at Mobile World Congress (MWC ) 2024, this year’s Snapdragon X80 5G Modem-RF system is the successor to last year’s Snapdragon X75, and it builds on the 5G Advanced foundation laid last year with more raw power and new AI features.

While the Snapdragon X75 moved the needle by adding support for the latest 5G Advanced standards, we’re still in that fourth phase of 5G technology, otherwise known as 3GPP Release 18 — and most carrier networks are still catching up. So, with no new standards to embrace, Qualcomm has focused on improving the inside of the Snapdragon X80 to take even fuller advantage of these cutting-edge 5G technologies.
The magic of AI-powered 5G

Read more
The Amazon app on your phone just got a cool AI feature
Rufus AI chatbot in Amazon app.

Last year, Amazon CEO Andy Jassy said that every business division at the company was experimenting with AI. Today, Amazon has announced its most ambitious AI product yet: a chatbot named Rufus to assist with your online shopping.
Imagine ChatGPT, but one that knows every detail about all the products in Amazon’s vast catalog. Plus, it is also connected to the web, which means it can pull information from the internet to answer your questions. For example, if you plan to buy a microSD card, Rufus can tell you which speed class is the best for your photography needs.
Amazon says you can type all your questions in the search box, and Rufus will handle the rest. The generative AI chatbot is trained on “product catalog, customer reviews, community Q&As, and information from across the web.”
In a nutshell, Amazon wants to decouple the hassle of looking up articles on the web before you make up your mind and then arrive on Amazon to put an item in your cart. Another benefit of Rufus is that instead of reading through a product page for a certain tiny detail, you can ask the question directly and get the appropriate responses.

An AI nudge to informed shopping

Read more