Google's video AI was tricked into thinking a video about apes was about spaghetti

While artificial intelligence is an incredibly important field that’s growing by leaps and bounds, perhaps its most interesting lesson concerns just how incredible the human brain is at performing certain functions. While computers might be better at performing math and looking dozens of chess moves into the future, they can’t yet compete with the human brain at figuring out things like a video’s topic.

A recent research project demonstrated just that fact by feeding videos to Google’s Cloud Video Intelligence API and seeing if it could determine exactly what a given video was about. Apparently, this seemingly simple task is a challenge for Google’s AI, and the results point to the difficulty of creating automated systems to categorize video, as Motherboard reports.

The research team in question works at the University of Washington, and the team used some trickery to see how smart the Google API really is. Currently in beta, the Google Cloud Video Intelligence API has one job, which is to “make video searchable” and to annotate video to make it easier for humans to search through it.

In their tests, the researchers injected extraneous, and subliminal, images of a pasta bowl into a video featuring primatologist Jane Goodall and gorillas. The result was that the Google AI concluded that the video was actually about spaghetti and not the apes. Another example involved placing a picture of an Audi into a video about tigers, which caused the AI to conclude that the video was about cars.

Although it might sound somewhat comical, these mistakes point out a serious issue with the AI. As the researchers noted in their conclusion:

“However, we showed that the API has certain security weaknesses. Specifically, an adversary can insert an image, periodically and at a very low rate, into the video in a way that all the generated shot labels are about the inserted image. Such vulnerability seriously undermines the applicability of the API in adversarial environments.”

Even worse, the researchers added: “Furthermore, an adversary can bypass a video filtering system by inserting a benign image into a video with illegal contents.” Particularly disturbing is that doing so requires no specialized knowledge of the AI’s machine learning algorithms or of video annotation in general.
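
To make the quoted attack concrete, here is a minimal sketch of what inserting an image "periodically and at a very low rate" could look like. It is not the researchers' code. The function name `insert_periodically`, the use of plain strings as stand-ins for decoded video frames, and the period of 50 frames are all assumptions chosen for illustration.

```python
from typing import List, TypeVar

Frame = TypeVar("Frame")


def insert_periodically(frames: List[Frame], inserted: Frame, period: int) -> List[Frame]:
    """Return a copy of `frames` with `inserted` added after every `period` original frames.

    Hypothetical helper for illustration only; real frames would be image arrays.
    """
    if period < 1:
        raise ValueError("period must be at least 1")
    out: List[Frame] = []
    for index, frame in enumerate(frames, start=1):
        out.append(frame)
        # After every `period`-th original frame, splice in the extraneous image.
        if index % period == 0:
            out.append(inserted)
    return out


# Toy stand-in: strings instead of decoded frames, and an assumed period of 50.
video = ["ape"] * 100
tampered = insert_periodically(video, "pasta", period=50)
inserted_share = tampered.count("pasta") / len(tampered)
print(len(tampered), tampered.count("pasta"), f"{inserted_share:.1%}")
```

The point of the sketch is the ratio: the inserted image makes up only a small share of the frames a viewer sees, yet according to the researchers it was enough to dominate the labels the API produced.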

Ultimately, what the research points out is that AI has a long way to go before it can match the human brain in determining things like a video’s topic. Inserting subliminal messages into video has been known for a long time to affect the human psyche, but at least a human wouldn’t think that a video about apes is actually about spaghetti — the human would probably just start craving pasta instead.

Mark Coppock
Mark Coppock is a Freelance Writer at Digital Trends covering primarily laptop and other computing technologies. He has…