Artificial intelligence keeps getting more intelligent.
Two years ago, the Google Brain team began using machine learning techniques to teach a computer how to interpret and caption images. Sure, it won’t win any humor contests for being punny or particularly clever, but if you’re looking for a literal description of what you’re looking at, Google’s AI system has you covered.
On Thursday, the internet giant announced that it had made “the latest version of our image captioning system available as an open source model in TensorFlow.” The most recent iteration of its AI “contains significant improvements to the computer vision component of the captioning system, is much faster to train, and produces more detailed and accurate descriptions compared to the original system,” Google said.
Called “Show and Tell,” the algorithm can recognize objects in imagery with an impressive 93.9 percent accuracy rate. That’s quite the improvement from just two years ago, when the AI was still scoring in the B-range, identifying images correctly just 89.6 percent of the time. So what’s changed? In essence, Google’s tool now tries to describe objects rather than simply classifying them.
“For example, an image classification model will tell you that a dog, grass and a Frisbee are in the image,” Google noted, “but a natural description should also tell you the color of the grass and how the dog relates to the Frisbee.”
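To make that distinction concrete, the sketch below outlines the kind of encoder-decoder architecture such a captioning system uses: a pretrained CNN summarizes the image into a feature vector, and a recurrent language model generates a description word by word, conditioned on those features. This is an illustrative sketch in TensorFlow’s Keras API, not Google’s released Show and Tell (im2txt) code; the vocabulary size, layer widths, and maximum caption length are placeholder assumptions.

```python
# Illustrative encoder-decoder captioning sketch (not Google's im2txt release).
# Assumptions: vocabulary size, embedding width, and caption length are placeholders.
import tensorflow as tf

VOCAB_SIZE = 10000    # assumed vocabulary size
EMBED_DIM = 512       # assumed embedding / hidden state size
MAX_CAPTION_LEN = 20  # assumed maximum caption length

# Vision component: a pretrained CNN turns the image into a single feature vector.
cnn = tf.keras.applications.InceptionV3(include_top=False, pooling="avg")
cnn.trainable = False  # keep the pretrained vision weights frozen in this sketch

image_input = tf.keras.Input(shape=(299, 299, 3))
image_features = tf.keras.layers.Dense(EMBED_DIM, activation="relu")(cnn(image_input))

# Language component: an LSTM generates the caption one word at a time,
# conditioned on the image by way of its initial state.
caption_input = tf.keras.Input(shape=(MAX_CAPTION_LEN,))
word_embeddings = tf.keras.layers.Embedding(VOCAB_SIZE, EMBED_DIM)(caption_input)
lstm_out = tf.keras.layers.LSTM(EMBED_DIM, return_sequences=True)(
    word_embeddings, initial_state=[image_features, image_features]
)
next_word_logits = tf.keras.layers.Dense(VOCAB_SIZE)(lstm_out)

model = tf.keras.Model([image_input, caption_input], next_word_logits)
model.compile(
    optimizer="adam",
    loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
)
```

The point of the design is the one Google describes: instead of a classifier that outputs a fixed set of labels (“dog,” “grass,” “Frisbee”), the decoder produces a full sentence, so relationships between objects can make it into the output.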
While you may not need Google to tell you what you’re looking at on a daily basis, these machine learning capabilities could be used to help those with visual impairments, and further the work of other AI researchers. “We hope that sharing this model in TensorFlow will help push forward image captioning research and applications, and will also allow interested people to learn and have fun,” Google said.
For a full description of Google’s latest algorithm, check out “Show and Tell: Lessons learned from the 2015 MSCOCO Image Captioning Challenge,” published in IEEE Transactions on Pattern Analysis and Machine Intelligence.