AI-generated videos have arrived, and they’re evolving fast

In a few short months, the world of generative AI has evolved from chatbots like ChatGPT to photorealistic images created by Midjourney. If you thought things might slow down any time soon, think again: AI-generated videos may be about to have their big moment in the spotlight.

As highlighted by a tweet from Philipp Tsipman, the founder of a marketing platform for such videos, there have been five new AI video generators launched in just the last seven days. The quality varies, but it’s clear that the technology is moving at a relentless pace. So, let’s break down the five AI video generators, which range from janky to seriously impressive.

If you’re having trouble keeping up with Video AI😅, there have been 5 state-of-the-art generative video models released *in last 7 days*: 🤯😎🧵 pic.twitter.com/cki9SMfozr

— Philipp Tsipman (@ptsi) March 26, 2023

Runway

The wait is over.

Gen-1 is now available at https://t.co/ekldoIshdw pic.twitter.com/Wm2YVOvm26

— Runway (@runwayml) March 27, 2023

Runway has been leading the pack in publicly available AI editing. Masking, motion-tracking, super slow-motion, and color grading all benefit from AI assistance. This is a subscription-based service, so it has extra incentive to push the limits.

In February, Runway started testing generative video with its Gen-1 update, which lets you restyle a video with words. A video of a walk down the street becomes an animated short that appears to be made with hand-sculpted and posed Claymation characters.

Runway released Gen-1 publicly today and is already testing Gen-2, which eliminates the need for a reference video. Soon, Runway will let you create a video with a simple text prompt.

Generate videos with nothing but words. If you can say it, now you can see it.

Introducing, Text to Video. With Gen-2.

Learn more at https://t.co/PsJh664G0Q pic.twitter.com/6qEgcZ9QV4

— Runway (@runwayml) March 20, 2023

Picsart

Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators

abs: https://t.co/5xCsj4PNRj
github: https://t.co/BdSzlepGQG pic.twitter.com/XY4piH6j4v

— AK (@_akhaliq) March 24, 2023

You might recognize Picsart as a free photo editor, but it also offers video editing. Text2Video-Zero is a Picsart research tool that uses the Stable Diffusion AI image generator to create multiple frames with enough in common to look cohesive.

This is significant progress, as early experiments at assembling AI images into video frames produced chaotic scenes where every detail constantly shifted. The effect was eye-catching, but it distracted from the content.
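
To make the idea of frame-to-frame cohesion concrete, here is a minimal Python sketch that scores how much a clip changes from one frame to the next. It is not how Text2Video-Zero works or how anyone in this roundup measures quality; the function name, the flat grayscale frame format, and the two toy clips are all assumptions made up for illustration.

```python
def mean_frame_difference(frames):
    """Average absolute per-pixel change between consecutive frames.

    `frames` is a list of equally sized frames, each a flat list of
    grayscale pixel values in the range 0-255 (an assumed toy format).
    """
    if len(frames) < 2:
        return 0.0
    total = 0.0
    for prev, curr in zip(frames, frames[1:]):
        # Mean absolute change across all pixels for this pair of frames.
        total += sum(abs(a - b) for a, b in zip(prev, curr)) / len(curr)
    return total / (len(frames) - 1)


# Hypothetical toy clips: four frames of four pixels each.
steady_clip = [[10, 20, 30, 40], [11, 20, 31, 40], [12, 21, 31, 41], [12, 22, 32, 41]]
chaotic_clip = [[10, 20, 30, 40], [200, 5, 90, 140], [30, 180, 10, 250], [120, 60, 220, 15]]

steady_score = mean_frame_difference(steady_clip)
chaotic_score = mean_frame_difference(chaotic_clip)
print(f"steady: {steady_score:.2f}, chaotic: {chaotic_score:.2f}")
```

A lower score means neighboring frames have more in common. A crude number like this cannot tell deliberate motion from flicker, so treat it as a rough illustration rather than a real quality metric.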

Video-P2P

Video-P2P is an open-source project similar to Runway Gen-1. It takes video input and uses Stable Diffusion to style it with text prompts. The examples shown look good, with steady backgrounds and consistency in the altered elements.

Video-P2P: Video Editing with Cross-attention Control@Gradio demo is out on @huggingface

demo: https://t.co/dHHoiLh18v
github: https://t.co/vbq1MqwjXO pic.twitter.com/EYBCr9TkbK

— AK (@_akhaliq) March 20, 2023

TemporalNet

TemporalNet is based on ControlNet, an add-on for the Stable Diffusion AI image generator that allows more control over the poses of figures that appear in pictures.

#TemporalNet has been published!
You can access the model to give it a try yourself here:https://t.co/0tdfmGxsVH
Curious to see how far people can push it ^^#ControlNet #AIart #stablediffusion pic.twitter.com/hd0e3MEFhG

— CiaraRowles (@CiaraRowles1) March 24, 2023

The videos have some flicker, but the results are promising. ControlNet took a big step toward producing predictable scenes, and TemporalNet might provide a nice alternative method of creating videos.

Text-to-video

Text-to-video is an AI model created by Alibaba and released through its ModelScope platform. The model clearly needs more work, but having more options to explore is always a good thing.

fast & longer text-to-video with 🧨 diffusers

you maybe saw fun junky text-to-video from the ModelScope's research model lately

with diffusers you can control how long the video is – and fit it on smol VRAM GPUs, including free colab. Try out here:https://t.co/uPVd9hEOtr pic.twitter.com/mIFkGnGT5V

— apolinario 🌐 (@multimodalart) March 22, 2023

Fast-paced innovation

The rapid pace of AI innovation is phenomenal. Just months ago, we were teased with glimpses of the potential of similar videos created in AI labs. In 2022, one of the first public AI video generators simply superimposed tiny AI images over stock footage.

Those “old school” methods, from a few months ago, are laughable when you see how far AI has progressed in such a short amount of time.

Alan Truly
Alan Truly is a Writer at Digital Trends, covering computers, laptops, hardware, software, and accessories that stand out as…