Click-and-drag AI image editing could change everything

The latest development in artificial intelligence is a tool that allows you to edit an already-generated image to your specifications.

If you want to “change the dimensions of a car or manipulate a smile into a frown with a simple click and drag,” you can do so with a model called DragGAN.

Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold

paper page: https://t.co/Gjcm1smqfl pic.twitter.com/XHQIiMdYOA

— AK (@_akhaliq) May 19, 2023

DragGAN, which is built on a generative adversarial network (GAN), currently exists only as a research paper. Even so, its demos have drawn so much attention that the research team’s homepage has crashed under the heavy traffic.

The Verge compared DragGAN to the Warp tool in Photoshop, adding that it is much more powerful since it doesn’t “smush pixels around,” but rather “re-generates the underlying object,” and can even rotate 3D images.

The potential of such a tool lies in the fact that text-to-image generative AI doesn’t always output what you want. With DragGAN, you could go back afterward and edit an existing image instead of having to generate a new one.

Some demos included in the research paper show adding height to a mountain, changing the positioning of a model and editing the length and shape of her clothes, opening or closing a lion’s mouth, and changing a person’s face from a plain look to a smile. By contrast, many AI tools currently available require users to regenerate an image with a more specific prompt to get a better result.

The research team noted in its paper that the model can add new details when it regenerates the edited parts of an image, where those details benefit the edit. “Our approach can hallucinate occluded content, like the teeth inside a lion’s mouth, and can deform following the object’s rigidity, like the bending of a horse leg.”

Many brands are attempting to offer editing options for generative AI content. However, most stop short of letting you edit the image itself and instead let you edit around it. For example, Microsoft’s Designer app lets you generate AI images from a text prompt and select your favorite from three results. You can then take it to the design studio to create a range of creative and productivity projects, such as social media posts, invitations, digital postcards, or graphics, with the image as the focal point. However, you cannot edit the AI-generated image itself.

Because DragGAN is still only a demo, there is no telling how good a publicly available version would be, or whether one is even possible, especially since the demos are based on low-resolution videos. Still, it is an interesting example of how quickly AI continues to develop.

Fionna Agomuoh
Fionna Agomuoh is a Computing Writer at Digital Trends. She covers a range of topics in the computing space, including…