Atlanta, GA | Posted: July 10, 2018
A new machine learning technique developed at Georgia Tech may soon give budding fashionistas and other designers the freedom to create realistic, high-resolution visual content without relying on complicated 3-D rendering programs.
TextureGAN is the first deep image synthesis method that can realistically spread multiple textures across an object. With this new approach, users drag one or more texture patches onto a sketch — say of a handbag or a skirt — and the network texturizes the sketch to accurately account for 3-D surfaces and lighting.
Prior to this work, producing realistic images of this kind could be tedious and time-consuming, particularly for those with limited experience. And, according to the researchers, existing machine learning-based methods are not particularly good at generating high-resolution texture details.
“The ‘texture fill’ operation is difficult for a deep network to learn because it not only has to propagate the color, but also has to learn how to synthesize the structure of texture across 3-D shapes,” said Wenqi Xian, computer science (CS) major and co-lead developer.
The researchers initially trained a type of neural network called a conditional generative adversarial network (GAN) on sketches and textures extracted from thousands of ground-truth photographs. In this approach, a generator network creates images and a discriminator network judges whether each one looks real or generated. The goal is for both to get steadily better at their respective tasks, which pushes the generator toward more realistic outputs.
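For readers who want a concrete picture of that adversarial setup, the sketch below shows a generic conditional GAN training step in PyTorch. The architectures, tensor shapes, and loss terms are illustrative placeholders, not the actual TextureGAN networks.

```python
# Illustrative sketch of a conditional GAN training step, not the TextureGAN code.
# Architectures, shapes, and losses here are invented placeholders.
import torch
import torch.nn as nn
import torch.nn.functional as F

class Generator(nn.Module):
    """Maps a 1-channel sketch plus a 3-channel texture patch to an RGB image."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(4, 64, 3, padding=1), nn.ReLU(),
            nn.Conv2d(64, 3, 3, padding=1), nn.Tanh(),
        )

    def forward(self, sketch, texture):
        return self.net(torch.cat([sketch, texture], dim=1))

class Discriminator(nn.Module):
    """Outputs a logit for how real an image looks (high = real, low = generated)."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(3, 64, 3, stride=2, padding=1), nn.LeakyReLU(0.2),
            nn.Conv2d(64, 1, 3, stride=2, padding=1),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )

    def forward(self, image):
        return self.net(image)

G, D = Generator(), Discriminator()
opt_g = torch.optim.Adam(G.parameters(), lr=2e-4)
opt_d = torch.optim.Adam(D.parameters(), lr=2e-4)
bce = nn.BCEWithLogitsLoss()

# Dummy batch standing in for (sketch, texture patch, ground-truth photo) triples.
sketch = torch.randn(8, 1, 64, 64)
texture = torch.randn(8, 3, 64, 64)
real = torch.randn(8, 3, 64, 64)

# Discriminator step: learn to tell real photos from generated images.
fake = G(sketch, texture).detach()
loss_d = bce(D(real), torch.ones(8, 1)) + bce(D(fake), torch.zeros(8, 1))
opt_d.zero_grad(); loss_d.backward(); opt_d.step()

# Generator step: fool the discriminator while staying close to the ground truth.
fake = G(sketch, texture)
loss_g = bce(D(fake), torch.ones(8, 1)) + F.l1_loss(fake, real)
opt_g.zero_grad(); loss_g.backward(); opt_g.step()
```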
To ensure that the results look as realistic as possible, researchers fine-tuned the new system to minimize pixel-to-pixel style differences between generated images and training data. But the results were not quite what the team had expected.
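One standard way to penalize style differences of this kind, shown here only to illustrate the idea rather than as the paper's exact objective, is to compare Gram matrices of feature activations computed from the generated image and the ground-truth photo.

```python
# Illustrative only: a generic Gram-matrix style loss, not the paper's exact objective.
import torch

def gram_matrix(features):
    """Channel-to-channel correlations of a (batch, channels, H, W) feature map."""
    b, c, h, w = features.shape
    flat = features.view(b, c, h * w)
    return flat @ flat.transpose(1, 2) / (c * h * w)

def style_loss(generated_feats, target_feats):
    """Mean squared difference between the Gram matrices of two feature maps."""
    return torch.mean((gram_matrix(generated_feats) - gram_matrix(target_feats)) ** 2)

# Dummy activations standing in for features from a pretrained network.
fake_feats = torch.randn(8, 64, 32, 32)
real_feats = torch.randn(8, 64, 32, 32)
print(style_loss(fake_feats, real_feats).item())
```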
“We realized that we needed a stronger constraint to preserve high-level texture in our outputs,” said Georgia Tech CS Ph.D. student Patsorn Sangkloy. “That’s when we developed an additional discriminator network that we trained on a separate texture dataset. Its only job is to be presented with two samples and ask ‘are these the same or not?’”
With its sole focus on a single question, this type of discriminator is much harder to fool. This, in turn, leads the generator to produce images that are not only realistic, but also true to the texture patch the user placed onto the sketch.
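A minimal sketch of such a pair-based texture discriminator might look like the following; the layer sizes, patch shapes, and cropping strategy are assumptions made for illustration, not details from the paper.

```python
# Illustrative sketch of a "same texture or not?" discriminator.
# Architecture and shapes are invented for clarity, not taken from the paper.
import torch
import torch.nn as nn

class TexturePairDiscriminator(nn.Module):
    def __init__(self):
        super().__init__()
        # The two patches are stacked along the channel axis and judged jointly.
        self.net = nn.Sequential(
            nn.Conv2d(6, 64, 3, stride=2, padding=1), nn.LeakyReLU(0.2),
            nn.Conv2d(64, 128, 3, stride=2, padding=1), nn.LeakyReLU(0.2),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(128, 1),  # logit: same texture (high) vs. different (low)
        )

    def forward(self, patch_a, patch_b):
        return self.net(torch.cat([patch_a, patch_b], dim=1))

# Dummy usage: compare a patch cropped from the generated output against the
# texture patch the user dragged onto the sketch.
disc = TexturePairDiscriminator()
output_crop = torch.randn(8, 3, 32, 32)
input_patch = torch.randn(8, 3, 32, 32)
same_logit = disc(output_crop, input_patch)  # shape (8, 1)
```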
The work was presented in June at the 2018 Conference on Computer Vision and Pattern Recognition (CVPR) in Salt Lake City and is funded through National Science Foundation award 1561968. School of Interactive Computing Associate Professor James Hays advises Xian and Sangkloy. Georgia Tech is collaborating on this research with Adobe Research, the University of California, Berkeley, and Argo AI.