Cat and Mouse: Breaking the Perception-Distortion Trade-Off in Image Enhancement
Science · Thursday, August 24, 2023, 17:42 UTC
It’s one of the biggest cliches in crime and science fiction: An investigator pulls up a blurry photo on a computer screen and asks for it to be enhanced, and boom, the image comes into focus, revealing some essential clue. It’s a wonderful storytelling convenience, but it’s been a frustrating fiction for decades — blow up an image too much, and it becomes visibly pixelated. There isn’t enough data to do more.
"If you just naïvely upscale an image, it’s going to be blurry. There’s going to be a lot of detail, but it’s going to be wrong," said Bryan Catanzaro, vice president of applied deep learning research at Nvidia.
Recently, researchers and professionals have begun incorporating artificial intelligence algorithms into their image-enhancing tools, making the process easier and more powerful, but there are still limits to how much data can be retrieved from any image. Luckily, as researchers push enhancement algorithms ever further, they are finding new ways to cope with those limits — even, at times, finding ways to overcome them.
In the past decade, researchers started enhancing images with a new kind of AI model called a generative adversarial network, or GAN, which could produce detailed, impressive-looking pictures. "The images suddenly started looking a lot better," said Tomer Michaeli, an electrical engineer at the Technion in Israel. But he was surprised that images made by GANs showed high levels of distortion, a measure of how far an enhanced image strays from the underlying reality of what it shows. GANs produced images that looked pretty and natural, but they were actually making up, or "hallucinating," details that weren't accurate, and those fabrications registered as high levels of distortion.
Michaeli watched the field of photo restoration split into two distinct sub-communities. "One showed nice pictures, many made by GANs. The other showed data, but they didn't show many images, because they didn't look nice," he said.
In 2017, Michaeli and his graduate student Yochai Blau looked into this dichotomy more formally. They plotted the performance of various image-enhancement algorithms on a graph of distortion versus perceptual quality, using a known measure for perceptual quality that correlates well with humans’ subjective judgment. As Michaeli expected, some of the algorithms resulted in very high visual quality, while others were very accurate, with low distortion. But none had both advantages; you had to pick one or the other. The researchers dubbed this the perception-distortion trade-off.
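As a rough illustration of that kind of plot, here is a sketch under assumed metrics, not the ones Blau and Michaeli actually used: each algorithm's output is scored twice, once with a full-reference distortion measure (mean squared error against the ground-truth image) and once with a deliberately toy, hypothetical no-reference perceptual score, then placed on a scatter plot.

```python
# Sketch of placing enhancement algorithms on a perception-distortion plane.
# Both metrics below are illustrative stand-ins, not the measures from the 2017 paper.
import numpy as np
import matplotlib.pyplot as plt

def distortion(ground_truth, restored):
    """Full-reference distortion: mean squared error (lower = closer to the real image)."""
    return float(np.mean((ground_truth - restored) ** 2))

def perceptual_score(restored):
    """Hypothetical no-reference score (lower = looks more natural).
    Real measures are statistical or learned models of human judgments; this toy
    version only checks whether local contrast is as high as in the reference texture."""
    local_contrast = np.abs(np.diff(restored, axis=0)).mean()
    return float(abs(local_contrast - 1 / 3))  # ~1/3 is the local contrast of the toy "real" image

def box_blur(img, passes=5):
    """Crude blur by repeated neighbor averaging (keeps the sketch dependency-free)."""
    out = img.copy()
    for _ in range(passes):
        out = 0.25 * (np.roll(out, 1, 0) + np.roll(out, -1, 0)
                      + np.roll(out, 1, 1) + np.roll(out, -1, 1))
    return out

rng = np.random.default_rng(1)
truth = rng.random((64, 64))  # toy "real" image: a high-frequency texture

# Pretend outputs from two different enhancement algorithms.
outputs = {
    "low distortion, looks blurry": box_blur(truth),
    "GAN-like: looks natural, hallucinated detail": rng.random((64, 64)),
}

for name, img in outputs.items():
    plt.scatter(distortion(truth, img), perceptual_score(img), label=name)
plt.xlabel("distortion (MSE vs. ground truth)")
plt.ylabel("perceptual score (lower = more natural)")
plt.legend()
plt.show()
```

In this toy setup the blurred output sits near the low-distortion end but scores poorly on naturalness, while the hallucinated one looks statistically natural but lands far from the ground truth, which is the shape of the trade-off the paper describes.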
Michaeli also challenged other researchers to come up with algorithms that could produce the best perceptual quality for a given level of distortion, allowing fair comparisons between the pretty-picture algorithms and the nice-stats ones. Since then, hundreds of AI researchers have reported the distortion and perceptual quality of their algorithms, citing the Michaeli and Blau paper that described the trade-off.
Sometimes, the implications of the perception-distortion trade-off aren't dire. Nvidia, for instance, found that high-definition screens weren't rendering some lower-definition visual content nicely, so in 2017 its researchers developed an image-enhancement algorithm that went beyond the perception-distortion trade-off, blurring the image in areas where that was beneficial.