VEViD: A vision enhancement algorithm based on physics

Physical interpretation of the VEViD algorithm, showing its impact in the spatial domain (top row) and in the spectral domain (bottom row). In the spatial domain, the real part of the image is nearly unchanged, whereas an imaginary part is created after diffraction. This observation supports the mathematical approximation in the latter part of the paper. Credit: eLight (2022). DOI: 10.1186/s43593-022-00034-y

In a new paper published in eLight, a team of scientists led by Professor Bahram Jalali and graduate student Callen MacPhee from UCLA has developed a new algorithm for performing computational imaging tasks. The paper, "VEViD: Vision Enhancement via Virtual diffraction and coherent Detection," presents a physics-based algorithm that corrects poor illumination and low contrast in images captured in low-light conditions.

In such conditions, images often incur undesirable visual qualities such as low contrast, feature loss, and poor signal-to-noise ratio. Low-light image enhancement aims to improve these qualities for two purposes: increased visual quality for human viewers and increased accuracy of computer vision algorithms. In the former, real-time processing can serve as a boon for convenient viewing. In the latter, it is a requirement for emerging applications such as autonomous vehicles and security, where image processing must be completed with low latency.

The paper shows that physical diffraction and coherent detection can be used as a toolbox for the transformation of digital images and videos. This approach leads to a new and surprisingly powerful algorithm for low-light and color enhancement.
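As a rough illustration of the idea, the sketch below treats a single brightness channel as a light field, applies a phase-only filter in the frequency domain (the "virtual diffraction" step), and then reads out the phase of the resulting complex field (the "coherent detection" step). It is a minimal sketch under stated assumptions, not the authors' implementation: the function name, the Gaussian shape of the phase profile, and the parameters `phase_strength`, `spectral_variance`, `bias`, and `gain` with their default values are all illustrative choices that this article does not specify.

```python
import numpy as np


def virtual_diffraction_enhance(v, phase_strength=0.2, spectral_variance=0.001,
                                bias=0.16, gain=1.4):
    """Enhance a 2-D brightness image with values in [0, 1].

    All parameter names and defaults are assumptions made for illustration.
    """
    v = np.asarray(v, dtype=np.float64)
    rows, cols = v.shape

    # Spatial-frequency grids in cycles per sample, in FFT ordering.
    ky = np.fft.fftfreq(rows)[:, None]
    kx = np.fft.fftfreq(cols)[None, :]

    # Assumed Gaussian low-pass profile for the spectral phase.
    kernel = np.exp(-(kx ** 2 + ky ** 2) / spectral_variance)

    # Virtual diffraction: multiply the spectrum by a phase-only term.
    spectrum = np.fft.fft2(v + bias)
    diffracted = np.fft.ifft2(spectrum * np.exp(-1j * phase_strength * kernel))

    # Coherent detection: read out a phase angle built from the new
    # imaginary part and the original brightness.
    phase = np.arctan2(gain * diffracted.imag, v)

    # Rescale the phase to [0, 1] so it can be used as the new brightness.
    span = phase.max() - phase.min()
    if span < 1e-12:
        return np.zeros_like(v)
    return (phase - phase.min()) / span


# Usage: a dark horizontal gradient with values between 0 and 0.3.
dark = np.tile(np.linspace(0.0, 0.3, 64), (32, 1))
enhanced = virtual_diffraction_enhance(dark)
```

For a colour image, one plausible way to use such a function is to convert to a hue/saturation/brightness representation, enhance only the brightness channel, and convert back; that workflow is likewise an assumption here.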

Unlike traditional algorithms that are mostly hand-crafted empirical rules, the VEViD algorithm emulates physical processes. In contrast to deep learning-based approaches, this technique is unique in having its roots in deterministic physics. The algorithm is interpretable and does not require labeled data for training. The authors explain that although the mapping to physical processes is not precise, it may be possible in the future to implement a physical device that executes the algorithm in the analog domain.

The paper demonstrates the high performance of VEViD in several imaging applications such as security cameras, night-time driving, and space exploration. Also demonstrated is VEViD's ability to perform color enhancement.

The algorithm's exceptional computational speed is demonstrated by processing 4K video at over 200 frames per second. Comparison with leading deep learning algorithms shows comparable or better image quality, with processing that is one to two orders of magnitude faster.

Deep neural networks have proven to be powerful tools for object detection and tracking, and they are the key to several emerging technologies that leverage autonomous machines. The authors show the utility of VEViD as a pre-processing tool that increases the accuracy of object detection by a popular neural network (YOLO).

Processing an image first with VEViD allows neural networks trained on daylight images to recognize objects in night-time environments without retraining, making these networks more robust while saving vast amounts of time and energy.

More information: Bahram Jalali et al, VEViD: Vision Enhancement via Virtual diffraction and coherent Detection, eLight (2022). DOI: 10.1186/s43593-022-00034-y

Citation: VEViD: A vision enhancement algorithm based on physics (2022, November 8) retrieved 10 December 2023 from
