Impact of visual enhancement and color conversion algorithms on remote sound recovery from silent videos
The visual microphone is a technique for remote sound recovery that extracts sound information from tiny pixel-scale vibrations in a video. Despite having demonstrated success in sound recovery, the impact of various visual enhancement and color conversion algorithms applied on the video before the...
Saved in:
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Published: |
Wiley
2024
|
Subjects: | |
Online Access: | http://eprints.um.edu.my/45448/ https://doi.org/10.1002/jsid.1275 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | The visual microphone is a technique for remote sound recovery that extracts sound information from tiny pixel-scale vibrations in a video. Despite having demonstrated success in sound recovery, the impact of various visual enhancement and color conversion algorithms applied on the video before the sound recovery process has not been explored. Thus, it is important to investigate these effects have on the recovered sound quality, as the vibrations are so small the effects play an important role. This work experimented with different color to grayscale conversions and visual enhancement algorithms on 576 videos, and found that the recovered sound quality is indeed greatly affected by the choice of algorithms. The best conversion algorithms were found to be the average of the red, green and blue color channels and the perceptual lightness in the CIELAB color space, improving the recovered sound quality by up to 23.22%. Furthermore, visual enhancement techniques such as gamma correction have been found to corrupt vibration information, leading to a 22.47% drop in recovered sound quality in one of the tested videos. Therefore, it is advisable to avoid or minimize the use of visual enhancement techniques for remote sound recovery to prevent the elimination of useful subtle vibrations. Different color to grayscale conversion and visual enhancement algorithms were applied to high-speed videos before performing sound recovery using the visual microphone. It was found that color to grayscale conversion algorithms prioritizing the green color channel led to a better sound recovery, while visual enhancements degraded the quality of the recovered sound. image |
---|