3D Convolutional Neural Networks for Audio-Visual Recognition

The essential problem, and the goal of this work, is to find the correspondence between the audio and visual streams. We propose a coupled 3D Convolutional Neural Network (CNN) architecture that …
