June 26, 2016
End-to-End Voxel-to-Voxel Prediction
Conference on Computer Vision and Pattern Recognition (CVPR)
Over the last few years deep learning methods have emerged as one of the most prominent approaches for video analysis with most successful applications having been in the area of video classification and detection. In this paper we challenge these views by presenting a deep 3D convolutional architecture trained end to end to perform voxel-level prediction, i.e., to output a variable at every voxel of the video.
By: Du Tran, Lubomir Bourdev, Rob Fergus, Lorenzo Torresani, Manohar Paluri