Frame interpolation aims to generate intermediate frames from existing ones, a task made challenging by complex video scenes and motion. Standard methods first estimate motion between two consecutive frames and then synthesize new frames. In this paper, we propose a novel frame interpolation method based on deep voxel flow (DVF), a video synthesis approach. In DVF, a deep convolutional encoder-decoder predicts a 3D voxel flow, and a volume sampling layer then synthesizes the intermediate frame guided by that flow. To improve the accuracy of the voxel flow, we employ recurrent convolutional layers (RCL) in the encoder-decoder module to refine the flow step by step; we call this model DVF-RCL. We also incorporate a perceptual loss to increase visual quality. Experiments demonstrate that our method greatly improves on the original DVF and produces results that compare favorably to state-of-the-art methods both quantitatively and qualitatively.
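To make the flow-guided synthesis step concrete, the following is a minimal NumPy sketch of volume sampling for the middle frame at t = 0.5: each frame is warped along the predicted per-pixel flow and the two warped frames are blended with a per-pixel weight. All function names here are illustrative, and the sketch simplifies the full trilinear spatiotemporal sampling to bilinear spatial sampling plus a temporal blend on a single-channel image; it is not the authors' implementation.

```python
import numpy as np

def bilinear_sample(img, ys, xs):
    """Sample a 2D image at float coordinates with bilinear
    interpolation, clamping out-of-range coordinates to the border."""
    h, w = img.shape
    ys = np.clip(ys, 0, h - 1)
    xs = np.clip(xs, 0, w - 1)
    y0 = np.floor(ys).astype(int)
    x0 = np.floor(xs).astype(int)
    y1 = np.minimum(y0 + 1, h - 1)
    x1 = np.minimum(x0 + 1, w - 1)
    wy = ys - y0
    wx = xs - x0
    top = img[y0, x0] * (1 - wx) + img[y0, x1] * wx
    bot = img[y1, x0] * (1 - wx) + img[y1, x1] * wx
    return top * (1 - wy) + bot * wy

def synthesize_middle_frame(frame0, frame1, flow_dy, flow_dx, blend):
    """Synthesize the frame at t = 0.5: warp frame0 backward and
    frame1 forward along the flow, then blend the two samples."""
    h, w = frame0.shape
    ys, xs = np.mgrid[0:h, 0:w].astype(float)
    s0 = bilinear_sample(frame0, ys - flow_dy, xs - flow_dx)
    s1 = bilinear_sample(frame1, ys + flow_dy, xs + flow_dx)
    return blend * s0 + (1 - blend) * s1
```

In the full model, `flow_dy`, `flow_dx`, and `blend` are the outputs of the encoder-decoder network, and because the sampling is differentiable, reconstruction and perceptual losses on the synthesized frame can be backpropagated through it to train the flow predictor end to end.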