Learning based fast H.264 to H.265 transcoding

Abstract

The newly proposed video coding standard, High Efficiency Video Coding (HEVC), has been widely accepted and adopted by industry and academia due to its better coding efficiency compared with H.264/AVC. While HEVC achieves an increase of about 40% in coding efficiency, its computational complexity has been increased significantly. Given this, a high performance AVC to HEVC transcoder is needed urgently. This paper introduces a learning based fast transcoding algorithm which can speed up the process of CU decision. The stream is first decoded by JM and then important features are extracted. Those features are used as inputs for a machine learning model and the specific CU depth will be obtained. In x265, we skip depths that are not selected and early pruning is used to terminate splitting in advance. The experimental results show that our proposed transcoding algorithm can save up to 41% coding speed compared with original x265 while the BD-BitRate drop 0.078dB on average. The algorithm achieves a good tradeoff between the performance and transcoding speed.

Publication
2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)
Li Song
Li Song
Professor, IEEE Senior Member

Professor, Doctoral Supervisor, the Deputy Director of the Institute of Image Communication and Network Engineering of Shanghai Jiao Tong University, the Double-Appointed Professor of the Institute of Artificial Intelligence and the Collaborative Innovation Center of Future Media Network, the Deputy Secretary-General of the China Video User Experience Alliance and head of the standards group.