SJTU Media Lab
SJTU Media Lab
News
People
Publications
Teaching
Resources
Jobs
Light
Dark
Automatic
Publications
Type
1
2
Journal article
Book section
Conference paper
Patent
Standard
Thesis
Date
2024
2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2000
1999
Hengsheng Zhang
,
Xueyi Zou
,
Guo Lu
,
Li Chen
,
Li Song
,
Wenjun Zhang
(2024).
EffiHDR: An Efficient Framework for HDRTV Reconstruction and Enhancement in UHD Systems
.
IEEE Transactions on Broadcasting
.
Cite
DOI
Shuai Guo
,
Jingchuan Hu
,
Kai Zhou
,
Jionghao Wang
,
Li Song
,
Rong Xie
,
Wenjun Zhang
(2024).
Real-Time Free Viewpoint Video Synthesis System Based on DIBR and a Depth Estimation Network
.
IEEE Transactions on Multimedia
.
Cite
DOI
Bingcong Lu
,
Keyu Wang
,
Jun Xu
,
Rong Xie
,
Li Song
,
Wenjun Zhang
(2024).
Pioneer: Offline Reinforcement Learning Based Bandwidth Estimation for Real-Time Communication
.
Proceedings of the 15th ACM Multimedia Systems Conference, Mmsys 2024, Bari, Italy, April 15-18, 2024
.
Cite
DOI
Haitao Huang
,
Rongli Jia
,
Rong Xie
,
Li Song
,
Lin Li
,
Yanan Feng
(2024).
No-Reference Quality Assessment of Text-to-Image Generation
.
IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, BMSB 2024, Toronto, on, Canada, June 19-21, 2024
.
Cite
DOI
Yiwei Zhang
,
Guo Lu
,
Yunuo Chen
,
Shen Wang
,
Yibo Shi
,
Jing Wang
,
Li Song
(2024).
Neural Rate Control for Learned Video Compression
.
The Twelfth International Conference on Learning Representations, ICLR 2024, Vienna, Austria, May 7-11, 2024
.
Cite
Han Wang
,
Xinning Chai
,
Yiwen Wang
,
Yuhong Zhang
,
Rong Xie
,
Li Song
(2024).
Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior
.
CoRR
.
Cite
DOI
Yuhong Zhang
,
Hengsheng Zhang
,
Xinning Chai
,
Rong Xie
,
Li Song
,
Wenjun Zhang
(2024).
MRIR: Integrating Multimodal Insights for Diffusion-Based Realistic Image Restoration
.
CoRR
.
Cite
DOI
Zihan Zheng
,
Houqiang Zhong
,
Qiang Hu
,
Xiaoyun Zhang
,
Li Song
,
Ya Zhang
,
Yanfeng Wang
(2024).
JointRF: End-to-end Joint Optimization for Dynamic Neural Radiance Field Representation and Compression
.
CoRR
.
Cite
DOI
Zihan Zheng
,
Houqiang Zhong
,
Qiang Hu
,
Xiaoyun Zhang
,
Li Song
,
Ya Zhang
,
Yanfeng Wang
(2024).
HPC: Hierarchical Progressive Coding Framework for Volumetric Video
.
CoRR
.
Cite
DOI
Hengsheng Zhang
,
Xinning Chai
,
Yuhong Zhang
,
Rong Xie
,
Li Song
(2024).
Hdrtvformer: Efficient Sdrtv-to-Hdrtv via Affine Transformation and Spatial-Aware Transformer
.
IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2024, Seoul, Republic of Korea, April 14-19, 2024
.
Cite
DOI
Zhiyu Zhang
,
Guo Lu
,
Huanxiong Liang
,
Anni Tang
,
Qiang Hu
,
Li Song
(2024).
Efficient Dynamic-NeRF Based Volumetric Video Coding with Rate Distortion Optimization
.
CoRR
.
Cite
DOI
Yuhong Zhang
,
Hengsheng Zhang
,
Xinning Chai
,
Zhengxue Cheng
,
Rong Xie
,
Li Song
,
Wenjun Zhang
(2024).
Diff-Restorer: Unleashing Visual Prompts for Diffusion-Based Universal Image Restoration
.
CoRR
.
Cite
DOI
Shuai Guo
,
Qiuwen Wang
,
Yijie Gao
,
Rong Xie
,
Li Song
(2024).
Depth-Guided Robust and Fast Point Cloud Fusion NeRF for Sparse Input Views
.
Thirty-Eighth AAAI Conference on Artificial Intelligence, AAAI 2024, Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence, IAAI 2024, Fourteenth Symposium on Educational Advances in Artificial Intelligence, EAAI 2014, February 20-27, 2024, Vancouver, Canada
.
Cite
DOI
Junyi Lu
,
Bingcong Lu
,
Jun Xu
,
Li Song
,
Wenjun Zhang
(2024).
A Priority Aware Free Viewpoint Video Transmit Scheme Based on QUIC
.
IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, BMSB 2024, Toronto, on, Canada, June 19-21, 2024
.
Cite
DOI
Jun Ling
,
Xu Tan
,
Liyang Chen
,
Runnan Li
,
Yuchao Zhang
,
Sheng Zhao
,
Li Song
(2023).
StableFace: Analyzing and Improving Motion Stability for Talking Face Generation
.
IEEE J. Sel. Top. Signal Process.
.
Cite
DOI
Wenpei Yin
,
Bingcong Lu
,
Yan Zhao
,
Jun Xu
,
Li Song
,
Wenjun Zhang
(2023).
SAFR: A Real-Time Communication System with Adaptive Frame Rate
.
Proceedings of the 1st International Workshop on Networked AI Systems, NetAISys 2023, Helsinki, Finland, 18 June 2023
.
Cite
DOI
Rongli Jia
,
Yuhong Zhang
,
Jun Xu
,
Wenjun Zhang
,
Li Song
,
Lin Li
,
Yanan Feng
(2023).
Quality of Experience Assessment for Free-Viewpoint Video
.
IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, BMSB 2023, Beijing, China, June 14-16, 2023
.
Cite
DOI
Yan Huang
,
Jun Xu
,
Chen Zhu
,
Li Song
,
Wenjun Zhang
(2023).
Precise Encoding Complexity Control for Versatile Video Coding
.
IEEE Transactions on Broadcasting
.
Cite
DOI
Haoqiang Ren
,
Luheng Jia
,
Yifan Zang
,
Li Song
,
Jun Ma
,
Hua Chen
(2023).
Perceptual Video Coding Based on Spatial Masking for Medical Video Communication
.
Proceedings of the 2023 12th International Conference on Computing and Pattern Recognition, ICCPR 2023, Qingdao, China, October 27-29, 2023
.
Cite
DOI
Feng Peng
,
Bingcong Lu
,
Li Song
,
Rong Xie
,
Yanmei Liu
,
Ying Chen
(2023).
PACC: Perception Aware Congestion Control for Real-Time Communication
.
IEEE International Conference on Multimedia and Expo, ICME 2023, Brisbane, Australia, July 10-14, 2023
.
Cite
DOI
Xibei Liu
,
Xinning Chai
,
Hengsheng Zhang
,
Rong Xie
,
Xiao Gu
,
Li Song
,
Liean Cao
(2023).
Old-Photo Restoration with Detail- and Structure-Enhanced Cascaded Learning
.
IEEE International Conference on Multimedia and Expo Workshops, ICMEW Workshops 2023, Brisbane, Australia, July 10-14, 2023
.
Cite
DOI
Qiuwen Wang
,
Shuai Guo
,
Haoning Wu
,
Rong Xie
,
Li Song
,
Wenjun Zhang
(2023).
NeRF-SDP: Efficient Generalizable Neural Radiance Field with Scene Depth Perception
.
ACM Multimedia Asia 2023, Mmasia 2023, Tainan, Taiwan, December 6-8, 2023
.
Cite
DOI
Hengsheng Zhang
,
Li Song
,
Wenyao Gan
,
Rong Xie
(2023).
Multi-Scale-Based Joint Super-Resolution and Inverse Tone-Mapping with Data Synthesis for UHD HDR Video
.
Displays
.
Cite
DOI
Chen Li
,
Li Song
,
Rong Xie
,
Wenjun Zhang
(2023).
Local Bidirection Recurrent Network for Efficient Video Deblurring with the Fused Temporal Merge Module
.
ACM Trans. Multim. Comput. Commun. Appl.
.
Cite
DOI
Yanjun Wang
,
Qingping Sun
,
Wenjia Wang
,
Jun Ling
,
Zhongang Cai
,
Rong Xie
,
Li Song
(2023).
Learning Dense UV Completion for Human Mesh Recovery
.
CoRR
.
Cite
DOI
Wenhong Duan
,
Zheng Chang
,
Chuanmin Jia
,
Shanshe Wang
,
Siwei Ma
,
Li Song
,
Wen Gao
(2023).
Learned Image Compression Using Cross-Component Attention Mechanism
.
IEEE Transactions on Image Processing
.
Cite
DOI
Zhiyu Zhang
,
Anni Tang
,
Chen Zhu
,
Guo Lu
,
Rong Xie
,
Li Song
(2023).
High-Fidelity Free-View Talking Head Synthesis for Low-Bandwidth Video Conference
.
IEEE International Conference on Visual Communications and Image Processing, VCIP 2023, Jeju, Republic of Korea, December 4-7, 2023
.
Cite
DOI
Han Xue
,
Jun Ling
,
Anni Tang
,
Li Song
,
Rong Xie
,
Wenjun Zhang
(2023).
High-Fidelity Face Reenactment via Identity-Matched Correspondence Learning
.
ACM Trans. Multim. Comput. Commun. Appl.
.
Cite
DOI
Han Xue
,
Zhiwu Huang
,
Qianru Sun
,
Li Song
,
Wenjun Zhang
(2023).
Freestyle Layout-to-Image Synthesis
.
IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada, June 17-24, 2023
.
Cite
DOI
Jionghao Wang
,
Shuai Guo
,
Qiuwen Wang
,
Rong Xie
,
Li Song
(2023).
Efficient Human Rendering with Geometric and Semantic Priors
.
IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, BMSB 2023, Beijing, China, June 14-16, 2023
.
Cite
DOI
Yuhong Zhang
,
Hengsheng Zhang
,
Li Song
,
Rong Xie
,
Wenjun Zhang
(2023).
Dual-Head Fusion Network for Image Enhancement
.
IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP 2023, Rhodes Island, Greece, June 4-10, 2023
.
Cite
DOI
Yunqian Wen
,
Bo Liu
,
Jingyi Cao
,
Rong Xie
,
Li Song
(2023).
Divide and Conquer: A Two-Step Method for High Quality Face de-Identification with Model Explainability
.
IEEE/CVF International Conference on Computer Vision, ICCV 2023, Paris, France, October 1-6, 2023
.
Cite
DOI
Chen Li
,
Li Song
,
Shuai Chen
,
Rong Xie
,
Wenjun Zhang
(2023).
Deep Online Video Stabilization Using IMU Sensors
.
IEEE Trans. Multim.
.
Cite
DOI
Yiwei Zhang
,
Guo Lu
,
Donghui Feng
,
Chen Zhu
,
Li Song
(2023).
Content Adaptive Checkerboard Context Model for Learned Image Compression
.
IEEE International Symposium on Circuits and Systems, ISCAS 2023, Monterey, CA, USA, May 21-25, 2023
.
Cite
DOI
Yurong Zhang
,
Liulei Li
,
Wenguan Wang
,
Rong Xie
,
Li Song
,
Wenjun Zhang
(2023).
Boosting Video Object Segmentation via Space-Time Correspondence Learning
.
IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada, June 17-24, 2023
.
Cite
DOI
Jingyi Cao
,
Bo Liu
,
Yunqian Wen
,
Rong Xie
,
Li Song
(2023).
Achieving Privacy-Preserving Multi-View Consistency with Advanced 3D-aware Face de-Identification
.
ACM Multimedia Asia 2023, Mmasia 2023, Tainan, Taiwan, December 6-8, 2023
.
Cite
DOI
Jionghao Wang
,
Ziyu Chen
,
Jun Ling
,
Rong Xie
,
Li Song
(2023).
360-Degree Panorama Generation from Few Unregistered NFoV Images
.
Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, Ottawa, on, Canada, 29 October 2023- 3 November 2023
.
Cite
DOI
Chen Zhu
,
Jun Xu
,
Donghui Feng
,
Rong Xie
,
Li Song
(2022).
Edge-Based Video Compression Texture Synthesis Using Generative Adversarial Network
.
IEEE Transactions on Circuits and Systems for Video Technology
.
Cite
DOI
Zhiyong Chen
,
Haoyu Zhu
,
Li Song
,
Dazhi He
,
Bin Xia
(2022).
Wireless Multiplayer Interactive Virtual Reality Game Systems with Edge Computing: Modeling and Optimization
.
IEEE Trans. Wirel. Commun.
.
Cite
DOI
Yu Dong
,
Li Song
,
Rong Xie
,
Wenjun Zhang
(2022).
Ultra-Low Latency, Stable, and Scalable Video Transmission for Free-Viewpoint Video Services
.
IEEE Transactions on Broadcasting
.
Cite
DOI
Jun Ling
,
Xu Tan
,
Liyang Chen
,
Runnan Li
,
Yuchao Zhang
,
Sheng Zhao
,
Li Song
(2022).
StableFace: Analyzing and Improving Motion Stability for Talking Face Generation
.
CoRR
.
Cite
DOI
Kai Zhou
,
Shuai Guo
,
Jingchuan Hu
,
Jionghao Wang
,
Qiuwen Wang
,
Li Song
(2022).
RGBD-based Real-Time Volumetric Reconstruction System: Architecture Design and Implementation
.
IEEE International Conference on Visual Communications and Image Processing, VCIP 2022, Suzhou, China, December 13 - 16, 2022
.
Cite
DOI
Haoyong Li
,
Bingcong Lu
,
Jun Xu
,
Li Song
,
Wenjun Zhang
,
Lin Li
,
Yaoyao Yin
(2022).
Reinforcement Learning Based Cross-Layer Congestion Control for Real-Time Communication
.
IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, BMSB 2022, Bilbao, Spain, June 15-17, 2022
.
Cite
DOI
Han Wang
,
Jun Tang
,
Xiaodong Liu
,
Shanyan Guan
,
Rong Xie
,
Li Song
(2022).
PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object Detection
.
CoRR
.
Cite
DOI
Han Wang
,
Jun Tang
,
Xiaodong Liu
,
Shanyan Guan
,
Rong Xie
,
Li Song
(2022).
PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer towards Video Object Detection
.
Computer Vision - ECCV 2022 - 17th European Conference, Tel Aviv, Israel, October 23-27, 2022, Proceedings, Part VIII
.
Cite
DOI
Donghui Feng
,
Chen Zhu
,
Guo Lu
,
Li Song
(2022).
Position-Based Motion Vector Prediction for Textual Image Coding
.
Picture Coding Symposium, PCS 2022, San Jose, CA, USA, December 7-9, 2022
.
Cite
DOI
Chen Zhu
,
Guo Lu
,
Rong Xie
,
Li Song
(2022).
Perceptual Video Coding Based on Semantic-Guided Texture Detection and Synthesis
.
Picture Coding Symposium, PCS 2022, San Jose, CA, USA, December 7-9, 2022
.
Cite
DOI
Shuai Guo
,
Li Song
,
Rong Xie
,
Lin Li
,
Shenglan Liu
(2022).
Multiview Nonlinear Discriminant Structure Learning for Emotion Recognition
.
Knowl. Based Syst.
.
Cite
DOI
Chen Li
,
Li Song
,
Xueyi Zou
,
Jiaming Guo
,
Youliang Yan
,
Wenjun Zhang
(2022).
Multi-Scale Coarse-to-Fine Transformer for Frame Interpolation
.
MM ‘22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10 - 14, 2022
.
Cite
DOI
Xinning Chai
,
Xibei Liu
,
Hengsheng Zhang
,
Han Wang
,
Li Song
,
Liean Cao
(2022).
MLS-GAN: Multi-Level Semantic Guided Image Colorization
.
2022 IEEE International Conference on Image Processing, ICIP 2022, Bordeaux, France, 16-19 October 2022
.
Cite
DOI
Anni Tang
,
Tianyu He
,
Xu Tan
,
Jun Ling
,
Runnan Li
,
Sheng Zhao
,
Li Song
,
Jiang Bian
(2022).
Memories Are One-to-Many Mapping Alleviators in Talking Face Generation
.
CoRR
.
Cite
DOI
Shen Wang
,
Yibing Fu
,
Chen Zhu
,
Li Song
,
Wenjun Zhang
(2022).
Low-Complexity Multi-Model CNN in-Loop Filter for AVS3
.
IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022
.
Cite
DOI
Shen Wang
,
Yibing Fu
,
Chen Zhu
,
Li Song
,
Wenjun Zhang
(2022).
Low-Complexity Multi-Model CNN in-Loop Filter for AVS3
.
IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022
.
Cite
DOI
Chen Li
,
Li Song
,
Rong Xie
,
Wenjun Zhang
(2022).
L0 Structure-Prior Assisted Blur-Intensity Aware Efficient Video Deblurring
.
Neurocomputing
.
Cite
DOI
Yan Huang
,
Jizheng Xu
,
Li Zhang
,
Yan Zhao
,
Li Song
(2022).
Intra Encoding Complexity Control with a Time-Cost Model for Versatile Video Coding
.
IEEE International Symposium on Circuits and Systems, ISCAS 2022, Austin, TX, USA, May 27 - June 1, 2022
.
Cite
DOI
Yunqian Wen
,
Bo Liu
,
Jingyi Cao
,
Rong Xie
,
Li Song
,
Zhu Li
(2022).
IdentityMask: Deep Motion Flow Guided Reversible Face Video de-Identification
.
IEEE Transactions on Circuits and Systems for Video Technology
.
Cite
DOI
Yunqian Wen
,
Bo Liu
,
Ming Ding
,
Rong Xie
,
Li Song
(2022).
IdentityDP: Differential Private Identification Protection for Face Images
.
Neurocomputing
.
Cite
DOI
Yunqian Wen
,
Bo Liu
,
Ming Ding
,
Rong Xie
,
Li Song
(2022).
IdentityDP: Differential Private Identification Protection for Face Images
.
Neurocomputing
.
Cite
DOI
Jingyi Cao
,
Bo Liu
,
Yunqian Wen
,
Yunhui Zhu
,
Rong Xie
,
Li Song
,
Lin Li
,
Yaoyao Yin
(2022).
Hiding among Your Neighbors: Face Image Privacy Protection with Differential Private k-Anonymity
.
IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, BMSB 2022, Bilbao, Spain, June 15-17, 2022
.
Cite
DOI
Anni Tang
,
Yan Huang
,
Jun Ling
,
Zhiyu Zhang
,
Yiwei Zhang
,
Rong Xie
,
Li Song
(2022).
Generative Compression for Face Video: A Hybrid Scheme
.
IEEE International Conference on Multimedia and Expo, ICME 2022, Taipei, Taiwan, July 18-22, 2022
.
Cite
DOI
Anni Tang
,
Yan Huang
,
Jun Ling
,
Zhiyu Zhang
,
Yiwei Zhang
,
Rong Xie
,
Li Song
(2022).
Generative Compression for Face Video: A Hybrid Scheme
.
IEEE International Conference on Multimedia and Expo, ICME 2022, Taipei, Taiwan, July 18-22, 2022
.
Cite
DOI
Hongcheng Zhong
,
Jun Xu
,
Chen Zhu
,
Donghui Feng
,
Li Song
(2022).
Complexity-Oriented Per-Shot Video Coding Optimization
.
IEEE International Conference on Multimedia and Expo, ICME 2022, Taipei, Taiwan, July 18-22, 2022
.
Cite
DOI
Hongcheng Zhong
,
Jun Xu
,
Chen Zhu
,
Donghui Feng
,
Li Song
(2022).
Complexity-Oriented per-Shot Video Coding Optimization
.
IEEE International Conference on Multimedia and Expo, ICME 2022, Taipei, Taiwan, July 18-22, 2022
.
Cite
DOI
Jun Xu
,
Guoqing Wu
,
Chen Zhu
,
Yan Huang
,
Li Song
(2022).
CNN-based Fast CU Partitioning Algorithm for VVC Intra Coding
.
2022 IEEE International Conference on Image Processing, ICIP 2022, Bordeaux, France, 16-19 October 2022
.
Cite
DOI
Yibing Fu
,
Shen Wang
,
Chen Zhu
,
Li Song
,
Wenjun Zhang
(2022).
An Attention Based CNN with Temporal Hierarchical Deployment for AVS3 Inter In-Loop Filtering
.
IEEE International Symposium on Circuits and Systems, ISCAS 2022, Austin, TX, USA, May 27 - June 1, 2022
.
Cite
DOI
Shuai Guo
,
Kai Zhou
,
Jingchuan Hu
,
Jionghao Wang
,
Jun Xu
,
Li Song
(2022).
A New Free Viewpoint Video Dataset and DIBR Benchmark
.
MMSys ‘22: 13th ACM Multimedia Systems Conference, Athlone, Ireland, June 14 - 17, 2022
.
Cite
DOI
Jingchuan Hu
,
Shuai Guo
,
Yu Dong
,
Kai Zhou
,
Jun Xu
,
Li Song
(2022).
A Multi-User Oriented Live Free-Viewpoint Video Streaming System Based on View Interpolation
.
IEEE International Conference on Multimedia and Expo, ICME 2022, Taipei, Taiwan, July 18-22, 2022
.
Cite
DOI
Han Wang
,
Xiaojun Zhou
,
Qinyu Xu
,
Huaqiang Ren
,
Rong Xie
,
Li Song
(2022).
A Large-Scale Sports Tracking Dataset and Progressive Re-Detection Based Sports Tracking
.
IEEE International Conference on Visual Communications and Image Processing, VCIP 2022, Suzhou, China, December 13 - 16, 2022
.
Cite
DOI
Hengsheng Zhang
,
Xueyi Zou
,
Jiaming Guo
,
Youliang Yan
,
Rong Xie
,
Li Song
(2022).
A Codec Information Assisted Framework for Efficient Compressed Video Super-Resolution
.
Computer Vision - ECCV 2022 - 17th European Conference, Tel Aviv, Israel, October 23-27, 2022, Proceedings, Part XVII
.
Cite
DOI
Yanying Sun
,
Joseph Ma
,
Qi Shen
,
Li Song
(2022).
A Cloud-Based Free View Solution
.
24th IEEE International Workshop on Multimedia Signal Processing, MMSP 2022, Shanghai, China, September 26-28, 2022
.
Cite
DOI
Anni Tang
,
Han Xue
,
Jun Ling
,
Rong Xie
,
Li Song
(2021).
Dense 3D Coordinate Code Prior Guidance for High-Fidelity Face Swapping and Face Reenactment
.
IEEE International Conference on Automatic Face and Gesture Recognition 2021 (FG2021)
.
PDF
Cite
Yan Huang
,
Li Song
,
Rong Xie
,
Ebroul Izquierdo
,
Wenjun Zhang
(2021).
Modeling Acceleration Properties for Flexible INTRA HEVC Complexity Control
.
IEEE Transactions on Circuits and Systems for Video Technology
.
Cite
DOI
Jinjin Chen
,
Wenyao Gan
,
Hengsheng Zhang
,
Rong Xie
,
Li Song
,
Min Chen
(2021).
Video Enhancement Based on Unpaired Learning
.
2021 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)
.
Cite
DOI
Jinghao Yuan
,
Bingcong Lu
,
Mingyue Hao
,
Xiaoyong Liu
,
Li Song
,
Wenjun Zhang
(2021).
SpaAbr: Size Prediction Assisted Adaptive Bitrate Algorithm for Scalable Video Coding Contents
.
2021 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)
.
Cite
DOI
Geyingjie Wen
,
Zhen Yang
,
Chen Li
,
Rong Xie
,
Li Song
,
Weiyong Cai
(2021).
3D-BitNet: Flow-Agnostic and Precise Network for video Bit-Depth Expansion
.
2021 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)
.
Cite
DOI
Dahu Feng
,
Yan Huang
,
Yiwei Zhang
,
Jun Ling
,
Anni Tang
,
Li Song
(2021).
A Generative Compression Framework For Low Bandwidth Video Conference
.
2021 IEEE International Conference on Multimedia Expo Workshops (ICMEW)
.
Cite
Code
DOI
Chen Zhu
,
Yan Huang
,
Rong Xie
,
Li Song
(2021).
HEVC VMAF-oriented Perceptual Rate Distortion Optimization using CNN
.
2021 Picture Coding Symposium (PCS)
.
Cite
DOI
Yi Fang
,
Jiapeng Tang
,
Wang Shen
,
Wei Shen
,
Xiao Gu
,
Li Song
,
Guangtao Zhai
(2021).
Dual Attention Guided Gaze Target Detection in the Wild
.
2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
.
PDF
Cite
DOI
Guoqing Wu
,
Yan Huang
,
Chen Zhu
,
Li Song
,
Wenjun Zhang
(2021).
SVM Based Fast CU Partitioning Algorithm for VVC Intra Coding
.
2021 IEEE International Symposium on Circuits and Systems (ISCAS)
.
Cite
DOI
Haoyu Zhu
,
Yingjiao Li
,
Zhiyong Chen
,
Li Song
(2021).
Mobile Edge Resource Optimization for Multiplayer Interactive Virtual Reality Game
.
2021 IEEE Wireless Communications and Networking Conference (WCNC)
.
Cite
DOI
Yunqian Wen
,
Li Song
,
Bo Liu
,
Ming Ding
,
Rong Xie
(2021).
IdentityDP: Differential Private Identification Protection for Face Images
.
arXiv:2103.01745 [cs]
.
PDF
Cite
Bo Liu
,
Ming Ding
,
Hanyu Xue
,
Tianqing Zhu
,
Dayong Ye
,
Li Song
,
Wanlei Zhou
(2021).
DP-Image: Differential Privacy for Image Data in Feature Space
.
arXiv:2103.07073 [cs]
.
Cite
Zhengyi Luo
,
Chen Zhu
,
Yan Huang
,
Rong Xie
,
Li Song
,
C.-C. Jay Kuo
(2021).
VMAF Oriented Perceptual Coding Based on Piecewise Metric Coupling
.
IEEE Transactions on Image Processing
.
Cite
DOI
Yuzhuo Wei
,
Li Chen
,
Li Song
(2021).
Video Compression Based on Jointly Learned Down-Sampling and Super-Resolution Networks
.
International Conference on Visual Communications and Image Processing, VCIP 2021, Munich, Germany, December 5-8, 2021
.
Cite
DOI
Jun Ling
,
Han Xue
,
Li Song
,
Rong Xie
,
Xiao Gu
(2021).
Region-Aware Adaptive Instance Normalization for Image Harmonization
.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition
.
PDF
Cite
Jingyi Cao
,
Bo Liu
,
Yunqian Wen
,
Rong Xie
,
Li Song
(2021).
Personalized and Invertible Face De-Identification by Disentangled Identity Information Manipulation
.
PDF
Cite
Jingwen Xu
,
Yu Dong
,
Li Song
,
Rong Xie
,
Sixin Lin
,
Yaqing Li
(2021).
Learning a No Reference Quality Assessment Metric for Encoded 4K-UHD Video
.
Digital TV and Wireless Multimedia Communication
.
Cite
DOI
Donghui Feng
,
Yiwei Zhang
,
Chen Zhu
,
Han Zhang
,
Li Song
(2021).
DVRCNN: Dark Video Post-processing Method for VVC
.
MultiMedia Modeling
.
Cite
DOI
Yunqian Wen
,
Bo Liu
,
Rong Xie
,
Jingyi Cao
,
Li Song
(2021).
Deep Motion Flow Aided Face Video De-identification
.
International Conference on Visual Communications and Image Processing, VCIP 2021, Munich, Germany, December 5-8, 2021
.
Cite
DOI
Shuhui Yang
,
Han Xue
,
Jun Ling
,
Li Song
,
Rong Xie
(2021).
Deep Face Swapping via Cross-Identity Adversarial Training
.
MultiMedia Modeling
.
Cite
DOI
Han Zhang
,
Li Song
,
Yan Huang
,
Rong Xie
(2021).
Current Frame Priors Assisted Neural Network for Intra Prediction
.
IEEE Access
.
Cite
DOI
Weijia Huang
,
Bingcong Lu
,
Haoyong Li
,
Rong Xie
,
Li Song
,
Wenjun Zhang
(2021).
Configurable Low Delay Congestion Control Scheme for Cellular Networks
.
IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, BMSB 2021, Chengdu, China, August 4-6, 2021
.
Cite
DOI
Mingyue Hao
,
Jinghao Yuan
,
Bingcong Lu
,
Li Song
,
Rong Xie
,
Wenjun Zhang
(2021).
Buffer Displacement Based Online Learning Algorithm For Low Latency HTTP Adaptive Streaming
.
IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, BMSB 2021, Chengdu, China, August 4-6, 2021
.
Cite
DOI
Jiapeng Tang
,
Yi Fang
,
Yu Dong
,
Rong Xie
,
Xiao Gu
,
Guangtao Zhai
,
Li Song
(2021).
Blindly Predict Image and Video Quality in the Wild
.
MMAsia ‘21: ACM Multimedia Asia, Gold Coast, Australia, December 1 - 3, 2021
.
Cite
DOI
Yu Dong
,
Li Song
,
Rong Xie
,
Wenjun Zhang
(2021).
An Elastic System Architecture for Edge Based Low Latency Interactive Video Applications
.
IEEE Transactions on Broadcasting
.
Cite
DOI
Yu Dong
,
Li Song
,
Rong Xie
,
Wenjun Zhang
(2020).
Real-time UHD video super-resolution and transcoding on heterogeneous hardware
.
Journal of Real-Time Image Processing
.
PDF
Cite
DOI
Zhiming Zhou
,
Yu Dong
,
Li Song
,
Rong Xie
,
Lin Li
,
Bing Zhou
(2020).
Quality of Experience Evaluation for Streaming Video Using CGNN
.
2020 IEEE International Conference on Visual Communications and Image Processing (VCIP)
.
Cite
DOI
Jiapeng Tang
,
Yu Dong
,
Rong Xie
,
Xiao Gu
,
Li Song
,
Lin Li
,
Bing Zhou
(2020).
Deep Blind Video Quality Assessment for User Generated Videos
.
2020 IEEE International Conference on Visual Communications and Image Processing (VCIP)
.
Cite
DOI
Yunqian Wen
,
Bo Liu
,
Rong Xie
,
Yunhui Zhu
,
Jingyi Cao
,
Li Song
(2020).
A Hybrid Model for Natural Face De-Identiation with Adjustable Privacy
.
2020 IEEE International Conference on Visual Communications and Image Processing (VCIP)
.
Cite
DOI
Zhen Yang
,
Li Song
,
Rong Xie
,
Wenjun Zhang
,
Lin Li
,
Yanan Feng
(2020).
TSGAN: A Two-Stream Generative Adversarial Network for Bit-Depth Expansion
.
2020 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)
.
Cite
DOI
Han Xue
,
Jun Ling
,
Li Song
,
Rong Xie
,
Wenjun Zhang
(2020).
Realistic Talking Face Synthesis With Geometry-Aware Feature Transformation
.
2020 IEEE International Conference on Image Processing (ICIP)
.
Cite
DOI
Zaixin Yang
,
Yu Dong
,
Li Song
,
Rong Xie
,
Lin Li
,
Yanan Feng
(2020).
Native Resolution Detection for 4K-UHD Videos
.
2020 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)
.
Cite
DOI
Xiangwen Wang
,
Zhengyi Luo
,
Pengbo Li
,
Li Song
(2020).
Learning Based Estimation of Video Coding Distortion
.
2020 IEEE International Symposium on Circuits and Systems (ISCAS)
.
Cite
DOI
Xun Tong
,
Chen Zhu
,
Rong Xie
,
Jian Xiong
,
Li Song
(2020).
A VMAF Directed Perceptual Rate Distortion Optimization for Video Coding
.
2020 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)
.
Cite
DOI
Shengwei Yu
,
Xun Tong
,
Yan Huang
,
Rong Xie
,
Li Song
(2020).
Learning-Based Quality Enhancement For Scalable Coded Video Over Packet Lossy Networks
.
2020 IEEE International Conference on Multimedia and Expo (ICME)
.
Cite
DOI
Hanyu Xue
,
Bo Liu
,
Ming Din
,
Li Song
,
Tianqing Zhu
(2020).
Hiding Private Information in Images From AI
.
ICC 2020 - 2020 IEEE International Conference on Communications (ICC)
.
Cite
DOI
Jun Ling
,
Han Xue
,
Li Song
,
Shuhui Yang
,
Rong Xie
,
Xiao Gu
(2020).
Toward Fine-grained Facial Expression Manipulation
.
arXiv:2004.03132 [cs]
.
PDF
Cite
DOI
Xiangwen Wang
,
En-Hui Yang
,
Da-Ke He
,
Li Song
,
Xiang Yu
(2020).
Rate Distortion Optimization: A Joint Framework and Algorithms for Random Access Hierarchical Video Coding
.
IEEE Transactions on Image Processing
.
PDF
Cite
DOI
Yicheng Zhang
,
Lei Li
,
Li Song
,
Rong Xie
,
Wenjun Zhang
(2020).
FACT: Fused Attention for Clothing Transfer with Generative Adversarial Networks
.
Proceedings of the AAAI Conference on Artificial Intelligence
.
Cite
Han Zhang
,
Li Song
,
Li Li
,
Zhu Li
,
Xiaokang Yang
(2020).
Compression Priors Assisted Convolutional Neural Network for Fractional Interpolation
.
IEEE Transactions on Circuits and Systems for Video Technology
.
Cite
DOI
Shenhui Peng
,
Li Song
,
Jun Ling
,
Rong Xie
,
Song Xu
,
Lin Li
(2020).
A Deep Tracking and Segmentation Approach for Soccer Videos Visual Effects
.
Pattern Recognition and Computer Vision
.
Cite
DOI
Wenyao Gan
,
Li Song
,
Li Chen
,
Rong Xie
,
Xiao Gu
(2019).
Identifying and Pruning Redundant Structures for Deep Neural Networks
.
2019 IEEE Visual Communications and Image Processing (VCIP)
.
Cite
DOI
Yuzhuo Wei
,
Li Chen
,
Rong Xie
,
Li Song
,
Xiaoyun Zhang
,
Zhiyong Gao
(2019).
FPGA Based Video Transcoding System with 2K-4K Super-Resolution Conversion
.
2019 IEEE Visual Communications and Image Processing (VCIP)
.
Cite
DOI
Jinan Wu
,
Rong Xie
,
Li Song
,
Bo Liu
(2019).
Deep Feature Guided Image Retargeting
.
2019 IEEE Visual Communications and Image Processing (VCIP)
.
Cite
DOI
Zaixin Yang
,
Tiantian He
,
Li Song
,
Rong Xie
,
Xiao Gu
(2019).
An Improved QoE Evaluation Model for HTTP Adaptive Streaming
.
2019 IEEE Visual Communications and Image Processing (VCIP)
.
Cite
DOI
Chen Zhu
,
Li Song
,
Rong Xie
,
Jingning Han
,
Yaowu Xu
(2019).
JND-based Perceptual Rate Distortion Optimization for AV1 Encoder
.
2019 Picture Coding Symposium (PCS)
.
Cite
DOI
Yan Huang
,
Li Song
,
Ebroul Izquierdo
(2019).
CNN Accelerated Intra Video Coding, Where Is the Upper Bound?
.
2019 Picture Coding Symposium (PCS)
.
Cite
DOI
Yucheng Xu
,
Shiyu Ning
,
Rong Xie
,
Li Song
(2019).
Gan Based Multi-Exposure Inverse Tone Mapping
.
2019 IEEE International Conference on Image Processing (ICIP)
.
Cite
DOI
Yucheng Xu
,
Li Song
,
Rong Xie
,
Wenjun Zhang
(2019).
Deep Video Inverse Tone Mapping
.
2019 IEEE Fifth International Conference on Multimedia Big Data (BigMM)
.
Cite
DOI
Han Zhang
,
Li Li
,
Li Song
,
Xiaokang Yang
,
Zhu Li
(2019).
Advanced CNN Based Motion Compensation Fractional Interpolation
.
2019 IEEE International Conference on Image Processing (ICIP)
.
Cite
DOI
Xiao Li
,
Siyi Wang
,
Chen Zhu
,
Li Song
,
Rong Xie
,
Wenjun Zhang
(2019).
Viewport Prediction for Panoramic Video with Multi-CNN
.
2019 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)
.
Cite
DOI
Yilun Zhou
,
Li Chen
,
Rong Xie
,
Li Song
,
Wenjun Zhang
(2019).
Low-precision CNN Model Quantization based on Optimal Scaling Factor Estimation
.
2019 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)
.
Cite
DOI
Zhengyi Luo
,
Yan Huang
,
Xiangwen Wang
,
Rong Xie
,
Li Song
(2019).
VMAF Oriented Perceptual Optimization for Video Coding
.
2019 IEEE International Symposium on Circuits and Systems (ISCAS)
.
Cite
DOI
Xiaona Wu
,
Xiao Li
,
Xun Tong
,
Rong Xie
,
Li Song
(2019).
Reinforcement Learning Based Adaptive Bitrate Algorithm for Transmitting Panoramic Videos
.
2019 IEEE International Symposium on Circuits and Systems (ISCAS)
.
Cite
DOI
Xinyuan Chen
,
Chang Xu
,
Xiaokang Yang
,
Li Song
,
Dacheng Tao
(2019).
Gated-GAN: Adversarial Gated Networks for Multi-Collection Style Transfer
.
IEEE Transactions on Image Processing
.
PDF
Cite
DOI
张文军
,
管云峰
,
何大治
,
陈智勇
,
宋利
,
徐异凌
,
夏斌
(2019).
新一代融合媒体网络架构
.
通信学报
.
Cite
黄永铖
,
宋利
,
解蓉
(2019).
基于深度残差网络的 VP9 超级块快速划分算法
.
电视技术
.
Cite
Yicheng Zhang
,
Li Song
,
Rong Xie
,
Wenjuan Zhang
(2019).
Multi-Scale Generative Adversarial Learning for Facial Attribute Transfer
.
Digital TV and Wireless Multimedia Communication - 16th International Forum, IFTC 2019, Shanghai, China, September 19-20, 2019, Revised Selected Papers
.
Cite
DOI
Zhaoliang Ma
,
Li Song
,
Rong Xie
,
Wenjun Zhang
(2019).
Motion Adaptive Intra Refresh for Low Delay HEVC Encoding
.
Digital TV and Multimedia Communication
.
PDF
Cite
DOI
Chung Nicolas
,
Rong Xie
,
Li Song
,
Wenjun Zhang
(2019).
Improving Semantic Style Transfer Using Guided Gram Matrices
.
Digital TV and Multimedia Communication
.
PDF
Cite
DOI
Jianlun Tang
,
Yan Huang
,
Rong Xie
,
Zhengyi Luo
,
Li Song
(2018).
GPU Based Motion-Compensated Frame Interpolation Acceleration for Future Video Coding
.
2018 25th IEEE International Conference on Image Processing (ICIP)
.
PDF
Cite
DOI
Zhifeng Zhang
,
Li Chen
,
Rong Xie
,
Li Song
(2018).
Frame Interpolation via Refined Deep Voxel Flow
.
2018 25th IEEE International Conference on Image Processing (ICIP)
.
PDF
Cite
DOI
Maurizio Murroni
,
Reza Rassool
,
Li Song
,
Rafael Sotelo
(2018).
Guest Editorial Special Issue on Quality of Experience for Advanced Broadcast Services
.
IEEE Transactions on Broadcasting
.
Cite
DOI
Yue Ma
,
Yan Huang
,
Jia Wang
,
Rong Xie
,
Xiao Gu
,
Li Song
(2018).
A Segment Constraint ABR Algorithm for HEVC Encoder
.
2018 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)
.
PDF
Cite
DOI
Tiantian He
,
Rong Xie
,
Jia Su
,
Xin Tang
,
Li Song
(2018).
A No Reference Bitstream-Based Video Quality Assessment Model for H.265/HEVC and H.264/AVC
.
2018 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)
.
PDF
Cite
DOI
Yu Dong
,
Li Song
,
Rong Xie
,
Wenjun Zhang
,
Ya Zhang
(2018).
A Generic Distributed Scheduling Algorithm for Frame Rate Up Convert Video Transcoding
.
2018 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)
.
PDF
Cite
DOI
Yan Huang
,
Li Song
,
Rong Xie
,
Zhengyi Luo
,
Xiangwen Wang
(2018).
An MCMC based Efficient Parameter Selection Model for x265 Encoder
.
2018 IEEE International Symposium on Circuits and Systems (ISCAS)
.
PDF
Cite
DOI
Shiyu Ning
,
Hongteng Xu
,
Li Song
,
Rong Xie
,
Wenjun Zhang
(2018).
Learning an Inverse Tone Mapping Network with a Generative Adversarial Regularizer
.
2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
.
PDF
Cite
DOI
宋利
,
罗莹
(2018).
虚拟现实技术及其广泛应用
.
科学
.
Cite
罗莹
,
宋利
,
解蓉
,
罗传飞
(2018).
全景媒体的系统架构研究综述
.
电信科学
.
Cite
Zhifeng Zhang
,
Li Song
,
Rong Xie
,
Li Chen
(2018).
Video Frame Interpolation Using Recurrent Convolutional Layers
.
2018 IEEE Fourth International Conference on Multimedia Big Data (BigMM)
.
Cite
DOI
Ying Luo
,
Xu Liu
,
Chen Zhu
,
Rong Xie
,
Li Song
(2018).
Rate-mixed HEVC Tile based 360 Video Streaming System
.
2018 IEEE Visual Communications and Image Processing (VCIP)
.
Cite
DOI
Hao Wang
,
Li Song
,
Rong Xie
,
Zhengyi Luo
,
Xiangwen Wang
(2018).
Masking Effects Based Rate Control Scheme for High Efficiency Video Coding
.
2018 IEEE International Symposium on Circuits and Systems (ISCAS)
.
PDF
Cite
DOI
Xiaona Wu
,
Cheng Zhao
,
Rong Xie
,
Li Song
(2018).
Low Latency MPEG-DASH System Over HTTP 2.0 and WebSocket
.
Digital TV and Wireless Multimedia Communication
.
PDF
Cite
DOI
宋利, 解蓉 冯宁
(2018).
HDR / WCG 关键技术分析及标准化进展
.
电视技术
.
Cite
冯宁
,
宋利
,
解蓉
(2018).
HDR / WCG 关键技术分析及标准化进展
.
电视技术
.
Cite
Zhaoliang Ma
,
Shengwei Yu
,
Yongcheng Huang
,
Rong Xie
,
Li Song
(2018).
An Improved Real-Time Video Communication System
.
2018 IEEE Visual Communications and Image Processing (VCIP)
.
Cite
DOI
Yu Dong
,
Xusheng Zhang
,
Yanan Zhao
,
Li Song
(2018).
A Containerized Media Cloud for Video Transcoding Service
.
2018 IEEE International Conference on Consumer Electronics (ICCE)
.
PDF
Cite
DOI
BiJia Li
,
Li Song
,
Rong Xie
,
Wenjun Zhang
(2017).
Weight-based bit allocation scheme for VR videos in HEVC
.
2017 IEEE Visual Communications and Image Processing (VCIP)
.
Cite
DOI
Jingwei Xu
,
Li Song
,
Rong Xie
(2017).
Two-stream deep encoder-decoder architecture for fully automatic video object segmentation
.
2017 IEEE Visual Communications and Image Processing (VCIP)
.
Cite
DOI
Lixun Bai
,
Li Song
,
Rong Xie
,
Liang Zhang
,
Zhengyi Luo
(2017).
Rate control model for high dynamic range video
.
2017 IEEE Visual Communications and Image Processing (VCIP)
.
Cite
DOI
Yankai Liu
,
Li Song
,
Rong Xie
,
Wenjun Zhang
(2017).
A Generic Method to Improve No-Reference Image Blur Metric Accuracy in Video Contents
.
2017 IEEE Visual Communications and Image Processing (VCIP)
.
Cite
DOI
Xuchao Lu
,
Li Song
,
Rong Xie
,
Xiaokang Yang
,
Wenjun Zhang
(2017).
Deep Binary Representation for Efficient Image Retrieval
.
Advances in Multimedia
.
PDF
Cite
DOI
Ying Luo
,
Li Song
,
Rong Xie
,
Chuanfei Luo
(2017).
View-Dependent Omnidirectional Video Encapsulation Using Multiple Tracks
.
2017 International Conference on Virtual Reality and Visualization (ICVRV)
.
Cite
DOI
Xu Liu
,
Yongcheng Huang
,
Li Song
,
Rong Xie
,
Xiaokang Yang
(2017).
The SJTU UHD 360-Degree Immersive Video Sequence Dataset
.
2017 International Conference on Virtual Reality and Visualization (ICVRV)
.
Cite
DOI
Guichun Li
,
Lingzhi Liu
,
Nam Ling
,
Jianhua Zheng
,
Chen-Xiong Zhang
,
Li Song
(2017).
Simplification of mode dependent intra smoothing
.
PDF
Cite
Xiangwen Wang
,
Li Song
,
Zhengyi Luo
,
Rong Xie
(2017).
Lagrangian method based Rate-Distortion Optimization revisited for dependent video coding
.
2017 IEEE International Conference on Image Processing (ICIP)
.
Cite
DOI
Xuchao Lu
,
Li Song
,
Rong Xie
,
Xiaokang Yang
,
Wenjun Zhang
(2017).
Deep hash learning for efficient image retrieval
.
2017 IEEE International Conference on Multimedia Expo Workshops (ICMEW)
.
Cite
DOI
Xiao Wei
,
Li Song
,
Rong Xie
,
Wenjun Zhang
(2017).
Two-stream recurrent convolutional neural networks for video saliency estimation
.
2017 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)
.
Cite
DOI
Xiangyu Li
,
Rong Xie
,
Li Song
,
Liang Zhang
(2017).
Machine learning based VP9-to-HEVC video transcoding
.
2017 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)
.
Cite
DOI
Tiantian He
,
Yankai Liu
,
Rong Xie
,
Xin Tang
,
Li Song
(2017).
Evaluation of No Reference Bitstream-based Video Quality Assessment Methods
.
arXiv:1706.10143 [cs]
.
PDF
Cite
Xusheng Zhang
,
Li Song
,
Rong Xie
,
Yanan Zhao
(2017).
A Lightweight Distributed Media Processing System for Uhd Service
.
2017 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)
.
Cite
DOI
方宏俊
,
宋利
,
杨小康
(2017).
适配分辨率动态变化的低复杂度视频场景切换检测方法
.
计算机科学
.
Cite
贺甜甜
,
刘彦凯
,
宋利
(2017).
视频用户体验理论与实践
.
电信科学
.
Cite
Qingpeng Xie
,
Peiyun Di
,
Jose Alvarez
,
Chuanfei Luo
,
Li Song
(2017).
ROI bitstream signaling in DASH-VR
.
MPEG-m39892
.
Cite
Hongjun Fang
,
Li Song
,
Xiaokang Yang
(2017).
Low Complexity Scene Change Detection Algorithm for Supporting Resolution Dynamic Change
.
unicode35745 unicode31639 unicode26426 unicode31185 unicode23398*
.
Cite
DOI
Han Zhang
,
Li Song
,
Zhengyi Luo
,
Xiaokang Yang
(2017).
Learning a convolutional neural network for fractional interpolation in HEVC inter coding
.
2017 IEEE Visual Communications and Image Processing (VCIP)
.
Cite
Peiyun Di
,
Qingpeng Xie
,
Jose Alvarez
,
Chuanfei Luo
,
Li Song
(2017).
Fast switching bitstream signaling in OMAF
.
MPEG-m39894
.
Cite
Peiyun Di
,
Qingpeng Xie
,
Jose Alvarez
,
Chuanfei Luo
,
Li Song
(2017).
Fast switching bitstream signaling in DASH-VR
.
MPEG-m39893
.
Cite
Biao Wang
,
Ge Chen
,
Luoyi Fu
,
Li Song
,
Xinbing Wang
(2017).
DRIMUX: Dynamic rumor influence minimization with user experience in social networks
.
IEEE Transactions on Knowledge and Data Engineering
.
Cite
DOI
Chen Li
,
Li Song
,
Rong Xie
,
Wenjun Zhang
(2017).
CNN Based Post-Processing to Improve HEVC
.
2017 IEEE International Conference on Image Processing, ICIP 2017, Beijing, China, September 17-20, 2017
.
Cite
DOI
Chongyang Zhang
,
Bingbing Ni
,
Li Song
,
Guangtao Zhai
,
Xiaokang Yang
,
Wenjun Zhang
(2017).
BEST: Benchmark and Evaluation of Surveillance Task
.
Computer Vision – ACCV 2016 Workshops
.
Cite
DOI
Lixun Bai
,
Li Song
,
Rong Xie
,
Jianfeng Xie
,
M. Chen
(2016).
Saliency based rate control scheme for high efficiency video coding
.
2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)
.
PDF
Cite
DOI
Yankai Liu
,
Li Song
,
Xiaokang Yang
,
Rong Xie
,
Wenjun Zhang
(2016).
Review of ITU-T Parametric Models for Compressed Video Quality Estimation
.
2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)
.
Cite
DOI
Yankai Liu
,
Li Song
,
Xiaokang Yang
,
Rong Xie
,
Wenjun Zhang
(2016).
Review of ITU-T parametric models for compressed video quality estimation
.
2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)
.
PDF
Cite
DOI
Rujun Wei
,
Rong Xie
,
Li Song
,
Liang Zhang
,
Wenjun Zhang
(2016).
Improved intra angular prediction with novel interpolation filter and boundary filter
.
2016 Picture Coding Symposium (PCS)
.
Cite
DOI
Jingwei Xu
,
Li Song
,
Rong Xie
(2016).
Shot Boundary Detection Using Convolutional Neural Networks
.
2016 Visual Communications and Image Processing (VCIP)
.
Cite
DOI
Jingwei Xu
,
Li Song
,
Rong Xie
(2016).
Shot boundary detection using convolutional neural networks
.
2016 Visual Communications and Image Processing (VCIP)
.
Cite
DOI
Han Zhang
,
Li Song
,
Xiaokang Yang
,
Zhengyi Luo
(2016).
Evaluation of beyond-HEVC entropy coding methods for DCT transform coefficients
.
2016 Visual Communications and Image Processing (VCIP)
.
Cite
DOI
(2016).
Parallel fusion RNN-LSTM architecture for image caption generation
. IEEE international conference on image Processing(ICIP2016).
Cite
Xinyuan Chen
,
Li Song
,
Xiaokang Yang
(2016).
Deep RNNs for video denoising
.
Applications of Digital Image Processing XXXIX
.
PDF
Cite
DOI
Minsi Wang
,
Li Song
,
Xiaokang Yang
,
Chuanfei Luo
(2016).
A Parallel-Fusion RNN-LSTM Architecture for Image Caption Generation
.
2016 IEEE International Conference on Image Processing (ICIP)
.
Cite
DOI
Cheng Zhao
,
Li Song
,
Da Huo
,
Rong Xie
,
Nam Ling
(2016).
A Proxy-assisted DASH Live Streaming Scheme
.
The 9th international conference on ubi-media computing (UMEDIA 2016)
.
Cite
Zhangzong Zhao
,
Li Song
,
Rong Xie
,
Xiaokang Yang
(2016).
GPU accelerated high-quality video/image super-resolution
.
2016 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)
.
Cite
DOI
Lingzhi Liu
,
Guichun Li
,
Nam Ling
,
Jianhua Zheng
,
Philipp Zhang
,
Li Song
(2016).
Reference pixel reduction for intra LM prediction
.
PDF
Cite
Z. Zhao
,
Li Song
,
B. Xiao
(2016).
GPU implementation and optimization of video super-resolution
.
2016 GPU technology conference (GTC)
.
Cite
Jiaming Shen
,
Zhenyu Song
,
Shitao Li
,
Zhaowei Tan
,
Yuning Mao
,
Luoyi Fu
,
Li Song
,
Xinbing Wang
(2016).
Modeling Topic-Level Academic Influence in Scientific Literatures
.
Workshops at the Thirtieth AAAI Conference on Artificial Intelligence
.
PDF
Cite
霍达
,
宋利
(2016).
基于 Celery 的分布式视频计算处理框架
.
电视技术
.
Cite
Li Song
,
Yankai Liu
,
Xiaokang Yang
,
Guangtao Zhai
,
Rong Xie
,
Wenjun Zhang
(2016).
The SJTU HDR Video Sequence Dataset
.
Proceedings of International Conference on Quality of Multimedia Experience (QoMEX 2016)
.
Cite
Yutong Zhu
,
Li Song
,
Rong Xie
,
Wenjun Zhang
(2016).
SJTU 4K video subjective quality dataset for content adaptive bit rate estimation without encoding
.
2016 IEEE international symposium on broadband multimedia systems and broadcasting (BMSB)
.
Cite
Zhangzong Zhao
,
Li Song
,
Chuanfei Luo
,
Rong Xie
,
Xiaokang Yang
,
Wenjun Zhang
(2016).
GPU Accelerating super-resolution for converting HD to 4K
.
International Broadcasting Convention
.
Cite
BiJia Li
,
Li Song
,
Rong Xie
,
Nam Ling
(2016).
Evaluation of H.265 and H.264 for Panoramas Video under Different Map Projections
.
The 9th international conference on ubi-media computing (UMEDIA 2016)
.
PDF
Cite
Jianfeng Xie
,
Li Song
,
Rong Xie
,
Zhengyi Luo
,
Min Chen
(2016).
A Novel Parallel-Friendly Rate Control Scheme for HEVC
.
2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)
.
Cite
Qingxiong Huangyuan
,
Li Song
,
Yue Ma
,
Rong Xie
,
Zhengyi Luo
(2015).
Learning based fast H.264 to H.265 transcoding
.
2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)
.
PDF
Cite
DOI
Jianfeng Xie
,
Li Song
,
Rong Xie
,
Zhengyi Luo
,
Xiangwen Wang
(2015).
Temporal dependent bit allocation scheme for rate control in HEVC
.
2015 IEEE Workshop on Signal Processing Systems (SiPS)
.
PDF
Cite
DOI
Xiaofeng Lu
,
Junhao Zhang
,
Li Song
,
Rui Lei
,
Hengli Lu
,
Nam Ling
(2015).
Person tracking with partial occlusion handling
.
2015 IEEE Workshop on Signal Processing Systems (SiPS)
.
PDF
Cite
DOI
Rujun Wei
,
Rong Xie
,
Liang Zhang
,
Li Song
(2015).
Fast depth decision with enlarged coding block sizes for HEVC intra coding of 4K ultra-HD video
.
2015 IEEE Workshop on Signal Processing Systems (SiPS)
.
PDF
Cite
DOI
Jifa Gu
,
Sangying Xu
,
Yong Fang
,
Bo Wang
,
Qianqian Li
,
Kan Shi
,
Li Song
,
Rong Xie
(2015).
Systemic view on service management in Shanghai World Expo
.
2015 International Conference on Logistics, Informatics and Service Sciences (LISS)
.
Cite
DOI
Jianhua Xiao
,
Li Song
,
Zhengyi Luo
,
Rong Xie
,
Wenjun Zhang
(2015).
Which metric can predict coding gain of H.265/HEVC over H.264/AVC?
.
2015 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting
.
PDF
Cite
DOI
Cong Shen
,
Rong Xie
,
Liang Zhang
,
Li Song
(2015).
Small group people behavior analysis based on temporal recursive trajectory identification
.
2015 IEEE International Conference on Multimedia Expo Workshops (ICMEW)
.
Cite
DOI
Miok Kim
,
Nam Ling
,
Li Song
(2015).
Fast Single Depth Intra Mode Decision for Depth Map Coding in 3D-HEVC
.
2015 IEEE International Conference on Multimedia Expo Workshops (ICMEW)
.
Cite
DOI
Wenjing Tong
,
Li Song
,
Xiaokang Yang
,
Hui Qu
,
Rong Xie
(2015).
CNN-Based Shot Boundary Detection and Video Annotation
.
2015 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting
.
PDF
Cite
DOI
方宏俊
,
宋利
,
解蓉
(2015).
用于电视系统的视频噪声检测的优化算法设计
.
电视技术
.
Cite
方宏俊
,
施海鹏
,
宋利
(2015).
用于电视系统中的图像清晰度测量算法及实现
.
电视技术
.
Cite
Juan Hu
,
Yi Fang
,
Nam Ling
,
Li Song
(2015).
Topic Modeling for Large-Scale Multimedia Analysis and Retrieval
.
Big Data-Algorithms, Analytics, and Applications
.
Cite
Li Song
(2015).
SJTU Test Sequences for video coding development
.
JCTVC-V0083
.
Cite
Jianzhou Feng
,
Li Song
,
Xiaoming Huo
,
Xiaokang Yang
,
Wenjun Zhang
(2015).
An Optimized Pixel-Wise Weighting Approach for Patch-Based Image Denoising
.
IEEE Signal Processing Letters
.
PDF
Cite
DOI
Fangshun Mul
,
Li Song
,
Zhenyi Luo
,
Xiangwen Wang
,
Xiaokang Yang
(2014).
Speed up HEVC encoder by precoding with H.264
.
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific
.
PDF
Cite
DOI
Qingxiong Huangyuan
,
Li Song
,
Zhengyi Luo
,
Xiangwen Wang
,
Yanan Zhao
(2014).
Performance evaluation of H.265/MPEG-HEVC encoders for 4K video sequences
.
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific
.
PDF
Cite
DOI
Li Song
,
Chen Chen
,
Yi Xu
,
Genjian Xue
,
Yi Zhou
(2014).
Blind Image Quality Assessment Based on a New Feature of Nature Scene Statistics
.
2014 IEEE Visual Communications and Image Processing Conference
.
PDF
Cite
DOI
Shiyu Liang
,
Ruotian Luo
,
Ge Chen
,
Songjun Ma
,
Weijie Wu
,
Li Song
,
Xiaohua Tian
,
Xinbing Wang
(2014).
Are We Still Friends: Kernel Multivariate Survival Analysis
.
2014 IEEE Global Communications Conference
.
PDF
Cite
DOI
Miok Kim
,
Nam Ling
,
Li Song
,
Zhouye Gu
(2014).
Fast skip mode decision with rate-distortion optimization for High Efficiency Video Coding
.
2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)
.
PDF
Cite
DOI
Fangshun Mu
,
Li Song
,
Xiaokang Yang
,
Zhenyi Luo
(2014).
Fast coding unit depth decision for HEVC
.
2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)
.
PDF
Cite
DOI
Zhongzhu Yang
,
Li Song
,
Zhengyi Luo
,
Xiangwen Wang
(2014).
Low delay rate control for HEVC
.
2014 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting
.
PDF
Cite
DOI
Jianzhou Feng
,
Xiaoming Huo
,
Li Song
,
Xiaokang Yang
,
Wenjun Zhang
(2014).
Evaluation of Different Algorithms of Nonnegative Matrix Factorization in Temporal Psychovisual Modulation
.
IEEE Transactions on Circuits and Systems for Video Technology
.
PDF
Cite
DOI
王淦
,
宋利
,
张文军
(2014).
注意力模型指导下的视频质量评价方法
.
电视技术
.
Cite
张玮
,
宋利
,
杨小康
(2014).
基于视频编码增益的视频质量评价算法性能研究
.
电视技术
.
Cite
张虎
,
梁龙飞
,
宋利
(2014).
基于 DisplayPort 接口的高清视频传输方案
.
电视技术
.
Cite
Zhengcong Wang
,
Peng Wang
,
Hongguang Zhang
,
Hongjun Zhang
,
Shibao Zheng
,
Li Song
(2014).
Texture Direction Based Optimization for Intra Prediction in HEVC
.
IEICE Transactions on Information and Systems
.
Cite
DOI
Li Song
,
Yanan Zhao
,
Xiangwen Wang
,
Zhengyi Luo
,
Min Chen
(2014).
Progress on realtime UHD HEVC encoder
.
The 2nd international workshop on video coding and video Processing(VCVP2014)
.
Cite
J. Feng
,
X. Huo
,
Li Song
,
X. Yang
,
W. Zhang
(2014).
Image nonnegative factorization: Formulation and numerical strategies
.
Master lectures on mathematics
.
Cite
Yuming Cao
,
Xiaoquan You
,
Jia Wang
,
Li Song
(2014).
A QoE Friendly Rate Adaptation Method for DASH
.
2014 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting
.
Cite
DOI
Zhengyi Luo
,
Li Song
,
Shibao Zheng
,
Nam Ling
(2013).
Raptor Codes Based Unequal Protection for Compressed Video According to Packet Priority
.
IEEE Transactions on Multimedia
.
PDF
Cite
DOI
Qi Cai
,
Li Song
(2013).
Avs Encoding Optimization with Perceptual Just Noticeable Distortion Model
.
2013 9th International Conference on Information, Communications Signal Processing
.
Cite
DOI
Hui Qu
,
Li Song
,
Gengjian Xue
(2013).
Shaking video synthesis for video stabilization performance assessment
.
2013 Visual Communications and Image Processing (VCIP)
.
Cite
DOI
Hui Qu
,
Li Song
(2013).
Video stabilization with L1–L2 optimization
.
2013 IEEE International Conference on Image Processing
.
Cite
DOI
Jifa Gu
,
Shanying Xu
,
Yong Fang
,
Kan Shi
,
Benfu Lv
,
Geng Peng
,
Bo Wang
,
Li Song
,
Rong Xie
(2013).
Three Aspects on Solving Queuing Service System in Shanghai World Expo
.
Journal of Systems Science and Systems Engineering
.
Cite
DOI
Jifa Gu
,
Shanying Xu
,
Yong Fang
,
Kan Shi
,
Benfu Lv
,
Geng Peng
,
Bo Wang
,
Li Song
,
Rong Xie
(2013).
Three aspects on solving queuing service system in Shanghai world expo
.
Journal of Systems Science and Systems Engineering
.
PDF
Cite
DOI
Jianzhou Feng
,
Li Song
,
Xiaoming Huo
,
Xiaokang Yang
,
Wenjun Zhang
(2013).
Image restoration via efficient Gaussian mixture model learning
.
2013 IEEE International Conference on Image Processing
.
Cite
DOI
Gengjian Xue
,
Li Song
,
Jun Sun
(2013).
Foreground Estimation Based on Linear Regression Model With Fused Sparsity on Outliers
.
IEEE Transactions on Circuits and Systems for Video Technology
.
PDF
Cite
DOI
Chengfeng Lin
,
Jianhua He
,
Yi Zhou
,
Xiaokang Yang
,
Kai Chen
,
Li Song
(2013).
Analysis and Identification of Spamming Behaviors in Sina Weibo Microblog
.
Proceedings of the 7th Workshop on Social Network Mining and Analysis
.
Cite
DOI
Jifa Gu
,
Shanying Xu
,
Yong Fang
,
Kan Shi
,
Bo Wang
,
Li Song
,
Rong Xie
(2013).
Shanghai World Expo and queuing service system
.
2013 10th International Conference on Service Systems and Service Management
.
Cite
DOI
Xiangwen Wang
,
Li Song
,
Min Chen
,
Junjie Yang
(2013).
Paralleling variable block size motion estimation of HEVC on CPU plus GPU platform
.
2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)
.
Cite
DOI
Xiaofeng Lu
,
Li Song
,
Sumin Shen
,
Kang He
,
Songyu Yu
,
Nam Ling
(2013).
Parallel Hough Transform-Based Straight Line Detection and Its FPGA Implementation in Embedded Vision
.
Sensors
.
PDF
Cite
DOI
Chen Chen
,
Li Song
,
Xiangwen Wang
,
Meng Guo
(2013).
No-reference video quality assessment on mobile devices
.
2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)
.
Cite
DOI
Zhengyi Luo
,
Li Song
,
Shibao Zheng
,
Nam Ling
(2013).
H.264/AVC Perceptual Optimization Coding Based on JND-Directed Coefficient Suppression
.
IEEE Trans on Circuits and Systems for Video Tech
.
Cite
DOI
Zhengyi Luo
,
Li Song
,
Shibao Zheng
,
Nam Ling
(2013).
H.264/AVC Perceptual Optimization Coding based on JND-Directed Coefficient Suppression
.
IEEE Trans on Circuits and Systems for Video Tech
.
Cite
DOI
Keyi Shen
,
Jianmin Wu
,
Ya Zhang
,
Yiping Han
,
Xiaokang Yang
,
Li Song
,
Xiao Gu
(2013).
Reorder user's tweets
.
ACM Transactions on Intelligent Systems and Technology
.
PDF
Cite
DOI
Miok Kim
,
Nam Ling
,
John D. Ralston
,
Li Song
(2013).
A Mesh-Based Method for Wavelet Video Coding Using Edge-Detection in Low Frequency Subband
.
2013 IEEE 4th Latin American Symposium on Circuits and Systems (LASCAS)
.
Cite
DOI
Li Song
,
Xun Tang
,
Wei Zhang
,
Xiaokang Yang
,
Pingjian Xia
(2013).
The SJTU 4K video sequence dataset
.
2013 Fifth International Workshop on Quality of Multimedia Experience (QoMEX)
.
Cite
DOI
Gengjian Xue
,
Li Song
,
Jun Sun
,
Jun Zhou
(2013).
Foreground detection: Combining background subspace learning with object smoothing model
.
2013 IEEE International Conference on Multimedia and Expo (ICME)
.
Cite
Yanan Zhao
,
Li Song
,
Xiangwen Wang
,
Min Chen
,
Jia Wang
(2013).
Efficient realization of parallel HEVC intra encoding
.
2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)
.
Cite
Qi Cai
,
Li Song
,
Guichun Li
,
Nam Ling
(2012).
Lossy and Lossless Intra Coding Performance Evaluation: HEVC, H.264/AVC, JPEG 2000 and JPEG LS
.
Proceedings of The 2012 Asia Pacific Signal and Information Processing Association Annual Summit and Conference
.
Cite
Zhengyi Luo
,
Li Song
,
Shibao Zheng
,
Nam Ling
(2012).
Optimized nested protection for video Region of Interest with Raptor codes
.
2012 Visual Communications and Image Processing
.
Cite
DOI
Jianzhou Feng
,
Li Song
,
Xiaoming Huo
,
Xiaokang Yang
,
Wenjun Zhang
(2012).
New bounds on image denoising: Viewpoint of sparse representation and non-local averaging
.
2012 Visual Communications and Image Processing
.
Cite
DOI
Gengjian Xue
,
Jun Sun
,
Li Song
(2012).
Background Subtraction Based on Phase Feature and Distance Transform
.
Pattern Recognition Letters
.
PDF
Cite
DOI
Yi Zhou
,
Kai Chen
,
Li Song
,
Xiaokang Yang
,
Jianhua He
(2012).
Feature Analysis of Spammers in Social Networks with Active Honeypots: A Case Study of Chinese Microblogging Networks
.
2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
.
Cite
DOI
Xiaofeng Lu
,
Li Song
,
Yi Xu
,
Songyu Yu
(2012).
A Novel Occlusion-Adaptive Object Tracking Method
.
2012 International Conference on Computer Science and Information Processing (CSIP)
.
Cite
DOI
Xiaofeng Lu
,
Li Song
,
Songyu Yu
,
Nam Ling
(2012).
Object contour tracking using multi-feature fusion based particle filter
.
2012 7th IEEE Conference on Industrial Electronics and Applications (ICIEA)
.
Cite
DOI
Lingzhi Liu
,
Guichun Li
,
Nam Ling
,
Li Song
,
Zheng
,
Philipp Zhang
(2012).
Using averaged down-sampling reference pixels in LM parameter generation
.
JCTVC-H0464
.
Cite
Guichun Li
,
Lingzhi Liu
,
Nam Ling
,
Changcai Lai
,
Jianhua Zheng
,
Philipp Zhang
,
Li Song
(2012).
Simplification of mode dependent intra smoothing
.
JCTVC-H0465
.
Cite
李争光
,
宋利
(2012).
基于结点相似性的层次化社团发现算法
.
信息技术
.
Cite
Bing Liu
,
Li Song
(2012).
Skew Estimation Based on Haar-Like Features
.
Advances on Digital Television and Wireless Multimedia Communications
.
Cite
J. Gu
,
S. Xu
,
Y. Fang
,
K. Shi
,
B. Wang
,
Li Song
,
Rong Xie
(2012).
Queuing problems in Shanghai World Expo, Social Dynamics
.
Social Physics
.
Cite
Cheng Xia
,
Tsuyoshi Saito
,
Li Song
(2012).
Measurement Algorithm for Image Structure Noise on Hardcopy
.
Advances on Digital Television and Wireless Multimedia Communications
.
Cite
Yi Zhou
,
Kai Chen
,
Li Song
,
Xiaokang Yang
(2012).
Analyzing Spammers of Social Networks using Honeypot-A Case Study of Microblogging of China
.
Cite
Li Song
,
Zhengyi Luo
,
Cong Xiong
(2011).
Improving Lossless Intra Coding of H.264/AVC by Pixel-Wise Spatial Interleave Prediction
.
IEEE Transactions on Circuits and Systems for Video Technology
.
Cite
DOI
Q. Chen
,
Y. Zhou
,
K. Chen
,
L. Song
,
X. Yang
(2011).
Chinese words detection in camera based images using stroke width transform
.
International conference on opto-electronics engineering and information science (ICOEIS 2011), Dec.23-25, 2011
.
Cite
Jianzhou Feng
,
Li Song
,
Xiaokang Yang
,
Wenjun Zhang
(2011).
Learning dictionary via subspace segmentation for sparse representation
.
2011 18th IEEE International Conference on Image Processing
.
Cite
DOI
Gengjian Xue
,
Li Song
,
Jun Sun
,
Meng Wu
(2011).
Hybrid Center-Symmetric Local Pattern for Dynamic Background Subtraction
.
2011 IEEE International Conference on Multimedia and Expo
.
Cite
DOI
Gengjian Xue
,
Li Song
,
Jun Sun
,
Meng Wu
(2011).
Hybrid center-symmetric local pattern for dynamic background subtraction
.
2011 IEEE International Conference on Multimedia and Expo
.
Cite
DOI
Kai Chen
,
Yi Zhou
,
Li Song
,
Xiaokang Yang
(2011).
Building Artificial Identities in Social Network Using Semantic Information
.
2011 International Conference on Advances in Social Networks Analysis and Mining
.
Cite
DOI
Kai Chen
,
Yi Zhou
,
Chenxuan Li
,
Li Song
,
Xiaokang Yang
(2011).
A Learning-Based Text Detection Method in Camera Images
.
2011 IEEE International Conference on Computer Science and Automation Engineering
.
Cite
DOI
Jianzhou Feng
,
Li Song
,
Xiaoming Huo
,
Xiaokang Yang
,
Wenjun Zhang
(2011).
Learning sparse dictionaries with a popularity-based model
.
2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
.
Cite
DOI
Junni Zou
,
Hongkai Xiong
,
Chenglin Li
,
Li Song
,
Zhihai He
,
Tsuhan Chen
(2011).
Prioritized Flow Optimization With Multi-Path and Network Coding Based Routing for Scalable Multirate Multicasting
.
IEEE Transactions on Circuits and Systems for Video Technology
.
Cite
DOI
陈启立
,
宋利
,
余松煜
(2011).
视频稳像技术综述
.
电视技术
.
Cite
顾基发
,
徐山鹰
,
房勇
,
时勘
,
王波
,
宋利
,
解蓉
(2011).
世博会排队集群行为研究
.
上海理工大学学报
.
Cite
Ji-Fa Gu
,
San-Ying Xu
,
Yong Fang
,
Kan Shi
,
Bo Wang
,
Li Song
,
Rong Xie
(2011).
Study on the collective behaviors of queuing in the Shanghai World Expo
.
Journal of University of Shanghai for Science and Technology
.
Cite
Die Hu
,
Li Song
,
Cheng Zhi
(2011).
Multi-illumination Face Recognition from a Single Training Image per Person with Sparse Representation
.
Computer Vision – ACCV 2010
.
Cite
DOI
Kai Chen
,
Yi Zhou
,
Qi Zheng
,
Xiaokang Yang
,
Li Song
(2011).
MCM: An Efficient Geometric Constraint Method for Robust Local Feature Matching
.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2011), Nara Centennial Hall, Nara, Japan, June 13-15, 2011
.
Cite
Zhenchao Xu
,
Li Song
,
Jia Wang
,
Yi Xu
(2011).
Improving Detector of Viola and Jones through SVM
.
Computer Vision – ACCV 2010 Workshops
.
Cite
DOI
Gengjian Xue
,
Li Song
,
Jun Sun
,
Meng Wu
(2011).
Foreground estimation based on robust linear regression model
.
2011 18th IEEE International Conference on Image Processing
.
Cite
DOI
Kai Chen
,
Zhou Yi
,
Qi Zheng
,
Xiaokang Yang
,
Li Song
(2011).
An Efficient Geometric Constraint Method for Robust Local Feature Matching
.
The 12th IAPR Conference on Machine Vision Applications
.
Cite
Shusheng Li
,
Rong Xie
,
Li Song
,
Xiaokang Yang
,
Wenjun Zhang
(2011).
A New Combining Prediction Method of Visitor Numbers at Shanghai Expo
.
International conference on opto-electronics engineering and information science (ICOEIS 2011), Dec.23-25, 2011
.
Cite
Hongkai Xiong
,
Hui Lv
,
Yongsheng Zhang
,
Li Song
,
Zhihai He
,
Tsuhan Chen
(2010).
Subgraphs Matching-Based Side Information Generation for Distributed Multiview Video Coding
.
EURASIP Journal on Advances in Signal Processing
.
PDF
Cite
DOI
Nana Guo
,
Li Song
,
Xiaokang Yang
,
Wenjun Zhang
(2010).
Image interpolation based on decomposition
.
2010 International Symposium on Intelligent Signal Processing and Communication Systems
.
Cite
DOI
Li Song
,
Zhengyi Luo
,
Xiaokang Yang
(2010).
On reference sample filtering and transform-bypass lossless intra coding
.
VCEG-AO11
.
Cite
Zejun Ma
,
Li Song
,
Cheng Zhi
,
Libo Yang
(2010).
Distributed link-aware rate allocation for R-D optimal multiple video streaming over wireless networks
.
2010 International Conference on Wireless Communications Signal Processing (WCSP)
.
Cite
DOI
Pei Wang
,
Li Song
,
Songyu Yu
,
Libo Yang
(2010).
Analysis and comparison of FEC and FEC-ARQ protection schemes based on RS and Raptor code
.
2010 International Conference on Wireless Communications Signal Processing (WCSP)
.
Cite
DOI
Keyi Shen
,
Li Song
,
Xiaokang Yang
,
Wenjun Zhang
(2010).
A Hierarchical Diffusion Algorithm for Community Detection in Social Networks
.
2010 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery
.
Cite
DOI
Jianzhou Feng
,
Li Song
,
Xiaoming Huo
,
Xiaokang Yang
,
Wenjun Zhang
(2010).
Image denoising using local tangent space alignment
.
Visual Communications and Image Processing 2010
.
PDF
Cite
DOI
Li You
,
Song Li
,
Wang Jia
(2010).
Automatic Weak Calibration of Master-Slave Surveillance System Based on Mosaic Image
.
2010 20th International Conference on Pattern Recognition
.
Cite
DOI
Gengjian Xue
,
Jun Sun
,
Li Song
(2010).
Dynamic Background Subtraction Based on Spatial Extended Center-Symmetric Local Binary Pattern
.
2010 IEEE International Conference on Multimedia and Expo
.
Cite
DOI
Gengjian Xue
,
Jun Sun
,
Li Song
(2010).
Dynamic background subtraction based on spatial extended center-symmetric local binary pattern
.
2010 IEEE International Conference on Multimedia and Expo
.
Cite
DOI
Zhengyi Luo
,
Li Song
,
Shibao Zheng
(2010).
Improving H.264/AVC video coding with adaptive coefficient suppression
.
Proceedings of 2010 IEEE International Symposium on Circuits and Systems
.
Cite
DOI
Zhengyi Luo
,
Song Li
,
Yi Xu
,
Xiaokang Yang
(2010).
Improved error concealment of region of interest based on the H.264/AVC standard
.
Optical Engineering
.
PDF
Cite
DOI
李铀
,
宋利
,
王嘉
(2010).
基于图像拼接的双摄像机系统自动标定方法
.
电视技术
.
Cite
刘硕
,
宋利
,
余松煜
(2010).
基于分层判断的 x264 快速模式选择算法
.
上海交通大学学报
.
Cite
周圣鑫
,
周军
,
宋利
,
陈立
(2010).
一种针对小目标的跟踪算法
.
计算机工程
.
Cite
Qiang Zhou
,
Li Song
,
Wenjun Zhang
(2010).
Video coding with key frames guided super-resolution
.
Pacific-Rim Conference on Multimedia
.
Cite
Yi Xu
,
Xiaokang Yang
,
Li Song
,
Leonardo Traversoni
,
Wei Lu
(2010).
QWT: Retrospective and New Applications
.
Geometric Algebra Computing
.
PDF
Cite
DOI
Chen Yao
,
Xiaokang Yang
,
Jia Wang
,
Song Li
,
Guangtao Zhai
(2010).
Patch-Driven Colorization
.
Optical Engineering
.
PDF
Cite
DOI
Gengjian Xue
,
Jun Sun
,
Li Song
(2010).
Background Subtraction Based on Phase and Distance Transform under Sudden Illumination Change
.
Proceedings of the International Conference on Image Processing, ICIP 2010, September 26-29, Hong Kong, China
.
Cite
DOI
Xin Ma
,
Li Song
,
Songyu Yu
(2010).
Adaptive Pixel Interpolation for Spatial Error Concealment
.
Cite
DOI
Zhijun Fang
,
Jiasheng Yuan
,
Li Song
(2009).
Interpolation Method Based Adaptive Directional Lifting Wavelet Transform
.
2009 International Symposium on Computer Network and Multimedia Technology
.
PDF
Cite
DOI
Li Song
,
Yi Xu
,
Cong Xiong
,
Leonardo Traversoni
(2009).
Improved Intra-coding Methods for H.264/AVC
.
EURASIP Journal on Advances in Signal Processing
.
PDF
Cite
DOI
Jianzhou Feng
,
Li Song
,
Xiaokang Yang
,
Wenjun Zhang
(2009).
Sub clustering K-SVD: Size variable dictionary learning for sparse representations
.
2009 16th IEEE International Conference on Image Processing (ICIP)
.
PDF
Cite
DOI
Yongsheng Zhang
,
Hongkai Xiong
,
Li Song
,
Songyu Yu
(2009).
Spatial non-stationary correlation noise modeling for Wyner-Ziv error resilience video coding
.
2009 16th IEEE International Conference on Image Processing (ICIP)
.
PDF
Cite
DOI
Qian Chen
,
Xiaokang Yang
,
Li Song
,
Wenjun Zhang
(2009).
Robust Video Region-of-Interest Coding Based on Leaky Prediction
.
IEEE Transactions on Circuits and Systems for Video Technology
.
PDF
Cite
DOI
Dorian Schneider
,
Marco Jeub
,
Jun Zhou
,
Song Li
(2009).
Advanced H.264/AVC encoder optimizations on a TMS320DM642 digital signal processor
.
2009 16th International Conference on Digital Signal Processing
.
PDF
Cite
DOI
J. Zou
,
H. Xiong
,
L. Song
,
Z. He
,
T. Chen
(2009).
Prioritized Flow Optimization with Generalized Routing for Scalable Multirate Multicasting
.
2009 IEEE International Conference on Communications
.
PDF
Cite
DOI
Zhengyi Luo
,
Li Song
,
Shibao Zheng
(2009).
Offset based leaky prediction for error resilient ROI coding
.
2009 IEEE International Conference on Multimedia and Expo
.
PDF
Cite
DOI
Li Song
,
Xin Ma
(2009).
Improving flexible macroblock ordering of H.264/AVC
.
2009 IEEE International Conference on Multimedia and Expo
.
Cite
DOI
Jun Xu
,
Xiaokang Yang
,
Shibao Zheng
,
Li Song
(2009).
Group-of-pictures-based unequal error protection for scalable video coding extension of H.264/AVC
.
Optical Engineering
.
PDF
Cite
DOI
H. Lv
,
H. Xiong
,
L. Song
,
Z. He
,
T. Chen
(2009).
Graph Matching Based Side Information Generation for Distributed Multi-View Video Coding
.
2009 IEEE International Conference on Communications
.
PDF
Cite
DOI
Zhe Yuan
,
Hongkai Xiong
,
Li Song
,
Yuan F. Zheng
(2009).
Generic video coding with abstraction and detail completion
.
2009 IEEE International Conference on Acoustics, Speech and Signal Processing
.
PDF
Cite
DOI
刘素丽
(2009).
高误码环境下对 TFMCC 性能的改进研究
.
Cite
Zhengyi Luo
,
Li Song
,
Shibao Zheng
(2009).
Unequal Error Protection of Multiple Programs Based on Length-Variable Transport Stream Packets
.
2009 WRI World Congress on Computer Science and Information Engineering
.
PDF
Cite
DOI
Zhou Yu
,
Yi Xu
,
Xiaokang Yang
,
Li Song
(2009).
Structure-Preserving Colorization Based on Quaternionic Phase Reconstruction
.
Advances in Multimedia Information Processing - PCM 2009
.
PDF
Cite
DOI
Hao Wang
,
Yi Xu
,
Xiaokang Yang
,
Li Song
,
Wenjun Zhang
(2009).
Spatiotemporal Phase Congruency Based Invariant Features for Human Behavior Classification
.
Advances in Multimedia Information Processing - PCM 2009
.
PDF
Cite
DOI
王任
,
宋利
,
支琤
(2009).
HD Photo 的码率控制算法研究
.
信息技术
.
Cite
王祥远
,
王兴东
,
宋利
(2009).
DVCPRO HD 并行解码算法的研究与实现
.
信息技术
.
Cite
Xiaojun Ma
,
Yi Xu
,
Li Song
,
Xiaokang Yang
,
Hans Burkhardt
(2008).
Color Image Watermarking Using Local Quaternion Fourier Spectral Analysis
.
2008 IEEE International Conference on Multimedia and Expo
.
PDF
Cite
DOI
Wenrui Dai
,
Hongkai Xiong
,
Li Song
(2008).
On Non-sequential Context Modeling with Application to Executable Data Compression
.
Data Compression Conference (dcc 2008)
.
PDF
Cite
DOI
Wei Lu
,
Yi Xu
,
Xiaokang Yang
,
Li Song
(2008).
Local Quaternionic Gabor Binary Patterns for color face recognition
.
2008 IEEE International Conference on Acoustics, Speech and Signal Processing
.
PDF
Cite
DOI
Haohao Song
,
Songyu Yu
,
Xiaokang Yang
,
Li Song
,
Chen Wang
(2008).
Contourlet-based Image Adaptive Watermarking
.
Signal Processing: Image Communication
.
PDF
Cite
DOI
熊聪
(2008).
视频编码增强技术研究
.
Cite
宋利
,
周源华
,
周军
(2008).
基于运动矢量的视频去抖动算法
.
上海交通大学学报
.
Cite
金崇奎
,
王嘉
,
宋利
(2008).
一种基于 TCPW 的流媒体端到端拥塞控制方法
.
中国图象图形学报
.
Cite
Xiaokang Yang
,
Rui Zhang
,
Yi Xu
,
Anwen Liu
,
Jiemin Liu
,
Zheng Lu
,
Xiaoling Chen
,
Erkang Chen
,
Qing Yan
,
Zhaowen Wang
,
Yanlan Song
,
Xiaojie Sheng
,
Bo Xiao
,
Zhou Yu
,
Zhenfei Chu
,
Hang Su
,
Jun Huang
,
Li Song
(2008).
Shanghai Jiao Tong University Participation in High-Level Feature Extraction, Automatic Search and Surveillance Event Detectionat TRECVID 2008
.
TRECVID 2008 Workshop Participants Notebook Papers, Gaithersburg, MD, USA, November 2008
.
Cite
Xiaokang Yang
,
Rui Zhang
,
Yi Xu
,
Anwen Liu
,
Jiemin Liu
,
Zheng Lu
,
Xiaolin Chen
,
Erkang Chen
,
Qing Yan
,
Zhaowen Wang
,
Yanlan Song
,
Xiaojie Sheng
,
Bo Xiao
,
Zhou Yu
,
Zhenfei Chu
,
Hang Su
,
Jun Huang
,
Li Song
(2008).
Shanghai Jiao Tong University participation in high-level feature extraction, automatic search and surveillance event detection at TRECVID 2008
.
Cite
JIN Chong-Kui
,
WAND Jia
,
Song Li
(2008).
An End-to-end Congestion Control Strategy for Streaming Media Based on TCPW
.
Journal of Image and Graphics
.
Cite
Nannan Ma
,
Hongkai Xiong
,
Li Song
(2008).
2-D Dual Multiresolution Decomposition through NUDFB and Its Application
.
International Workshop on Multimedia Signal Processing, MMSP 2008, October 8-10, 2008, Shangri-la Hotel, Cairns, Queensland, Australia
.
Cite
DOI
Qian Chen
,
Li Song
,
Xiaokang Yang
,
Wenjun Zhang
(2007).
Robust Region-of-Interest Scalable Coding with Leaky Prediction in H.264/AVC
.
2007 IEEE Workshop on Signal Processing Systems
.
PDF
Cite
DOI
Yi Xu
,
Xiaokang Yang
,
Peifeng Zhang
,
Li Song
,
Leonardo Traversoni
(2007).
Cooperative Stereo Matching using Quaternion Wavlets and Top-Down Segmentation
.
Multimedia and Expo, 2007 IEEE International Conference on
.
PDF
Cite
DOI
Jun Xu
,
Li Song
,
Shibao Zheng
,
Xiaokang Yang
,
Rong Xie
(2007).
Bit Allocation for Fine-Granular SNR Scalability Coding with Hierarchical B Pictures
.
Multimedia and Expo, 2007 IEEE International Conference on
.
PDF
Cite
DOI
Haohao Song
(2007).
Fusion of multispectral and panchromatic satellite images based on contourlet transform and local average gradient
.
Optical Engineering
.
PDF
Cite
DOI
马鑫
,
杨小康
,
宋利
(2007).
自适应时域差错掩盖方法
.
中国图象图形学报
.
Cite
马佳
,
支琤
,
宋利
(2007).
联合拥塞控制与 H. 264 容错编码的视频传输
.
中国图象图形学报
.
Cite
周瑾
,
支琤
,
宋利
(2007).
流媒体应用中 TS 和 MP4 格式分析
.
信息技术
.
Cite
宋春霞
,
熊红凯
,
余松煜
,
宋利
(2007).
基于可分级编码基本层的码率控制方法
.
中国图象图形学报
.
Cite
谈永敏
,
杨小康
,
宋利
(2007).
基于可伸缩视频编码 Hierarchical-B 结构的恒定质量控制
.
中国图象图形学报
.
Cite
骆政屹
,
余松煜
,
宋利
,
杨小康
(2007).
H. 264 可分级扩展技术的介绍和分析
.
中国图象图形学报
.
Cite
熊聪
,
余松煜
,
宋利
,
杨小康
(2007).
H. 264 兼容的全景视频编码方法
.
中国图象图形学报
.
Cite
ZHOU Jin
,
ZHI Cheng
,
Song Li
(2007).
Analysis of TS and MP4 formats in the application of streaming media
.
Information Technology
.
Cite
Tao Zhang
,
Shuo Liu
,
Jialin He
,
Hai Zhang
(2007).
A New Algorithm on Short Window MDCT for Dolby Ac3
.
2007 International Symposium on Intelligent Signal Processing and Communication Systems
.
PDF
Cite
DOI
Zheng Lu
,
Yi Xu
,
Xiaokang Yang
,
Li Song
,
Leonardo Traversoni
(2007).
2D Quaternion Fourier Transform: The Spectrum Properties and Its Application in Color Image Registration
.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007, July 2-5, 2007, Beijing, China
.
Cite
DOI
Haohao Song
,
Songyu Yu
,
Chen Wang
,
Li Song
,
Hongkai Xiong
(2006).
A New Deblocking Algorithm Based on Adjusted Contourlet Transform
.
2006 IEEE International Conference on Multimedia and Expo
.
PDF
Cite
DOI
Yang Li
,
Hongkai Xiong
,
Li Song
,
Songyu Yu
(2006).
A Context-Based Error Detection Strategy into H.264/AVC CABAC
.
2006 IEEE International Conference on Multimedia and Expo
.
PDF
Cite
DOI
Jia Ma
,
Hongkai Xiong
,
Li Song
,
Songyu Yu
(2006).
An Unequal Error Protection in ROI for H.264
.
2006 Picture Coding Symposium (PCS)
.
Cite
ZHOU Jin
,
Xiong Hong-Kai
,
Song Li
,
Yu Song-Yu
(2006).
Resynchronization and remultiplexing for transcoding to H.264/AVC
.
Journal of Zhejiang University-SCIENCE A
.
PDF
Cite
DOI
王晨
,
周军
,
宋利
,
熊红凯
(2006).
H. 264 帧内方向预测增强模式
.
上海交通大学学报
.
Cite
童伟
,
支琤
,
宋利
,
熊红凯
(2006).
H. 264 中 CAVLC 解码的高效算法
.
微计算机信息
.
Cite
MA Jia
,
ZHI Cheng
,
Song Li
(2006).
An Integrated Congestion Control and H. 264 Error Resilience Framework for Video Transmission
.
Journal of Image and Graphics
.
Cite
Dongdong Zhang
,
Wenjun Zhang
,
Li Song
,
Hongkai Xiong
(2005).
A Study on Motion Prediction and Coding for In-Band Motion Compensated Temporal Filtering
.
IEEE international conference on computational intelligence and security (CIS 2005)
.
PDF
Cite
DOI
Li Song
,
Yuanhua Zhou
,
Jun Zhou
(2005).
Progressive refinement for robust image registration
.
Chinese Optics Letters
.
PDF
Cite
Haohao Song
,
Songyu Yu
,
Li Song
,
Hongkai Xiong
(2005).
A High-Efficient Significant Coefficient Scanning Algorithm for 3-D Embedded Wavelet Video Coding
.
Visual Communications and Image Processing 2005
.
PDF
Cite
DOI
Song Li
,
H. K. Xiong
,
Feng Wu
,
Hong Chen
(2005).
Adaptive Update Using Visual Models for Lifting-Based Motion-Compensated Temporal Filtering
.
PDF
Cite
DOI
Li Song
,
Hongkai Xiong
,
JiZhen Xu
,
Feng Wu
,
Hui Su
(2005).
Adaptive Predict Based on Fading Compensation for Lifting-Based Motion Compensated Temporal Filtering
.
Proceedings. (ICASSP ‘05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.
.
Cite
DOI
Li Song
,
Yuan-Hua Zhou
,
Jun Zhou
(2005).
Robust video stabilization based on motion vectors
.
Journal of Shanghai University (English Edition)
.
PDF
Cite
DOI
宋利
,
周源华
,
周军
(2005).
基于复小波变换的图像镶嵌
.
上海交通大学学报
.
Cite
肖平平
,
余松煜
,
宋利
,
熊红凯
(2005).
基于交叠正交变换的视频压缩编码分析与实现
.
中国图象图形学报
.
Cite
Haohao Song
,
Songyu Yu
,
Li Song
,
Hongkai Xiong
(2005).
Contourlet Image Coding Based on Adjusted SPIHT
.
Advances in Multimedia Information Processing - PCM 2005
.
PDF
Cite
DOI
L. Song
,
J. Xu
,
H. Xiong
,
F. Wu
(2005).
Content Adaptive Update for Lifting-Based Motion-Compensated Temporal Filtering
.
Electronics Letters
.
PDF
Cite
DOI
Li Song
,
Jizheng Xu
,
Hongkai Xiong
,
Feng Wu
(2004).
Content adaptive update steps for improved visual quality in lifting-based motion compensated temporal filtering
.
MPEG-m11129
.
Cite
宋利
,
周源华
,
周军
(2004).
基于特征匹配的鲁棒图像镶嵌
.
上海交通大学学报
.
Cite
Li Song
,
JiZhen Xu
,
Hongkai Xiong
,
Feng Wu
(2004).
CONTENT ADAPTIVE UPDATE STEPS FOR LIFTING-BASED MOTION COMPENSATED TEMPORAL FILTERING
.
Proc. of the Picture Coding Symposium
.
Cite
宋利
,
周源华
,
周军
(2003).
一种全景图浏览器的 JAVA 实现算法
.
计算机应用与软件
.
Cite
Lianji Cheng
,
Li Song
,
Songyu Yu
(2003).
Unequal packet loss protected transmission for FGS video
.
Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint
.
PDF
Cite
DOI
周军
,
徐奕
,
宋利
(2002).
基于多分辨小波变换的相位匹配
.
科学技术与工程
.
Cite
宋利
,
周源华
,
Others
(2002).
基于 Fourier 合成技术的水波动画
.
计算机工程与应用
.
Cite
何瑛
,
宋利
,
张伟
,
Others
(2000).
基于 LabVIEW 的数采卡 (DAQ) 驱动程序设计
.
电测与仪表
.
Cite
张伟
,
李永新
(1999).
基于分布式多机系统的准动态压力标定系统
.
微计算机信息
.
Cite
顾玉辉
,
宋利
,
朱明武
(1999).
IVI 模型在虚拟仪器驱动程序开发中的应用
.
电子技术应用
.
Cite
Cite
×