The Community for Technology Leaders
2016 IEEE International Conference on Multimedia and Expo (ICME) (2016)
Seattle, WA, USA
July 11, 2016 to July 15, 2016
ISSN: 1945-788X
ISBN: 978-1-4673-7259-6
TABLE OF CONTENTS

Author index (PDF)

pp. 1-98

Title page (PDF)

pp. 1

Depth augmented stereo panorama for cinematic virtual reality with head-motion parallax (Abstract)

Jayant Thatte , Department of Electrical Engineering, Stanford University
Jean-Baptiste Boin , Department of Electrical Engineering, Stanford University
Haricharan Lakshman , Department of Electrical Engineering, Stanford University
Bernd Girod , Department of Electrical Engineering, Stanford University
pp. 1-6

Quality-of-experience prediction for streaming video (Abstract)

Zhengfang Duanmu , Dept. of Electrical & Computer Engineering, University of Waterloo, Waterloo, ON, Canada
Abdul Rehman , Dept. of Electrical & Computer Engineering, University of Waterloo, Waterloo, ON, Canada
Kai Zeng , Dept. of Electrical & Computer Engineering, University of Waterloo, Waterloo, ON, Canada
Zhou Wang , Dept. of Electrical & Computer Engineering, University of Waterloo, Waterloo, ON, Canada
pp. 1-6

Region similarity arrangement for image retrieval (Abstract)

Jingya Tang , Institute of Computing Technology, Chinese Academy of Sciences
Dongming Zhang , Institute of Computing Technology, Chinese Academy of Sciences
Yongdong Zhang , Institute of Computing Technology, Chinese Academy of Sciences
Qi Tian , Department of Computer Science, University of Texas at San Antonio
pp. 1-6

Speed-adaptive street view image generation using driving video recorder (Abstract)

Hua-Tsung Chen , Information & Communications Technology Lab, National Chiao Tung University, Hsinchu, Taiwan
Devi Eddy , Department of Computer Science, National Chiao Tung University, Hsinchu, Taiwan
Ruei-Lin Chen , Department of Computer Science, National Chiao Tung University, Hsinchu, Taiwan
Chien-Li Chou , Department of Computer Science, National Chiao Tung University, Hsinchu, Taiwan
pp. 1-6

Robust feature encoding for age-invariant face recognition (Abstract)

Xiaonan Hou , Department of Computer Science and Engineering, Shanghai Jiao Tong University, China
Shouhong Ding , Department of Computer Science and Engineering, Shanghai Jiao Tong University, China
Lizhuang Ma , Department of Computer Science and Engineering, Shanghai Jiao Tong University, China
pp. 1-6

Subjective-quality-optimized complexity control for HEVC decoding (Abstract)

Ren Yang , The School of Electronic and Information Engineering, Beihang University, China
Mai Xu , The School of Electronic and Information Engineering, Beihang University, China
Lai Jiang , The School of Electronic and Information Engineering, Beihang University, China
Zulin Wang , The School of Electronic and Information Engineering, Beihang University, China
pp. 1-6

A robust automatic object segmentation method for 3D printing (Abstract)

Tzu-Kuei Huang , National Taiwan University
Ying-Hsuang Wang , National Taiwan University
Ta-Kai Lin , National Taiwan University
Yung-Yu Chuang , National Taiwan University
pp. 1-6

Weakly supervised image parsing by discriminatively semantic graph propagation (Abstract)

Xiaocheng Xu , School of Computer Science and Technology, Shandong University
Jun Ma , School of Computer Science and Technology, Shandong University
pp. 1-6

Robust image matching via feature guided Gaussian mixture model (Abstract)

Jiayi Ma , Electronic Information School, Wuhan University, Wuhan 430072, China
Junjun Jiang , School of Computer Science, China University of Geosciences, Wuhan 430074, China
Yuan Gao , Department of Electronic Engineering, City University of Hong Kong, Hong Kong
Jun Chen , School of Automation, China University of Geosciences, Wuhan 430074, China
Chengyin Liu , Electronic Information School, Wuhan University, Wuhan 430072, China
pp. 1-6

A soundtrack generation system to synchronize the climax of a video clip with music (Abstract)

Haruki Sato , Waseda University
Tatsunori Hirai , Waseda University
Tomoyasu Nakano , National Institute of Advanced Industrial Science and Technology (AIST)
Masataka Goto , National Institute of Advanced Industrial Science and Technology (AIST)
Shigeo Morishima , Waseda Research Institute for Science and Engineering
pp. 1-6

Person re-identification via rich color-gradient feature (Abstract)

Lingxiang Wu , Global Big Data Technologies Centre, University of Technology Sydney, Australia
Jinqiao Wang , National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China, 100190
Guibo Zhu , National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China, 100190
Min Xu , Global Big Data Technologies Centre, University of Technology Sydney, Australia
Hanqing Lu , National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China, 100190
pp. 1-6

Study on the field test result of mobile MMT trial service over LTE network at open dense area, subway and high speed train (Abstract)

Hyunmin Jang , Network O&M Division, SK Telecom, Korea (Republic of)
Jongmin Lee , Network Technology R&D Center, SK Telecom, Korea (Republic of)
Hyeonmin Choi , Network O&M Division, SK Telecom, Korea (Republic of)
Sungmin Cho , Network Technology R&D Center, SK Telecom, Korea (Republic of)
pp. 1-3

A pipeline-based runtime technique for improving Ray-Tracing on HSA-compliant systems (Abstract)

Chih-Chen Kao , National Taiwan University, Department of Computer Science & Information Engineering
Yu-Tsung Miao , National Taiwan University, Department of Computer Science & Information Engineering
Wei-Chung Hsu , National Taiwan University, Department of Computer Science & Information Engineering
pp. 1-6

Factors affecting user preference for mobile video quality (Abstract)

Andreea Molnar , University of Portsmouth, School of Creative Technologies, Winston Churchill Ave, Portsmouth, United Kingdom
pp. 1-6

Learning-based quality assessment of retargeted stereoscopic images (Abstract)

Yi Liu , Department of Computer Science and Technology, Tsinghua University, Tsinghua NLIST. Beijing, China
Lifeng Sun , Department of Computer Science and Technology, Tsinghua University, Tsinghua NLIST. Beijing, China
Shiqiang Yang , Department of Computer Science and Technology, Tsinghua University, Tsinghua NLIST. Beijing, China
pp. 1-6

BLeSS: Bio-inspired low-level spatiochromatic similarity assisted image quality assessment (Abstract)

Dogancan Temel , Center for Signal and Information Processing (CSIP) School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA, 30332-0250 USA
Ghassan AlRegib , Center for Signal and Information Processing (CSIP) School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA, 30332-0250 USA
pp. 1-6

Adaptive background for real-time visual tracking (Abstract)

He Li , School of Remote Sensing and Information Engineering, Wuhan University, China
Daiqin Yang , School of Remote Sensing and Information Engineering, Wuhan University, China
Zhenzhong Chen , School of Remote Sensing and Information Engineering, Wuhan University, China
pp. 1-6

Codec independent region of interest video coding using a joint pre- and postprocessing framework (Abstract)

Holger Meuel , Institut für Informationsverarbeitung, Leibniz Universität Hannover, Germany
Marco Munderloh , Institut für Informationsverarbeitung, Leibniz Universität Hannover, Germany
Florian Kluger , Institut für Informationsverarbeitung, Leibniz Universität Hannover, Germany
Jorn Ostermann , Institut für Informationsverarbeitung, Leibniz Universität Hannover, Germany
pp. 1-6

Robust low rank dynamic mode decomposition for compressed domain crowd and traffic flow analysis (Abstract)

Caglayan Dicle , Northeastern University, Boston, MA
Hassan Mansour , Mitsubishi Electric Research Laboratories, Cambridge, MA
Dong Tian , Mitsubishi Electric Research Laboratories, Cambridge, MA
Mouhacine Benosman , Mitsubishi Electric Research Laboratories, Cambridge, MA
Anthony Vetro , Mitsubishi Electric Research Laboratories, Cambridge, MA
pp. 1-6

Recognizing heterogeneous cross-domain data via generalized joint distribution adaptation (Abstract)

Yuan-Ting Hsieh , Department of Electrical Engineering, National Taiwan University, Taipei, Taiwan
Shi-Yen Tao , Department of Electrical Engineering, National Taiwan University, Taipei, Taiwan
Yao-Hung Hubert Tsai , Research Center for IT Innovation, Academia Sinica, Taipei, Taiwan
Yi-Ren Yeh , Department of Mathematics, National Kaohsiung Normal University, Kaohsiung, Taiwan
Yu-Chiang Frank Wang , Research Center for IT Innovation, Academia Sinica, Taipei, Taiwan
pp. 1-6

Parameterized reconstruction based Fourier Ptychography (Abstract)

Weixin Jiang , Shenzhen Key Lab of Broadband Network and Multimedia, Graduate School at Shenzhen, Tsinghua University, Shenzhen, 518055, China
Yongbing Zhang , Shenzhen Key Lab of Broadband Network and Multimedia, Graduate School at Shenzhen, Tsinghua University, Shenzhen, 518055, China
Qionghai Dai , Shenzhen Key Lab of Broadband Network and Multimedia, Graduate School at Shenzhen, Tsinghua University, Shenzhen, 518055, China
pp. 1-6

Structure-regularized compressive tracking (Abstract)

Qing Guo , School of Computer Science and Technology, Tianjin University, Tianjin, China
Wei Feng , School of Computer Science and Technology, Tianjin University, Tianjin, China
Ce Zhou , School of Computer Science and Technology, Tianjin University, Tianjin, China
Bin Wu , School of Computer Science and Technology, Tianjin University, Tianjin, China
pp. 1-6

A novel obstacle detection method based on distortion of laser pattern (Abstract)

Zichao Guo , Key Laboratory of Intelligent Information Processing & Beijing Key Laboratory of Mobile Computing and Pervasive Device, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190
Hong Liu , Key Laboratory of Intelligent Information Processing & Beijing Key Laboratory of Mobile Computing and Pervasive Device, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190
Yueliang Qian , Key Laboratory of Intelligent Information Processing & Beijing Key Laboratory of Mobile Computing and Pervasive Device, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190
Xiangdong Wang , Key Laboratory of Intelligent Information Processing & Beijing Key Laboratory of Mobile Computing and Pervasive Device, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190
pp. 1-6

Learning a pose lexicon for semantic action recognition (Abstract)

Lijuan Zhou , School of Computing and Information Technology, University of Wollongong, Keiraville, NSW 2522, Australia
Wanqing Li , School of Computing and Information Technology, University of Wollongong, Keiraville, NSW 2522, Australia
Philip Ogunbona , School of Computing and Information Technology, University of Wollongong, Keiraville, NSW 2522, Australia
pp. 1-6

Describing images by feeding LSTM with structural words (Abstract)

Shubo Ma , School of Computer Software, Tianjin University, Tianjin, China
Yahong Han , School of Computer Science and Technology, Tianjin University, Tianjin, China
pp. 1-6

Cross-modal hashing through ranking subspace learning (Abstract)

Kai Li , Department of Computer Science, University of Central Florida, USA
Guojun Qi , Department of Computer Science, University of Central Florida, USA
Jun Ye , Department of Computer Science, University of Central Florida, USA
Kien A. Hua , Department of Computer Science, University of Central Florida, USA
pp. 1-6

Distance learning by treating negative samples differently and exploiting impostors with symmetric triplet constraint for person re-identification (Abstract)

Xiaoke Zhu , State Key Laboratory of Software Engineering, School of Computer, Wuhan University
Xiao-Yuan Jing , State Key Laboratory of Software Engineering, School of Computer, Wuhan University
Fei Wu , State Key Laboratory of Software Engineering, School of Computer, Wuhan University
Weishi Zheng , School of Data and Computer Science, Sun Yat-sen University
Ruimin Hu , State Key Laboratory of Software Engineering, School of Computer, Wuhan University
Chunxia Xiao , State Key Laboratory of Software Engineering, School of Computer, Wuhan University
Chao Liang , National Engineering Research Center for Multimedia Software, Computer School of Wuhan Univ.
pp. 1-6

Recognizing human actions from low-resolution videos by region-based mixture models (Abstract)

Ying Zhao , Beijing Laboratory of Intelligent Information Technology, Beijing Institute of Technology, Beijing, China
Huijun Di , Beijing Laboratory of Intelligent Information Technology, Beijing Institute of Technology, Beijing, China
Jian Zhang , Advanced Analytics Institute, University of Technology, Sydney, Australia
Yao Lu , Beijing Laboratory of Intelligent Information Technology, Beijing Institute of Technology, Beijing, China
Feng Lv , Beijing Laboratory of Intelligent Information Technology, Beijing Institute of Technology, Beijing, China
pp. 1-6

Adaptive affinity matrix for unsupervised metric learning (Abstract)

Yaoyi Li , Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Department of Computer Science and Engineering, Shanghai Jiao Tong University, P.R. China
Junxuan Chen , Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Department of Computer Science and Engineering, Shanghai Jiao Tong University, P.R. China
Yiru Zhao , Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Department of Computer Science and Engineering, Shanghai Jiao Tong University, P.R. China
Hongtao Lu , Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Department of Computer Science and Engineering, Shanghai Jiao Tong University, P.R. China
pp. 1-6

Efficient structure-preserving superpixel segmentation based on minimum spanning tree (Abstract)

Yu Bai , CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System, University of Science and Technology of China, Hefei Anhui, China
Xuejin Chen , CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System, University of Science and Technology of China, Hefei Anhui, China
pp. 1-6

Robust face image alignment using structural priors (Abstract)

Xiaojie Guo , State Key Laboratory Of Information Security, IIE, CAS
Dongdai Lin , State Key Laboratory Of Information Security, IIE, CAS
pp. 1-6

Inferring users' emotions for human-mobile voice dialogue applications (Abstract)

Boya Wu , Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
Jia Jia , Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
Tao He , College of Computer Science, Sichuan University, Chengdu 610065, China
Juan Du , Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
Xiaoyuan Yi , Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
Yishuang Ning , Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
pp. 1-6

Video object segmentation aggregation (Abstract)

Tianfei Zhou , Beijing Laboratory of Intelligent Information Technology
Yao Lu , Beijing Laboratory of Intelligent Information Technology
Huijun Di , Beijing Laboratory of Intelligent Information Technology
Jian Zhang , Faculty of Engineering and Information Technology, University of Technology Sydney
pp. 1-6

Digital forensics for printed character source identification (Abstract)

Min-Jen Tsai , Institute of Information Management, National Chiao Tung University, R.O.C.
Chien-Lun Hsu , Institute of Information Management, National Chiao Tung University, R.O.C.
Jin-Sheng Yin , Institute of Information Management, National Chiao Tung University, R.O.C.
Imam Yuadi , Institute of Information Management, National Chiao Tung University, R.O.C.
pp. 1-6

Visual data deblocking using structural layer priors (Abstract)

Siyuan Li , School of Computer Software, Tianjin University
Jiawan Zhang , School of Computer Software, Tianjin University
Xiaojie Guo , State Key Laboratory Of Information Security, IIE, CAS
pp. 1-6

A perceptually motivated approach via sparse and low-rank model for speech enhancement (Abstract)

Gang Min , Lab of Intelligent Information Processing, PLA University of Science and Technology, Nanjing, China
Xiongwei Zhang , Lab of Intelligent Information Processing, PLA University of Science and Technology, Nanjing, China
Jibin Yang , Lab of Intelligent Information Processing, PLA University of Science and Technology, Nanjing, China
Wei Han , Lab of Intelligent Information Processing, PLA University of Science and Technology, Nanjing, China
Xia Zou , Lab of Intelligent Information Processing, PLA University of Science and Technology, Nanjing, China
pp. 1-6

Learning kinematic model of targets in videos from fixed cameras (Abstract)

Xi En Cheng , School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, Shanghai, China
Shuo Hong Wang , School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, Shanghai, China
Yan Qiu Chen , School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, Shanghai, China
pp. 1-6

Unsupervised visual domain adaptation via dictionary evolution (Abstract)

Songsong Wu , School of Automation, Nanjing University of Posts and Telecommunications, China
Xiao-Yuan Jing , School of computer science, Wuhan University, China
Dong Yue , School of Automation, Nanjing University of Posts and Telecommunications, China
Jian Zhang , School of Computing and Communications, University of Technology Sydney, Australia
K Jian Yang , School of Computer Science and Engineering, Nanjing University of Science and Technology, China
Jingyu Yang , School of Computer Science and Engineering, Nanjing University of Science and Technology, China
pp. 1-6

Online video tracking using collaborative convolutional networks (Abstract)

Hao Guan , Shanghai Key Laboratory of Intelligent Information Processing, School of Computer Science, Fudan University, Shanghai, China
Xiangyang Xue , Shanghai Key Laboratory of Intelligent Information Processing, School of Computer Science, Fudan University, Shanghai, China
An Zhiyong , Shanghai Key Laboratory of Intelligent Information Processing, School of Computer Science, Fudan University, Shanghai, China
pp. 1-6

Spherical superpixel segmentation (Abstract)

Qiang Zhao , School of Computer Software, Tianiin University, Tianiin, China
Liang Wan , School of Computer Software, Tianiin University, Tianiin, China
Jiawan Zhang , School of Computer Software, Tianiin University, Tianiin, China
pp. 1-6

Multi-label active learning for image classification with asymmetrical conditional dependence (Abstract)

Jian Wu , The Institute of Intelligent Information Processing and Application, Soochow University, Suzhou 215006, China
Shiquan Zhao , The Institute of Intelligent Information Processing and Application, Soochow University, Suzhou 215006, China
Victor S. Sheng , Department of Computer Science, University of Central Arkansas, Conway 72035, USA
Pengpeng Zhao , The Institute of Intelligent Information Processing and Application, Soochow University, Suzhou 215006, China
Zhiming Cui , The Institute of Intelligent Information Processing and Application, Soochow University, Suzhou 215006, China
pp. 1-6

Example-based visual object counting with a sparsity constraint (Abstract)

Yi Wang , ADSPLAB/ELIP, School of ECE, Peking University, Shenzhen, 518055, China
Y. X. Zou , ADSPLAB/ELIP, School of ECE, Peking University, Shenzhen, 518055, China
Jin Chen , ADSPLAB/ELIP, School of ECE, Peking University, Shenzhen, 518055, China
Xiaolin Huang , ADSPLAB/ELIP, School of ECE, Peking University, Shenzhen, 518055, China
Cheng Cai , Department of Computer Science, College of Information Engineering, Northwest A&F University, Yangling, 712100, China
pp. 1-6

Depth-aware layered edge for object proposal (Abstract)

Jing Liu , State Key Laboratory for Novel Software Technology, Nanjing University, China
Tongwei Ren , State Key Laboratory for Novel Software Technology, Nanjing University, China
Bing-Kun Bao , State Key Laboratory for Novel Software Technology, Nanjing University, China
Jia Bei , State Key Laboratory for Novel Software Technology, Nanjing University, China
pp. 1-6

One-shot deep neural network for pose and illumination normalization face recognition (Abstract)

Zhongjun Wu , Beijing University of Posts and Telecommunications, No. 10, Xitu Cheng Road, Haidian District, Beijing, China, 100876
Weihong Deng , Beijing University of Posts and Telecommunications, No. 10, Xitu Cheng Road, Haidian District, Beijing, China, 100876
pp. 1-6

Boosted local classifiers for visual tracking (Abstract)

Weijian Ruan , State Key Laboratory of Software Engineering, Wuhan University, China
Jun Chen , State Key Laboratory of Software Engineering, Wuhan University, China
Jinqiao Wang , National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, China
Bo Luo , State Key Laboratory of Software Engineering, Wuhan University, China
Wenjun Huang , State Key Laboratory of Software Engineering, Wuhan University, China
Ruimin Hu , State Key Laboratory of Software Engineering, Wuhan University, China
pp. 1-6

High-order directional features and sparse representation based classification for in-air handwritten Chinese character recognition (Abstract)

Xiwen Qu , School of Computer and Control Engineering, University of Chinese Academy of Sciences, Beijing, China
Weiqiang Wang , School of Computer and Control Engineering, University of Chinese Academy of Sciences, Beijing, China
Ke Lu , School of Computer and Control Engineering, University of Chinese Academy of Sciences, Beijing, China
Zhangjian Ji , School of Computer & Information Technology, Shanxi University, Taiyuan, China
pp. 1-6

Cost-sensitive sparse linear regression for crowd counting with imbalanced training data (Abstract)

Xiaolin Huang , ADSPLAB/ELIP, School of ECE, Peking University, Shenzhen, 518055, China
Yuexian Zou , ADSPLAB/ELIP, School of ECE, Peking University, Shenzhen, 518055, China
Yi Wang , ADSPLAB/ELIP, School of ECE, Peking University, Shenzhen, 518055, China
pp. 1-6

Attribute-based multi-dimension scalable access control for social media sharing (Abstract)

Changsha Ma , Dept. of Comp. Sci. and Eng., State Univ. of New York at Buffalo, Buffalo, NY, 14260, USA
Zhisheng Yan , Dept. of Comp. Sci. and Eng., State Univ. of New York at Buffalo, Buffalo, NY, 14260, USA
Chang Wen Chen , Dept. of Comp. Sci. and Eng., State Univ. of New York at Buffalo, Buffalo, NY, 14260, USA
pp. 1-6

Salient object detection for RGB-D image via saliency evolution (Abstract)

Jingfan Quo , State Key Laboratory for Novel Software Technology, Nanjing University, China
Tongwei Ren , State Key Laboratory for Novel Software Technology, Nanjing University, China
Jia Bei , State Key Laboratory for Novel Software Technology, Nanjing University, China
pp. 1-6

Local- and holistic-structure preserving image super resolution via deep joint component learning (Abstract)

Yukai Shi , Sun Yat-Sen University, Guangzhou, China
Keze Wang , Sun Yat-Sen University, Guangzhou, China
Li Xu , SenseTime Group Limited
Liang Lin , Sun Yat-Sen University, Guangzhou, China
pp. 1-6

A Bayesian hierarchical appearance model for robust object tracking (Abstract)

Raed Almomani , Wayne State University, Computer Science Department, Detroit, MI 48202
Ming Dong , Wayne State University, Computer Science Department, Detroit, MI 48202
Dongxiao Zhu , Wayne State University, Computer Science Department, Detroit, MI 48202
pp. 1-6

Learning deep classifiers with deep features (Abstract)

Jie Lei , Zhejiang University, Hangzhou, 310027, P.R. China
Xinhui Song , Zhejiang University, Hangzhou, 310027, P.R. China
Li Sun , Zhejiang University, Hangzhou, 310027, P.R. China
Mingli Song , Zhejiang University, Hangzhou, 310027, P.R. China
Na Li , Zhejiang International Studies University, Hangzhou, 310027, P.R. China
Chun Chen , Zhejiang University, Hangzhou, 310027, P.R. China
pp. 1-6

POM: Power efficient multi-view video streaming over multi-antenna wireless systems (Abstract)

Zhe Chen , School of Computer Science, Fudan University, Shanghai, China
Xu Zhang , School of Computer Science, Fudan University, Shanghai, China
Yuedong Xu , School of Information Science and Technology, Fudan University, Shanghai, China
Xin Wang , School of Computer Science, Fudan University, Shanghai, China
pp. 1-6

Using business-aware latent topics for image captioning in social media (Abstract)

Yan-Ting Chen , FX Palo Alto Laboratory, Inc., Palo Alto, California, USA
Francine Chen , FX Palo Alto Laboratory, Inc., Palo Alto, California, USA
Matthew Cooper , FX Palo Alto Laboratory, Inc., Palo Alto, California, USA
Dhiraj Joshi , FX Palo Alto Laboratory, Inc., Palo Alto, California, USA
pp. 1-6

Understanding spatial correlation in eye-fixation maps for visual attention in videos (Abstract)

Tariq Alshawi , Center for Signal and Information Processing (CSIP) School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA 30332-0250, USA
Zhiling Long , Center for Signal and Information Processing (CSIP) School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA 30332-0250, USA
Ghassan AlRegib , Center for Signal and Information Processing (CSIP) School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA 30332-0250, USA
pp. 1-6

Robust online face tracking-by-detection (Abstract)

Francesco Comaschi , Eindhoven University of Technology, The Netherlands
Sander Stuijk , Eindhoven University of Technology, The Netherlands
Twan Basten , Eindhoven University of Technology, The Netherlands
Henk Corporaal , Eindhoven University of Technology, The Netherlands
pp. 1-6

IBC127: Video dataset for fine-grained bird classification (Abstract)

Tomoaki Saito , Graduate School of Information Science and Technology, The University of Tokyo
Asako Kanezaki , Graduate School of Information Science and Technology, The University of Tokyo
Tatsuya Harada , Graduate School of Information Science and Technology, The University of Tokyo
pp. 1-6

Geometry-aware metric learning for similar face recognition (Abstract)

Nanhai Zhang , Beijing University of Posts and Telecommunications, Beijing, China
Jiajie Han , Beijing University of Posts and Telecommunications, Beijing, China
Jiani Hu , Beijing University of Posts and Telecommunications, Beijing, China
Weihong Deng , Beijing University of Posts and Telecommunications, Beijing, China
pp. 1-6

Phonetic posteriorgrams for many-to-one voice conversion without parallel data training (Abstract)

Lifa Sun , Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, Hong Kong SAR, China
Kun Li , Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, Hong Kong SAR, China
Hao Wang , Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, Hong Kong SAR, China
Shiyin Kang , Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, Hong Kong SAR, China
Helen Meng , Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, Hong Kong SAR, China
pp. 1-6

BCA: Bi-symmetric component analysis for temporal symmetry in human actions (Abstract)

Chenyang Zhang , The City College of New York
Yingli Tian , The City College of New York
pp. 1-6

Robust latent poisson deconvolution from multiple imperfect features for web topic detection (Abstract)

Fei Tao , School of Computer and Control Engineering, University of Chinese Academy of Sciences, Beijing, China
Junbiao Pang , Beijing Key Laboratory of Multimedia and Intelligent Software Technology, College of Metropolitan Transportation, Beijing University of Technology, China
Chunjie Zhang , School of Computer and Control Engineering, University of Chinese Academy of Sciences, Beijing, China
Liang Li , School of Computer and Control Engineering, University of Chinese Academy of Sciences, Beijing, China
Li Su , School of Computer and Control Engineering, University of Chinese Academy of Sciences, Beijing, China
Weigang Zhang , School of Computer Science and Technology, Harbin Institute of Technology, China
Qingming Huang , School of Computer and Control Engineering, University of Chinese Academy of Sciences, Beijing, China
Guiping Su , School of Computer and Control Engineering, University of Chinese Academy of Sciences, Beijing, China
pp. 1-6

Memory-based object detection in surveillance scenes (Abstract)

Xudong Li , School of Computer Science and Engineering, Center for Robotics, Key Laboratory for NeuroInformation of Ministry of Education, University of Electronic Science and Technology of China, Chengdu 611731, P.R. China
Mao Ye , School of Computer Science and Engineering, Center for Robotics, Key Laboratory for NeuroInformation of Ministry of Education, University of Electronic Science and Technology of China, Chengdu 611731, P.R. China
Dan Liu , School of Computer Science and Engineering, Center for Robotics, Key Laboratory for NeuroInformation of Ministry of Education, University of Electronic Science and Technology of China, Chengdu 611731, P.R. China
Feng Zhang , School of Computer Science and Engineering, Center for Robotics, Key Laboratory for NeuroInformation of Ministry of Education, University of Electronic Science and Technology of China, Chengdu 611731, P.R. China
Song Tang , School of Computer Science and Engineering, Center for Robotics, Key Laboratory for NeuroInformation of Ministry of Education, University of Electronic Science and Technology of China, Chengdu 611731, P.R. China
pp. 1-6

Explicit modeling on depth-color inconsistency for color-guided depth up-sampling (Abstract)

Y. Zuo , School of Communication and Information Engineering, Shanghai University, China, 200072
Q. Wu , Global Big Data Technologies Centre, University of Technology, Sydney, Australia, NSW 2007
J. Zhang , Global Big Data Technologies Centre, University of Technology, Sydney, Australia, NSW 2007
P. An , School of Communication and Information Engineering, Shanghai University, China, 200072
pp. 1-6

Sparse two-dimensional singular value decomposition (Abstract)

Junhui Hou , School of Electrical and Electronics Engineering, Nanyang Technological University, Singapore 639798
Jie Chen , School of Electrical and Electronics Engineering, Nanyang Technological University, Singapore 639798
Lap-Pui Chau , School of Electrical and Electronics Engineering, Nanyang Technological University, Singapore 639798
Ying He , School of Computer Engineering, Nanyang Technological University, Singapore 639798
pp. 1-6

On-premise signs detection and recognition using fully convolutional networks (Abstract)

Yong-Xiang Wang , National Cheng Kung University
Chih-Hsin Hsueh , National Cheng Kung University
Hung-Yi Loo , Foxconn Technology Co.
Min-Chun Hu , National Cheng Kung University
pp. 1-6

Deep conditional neural network for image segmentation (Abstract)

Qiurui Wang , Department of Computer Science, Tsinghua University
Chun Yuan , Department of Computer Science, Tsinghua University
Yan Liu , Department of Computing, The Hong Kong Polytechnic University
pp. 1-6

User-oriented stereo video refocusing by computational cinematographic model (Abstract)

Wenjing Geng , State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210023, China
Dapeng Du , State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210023, China
Tongwei Ren , State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210023, China
Gangshan Wu , State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210023, China
pp. 1-6

Example-based video color transfer (Abstract)

Chun-Han Yao , Media IC and System Lab, Graduate Institute of Electronics Engineering and Dept. of Electrical Engineering, National Taiwan University
Chia-Yang Chang , Media IC and System Lab, Graduate Institute of Electronics Engineering and Dept. of Electrical Engineering, National Taiwan University
Shao-Yi Chien , Media IC and System Lab, Graduate Institute of Electronics Engineering and Dept. of Electrical Engineering, National Taiwan University
pp. 1-6

Deep learning based supervised hashing for efficient image retrieval (Abstract)

Viet-Anh Nguyen , Advanced Digital Sciences Center (ADSC), Singapore
Minh N. Do , University of Illinois at Urbana-Champaign, IL, USA
pp. 1-6

Shape-optimizing hybrid warping for image stitching (Abstract)

Qingpeng Chai , School of Computer Science and Technology, Tianjin University, P.R. China
Shiguang Liu , School of Computer Science and Technology, Tianjin University, P.R. China
pp. 1-6

Video saliency prediction with optimized optical flow and gravity center bias (Abstract)

Zhe Wu , Key Lab on Big Data Mining and Knowledge Management, University of Chinese Academy of Sciences, Beijing, China
Li Su , Key Lab on Big Data Mining and Knowledge Management, University of Chinese Academy of Sciences, Beijing, China
Qingming Huang , Key Lab on Big Data Mining and Knowledge Management, University of Chinese Academy of Sciences, Beijing, China
Bo Wu , Capital Medical University, Beijing, China
Jian Li , Beijing University of Posts and Telecommunications, Beijing, China
Guorong Li , Key Lab on Big Data Mining and Knowledge Management, University of Chinese Academy of Sciences, Beijing, China
pp. 1-6

Crowd video retrieval via deep attribute-embedding graph ranking (Abstract)

Yanhao Zhang , School of Computer Science, Harbin Institute of Technology, Harbin, 150001, China
Lei Qin , Inst. of Comput. Tech., Chinese Academy of Sciences, Beijing, 100190, China
Sicheng Zhao , School of Computer Science, Harbin Institute of Technology, Harbin, 150001, China
Rongrong Ji , School of Information Science and Engineering, Xiamen University, Xiamen, 361005, China
Xiusheng Lu , School of Computer Science, Harbin Institute of Technology, Harbin, 150001, China
Hongxun Yao , School of Computer Science, Harbin Institute of Technology, Harbin, 150001, China
Qingming Huang , School of Computer Science, Harbin Institute of Technology, Harbin, 150001, China
pp. 1-6

3D video super-resolution using fully convolutional neural networks (Abstract)

Yanchun Xie , Xi'an Jiaotong - Liverpool University
Jimin Xiao , Xi'an Jiaotong - Liverpool University
Tammam Tillo , Xi'an Jiaotong - Liverpool University
Yunchao Wei , Beijing Jiaotong University
Yao Zhao , Beijing Jiaotong University
pp. 1-6

Bayesian relevance feedback based Chinese calligraphy character synthesis (Abstract)

Xueying Du , Zhejiang University, College of Computer Science, Hangzhou, China
Jiangqin Wu , Zhejiang University, College of Computer Science, Hangzhou, China
Yang Xia , Zhejiang University, College of Computer Science, Hangzhou, China
pp. 1-6

A pair hidden Markov support vector machine for alignment of human actions (Abstract)

Zhen Wang , Global Big Data Technologies Centre, University of Technology Sydney, Australia
Massimo Piccardi , Global Big Data Technologies Centre, University of Technology Sydney, Australia
pp. 1-6

With one look: 3D face shape estimation from a single snapshot (Abstract)

Chia-Po Wei , Research Center for Information Technology Innovation, Academia Sinica, Taipei, Taiwan
Yu-Chiang Frank Wang , Research Center for Information Technology Innovation, Academia Sinica, Taipei, Taiwan
pp. 1-6

Graph-based web video search reranking through consistency analysis using spectral clustering (Abstract)

Soh Yoshida , Graduate School of Information Science and Technology, Hokkaido University, N-14, W-9, Kita-ku, Sapporo, 060-0814, Japan
Takahiro Ogawa , Graduate School of Information Science and Technology, Hokkaido University, N-14, W-9, Kita-ku, Sapporo, 060-0814, Japan
Miki Haseyama , Graduate School of Information Science and Technology, Hokkaido University, N-14, W-9, Kita-ku, Sapporo, 060-0814, Japan
pp. 1-6

Distortion recognition for image quality assessment with convolutional neural network (Abstract)

Hanli Wang , Department of Computer Science and Technology, Tongji University, Shanghai, P.R. China
Lingxuan Zuo , Department of Computer Science and Technology, Tongji University, Shanghai, P.R. China
Jie Fu , Department of Computer Science and Technology, Tongji University, Shanghai, P.R. China
pp. 1-6

Egocentric activity recognition by leveraging multiple mid-level representations (Abstract)

Peng-Ju Hsieh , National Taiwan University, Taipei, Taiwan
Yen-Liang Lin , National Taiwan University, Taipei, Taiwan
Yu-Hsiu Chen , National Taiwan University, Taipei, Taiwan
Winston Hsu , National Taiwan University, Taipei, Taiwan
pp. 1-6

Human action recognition-based video summarization for RGB-D personal sports video (Abstract)

Antonio Tejero-de-Pablos , Nara Institute of Science and Technology, Japan
Yuta Nakashima , Nara Institute of Science and Technology, Japan
Tomokazu Sato , Nara Institute of Science and Technology, Japan
Naokazu Yokoya , Nara Institute of Science and Technology, Japan
pp. 1-6

Efficient plenoptic imaging representation: Why do we need it? (Abstract)

Fernando Pereira , Instituto Superior Técnico - Instituto de Telecomunicações, Portugal
Eduardo A. B. da Silva , PEE/COPPE, Universidade Federal do Rio de Janeiro, Brazil
pp. 1-6

Digital holography: Benchmarking coding standards and representation formats (Abstract)

Jose Peixeiro , Instituto Superior Técnico, University of Lisbon, Portugal
Catarina Brites , Instituto Superior Técnico, University of Lisbon, Portugal
Joao Ascenso , Instituto Superior Técnico, University of Lisbon, Portugal
Fernando Pereira , Instituto Superior Técnico, University of Lisbon, Portugal
pp. 1-6

Recognize human activities from multi-part missing videos (Abstract)

Kaiping Xu , School of Software, Tsinghua University
Zheng Qin , School of Software, Tsinghua University
Guolong Wang , School of Software, Tsinghua University
pp. 1-6

Iterative color-depth MST cost aggregation for stereo matching (Abstract)

Peng Yao , Key Laboratory of Computer Vision and System, Tianjin University of Technology, Tianjin, China
Hua Zhang , Key Laboratory of Computer Vision and System, Tianjin University of Technology, Tianjin, China
Yanbing Xue , Key Laboratory of Computer Vision and System, Tianjin University of Technology, Tianjin, China
Mian Zhou , Key Laboratory of Computer Vision and System, Tianjin University of Technology, Tianjin, China
Guangping Xu , Key Laboratory of Computer Vision and System, Tianjin University of Technology, Tianjin, China
Zan Gao , Key Laboratory of Computer Vision and System, Tianjin University of Technology, Tianjin, China
pp. 1-6

Generalized residual vector quantization for large scale data (Abstract)

Shicong Liu , Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Department of Computer Science and Engineering, Shanghai Jiao Tong University, P.R. China
Junru Shao , Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Department of Computer Science and Engineering, Shanghai Jiao Tong University, P.R. China
Hongtao Lu , Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Department of Computer Science and Engineering, Shanghai Jiao Tong University, P.R. China
pp. 1-6

Improving the similarity estimation via score distribution (Abstract)

Lixin Liao , Institute of Information Science, Beijing Jiaotong University, Beijing, 100044, China
Shikui Wei , Institute of Information Science, Beijing Jiaotong University, Beijing, 100044, China
Yao Zhao , Institute of Information Science, Beijing Jiaotong University, Beijing, 100044, China
Guanghua Gu , School of Information Science and Engineering, Yanshan University, Qinhuangdao, 130300, China
pp. 1-6

Multichannel reduction based on sound field within two ears (Abstract)

Dengshi Li , National Engineering Research Center for Multimedia Software, Computer School of Wuhan Univ., China
Ruimin Hu , State Key Laboratory of Software Engineering, Wuhan Univ., China
Xiaochen Wang , National Engineering Research Center for Multimedia Software, Computer School of Wuhan Univ., China
Guo Wu , National Engineering Research Center for Multimedia Software, Computer School of Wuhan Univ., China
Zheng Zhang , National Engineering Research Center for Multimedia Software, Computer School of Wuhan Univ., China
Weiping Tu , National Engineering Research Center for Multimedia Software, Computer School of Wuhan Univ., China
pp. 1-6

Discovering latent affective dynamics among individuals in online mental health-related communities (Abstract)

Bo Dao , Center for Pattern Recognition and Data Analytics(PRaDA) Deakin University, Geelong, Australia
Thin Nguyen , Center for Pattern Recognition and Data Analytics(PRaDA) Deakin University, Geelong, Australia
Svetha Venkatesh , Center for Pattern Recognition and Data Analytics(PRaDA) Deakin University, Geelong, Australia
Dinh Phung , Center for Pattern Recognition and Data Analytics(PRaDA) Deakin University, Geelong, Australia
pp. 1-6

A general PID-based rate adaptation approach for TCP-based live streaming over mobile networks (Abstract)

Jiexi Wang , Institute of Computer Science & Technology, Peking University
Shengbin Meng , Institute of Computer Science & Technology, Peking University
Jun Sun , Institute of Computer Science & Technology, Peking University
Zongming Quo , Institute of Computer Science & Technology, Peking University
pp. 1-6

Reliably detecting humans in crowded and dynamic environments using RGB-D camera (Abstract)

Luchao Tian , School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, China
Guyue Zhang , School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, China
Mingchen Li , School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, China
Jun Liu , School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, China
Yan Qiu Chen , School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, China
pp. 1-6

Chinese sign language recognition with adaptive HMM (Abstract)

Jihai Zhang , University of Science and Technology of China, Hefei, China
Wengang Zhou , University of Science and Technology of China, Hefei, China
Chao Xie , University of Science and Technology of China, Hefei, China
Junfu Pu , University of Science and Technology of China, Hefei, China
Houqiang Li , University of Science and Technology of China, Hefei, China
pp. 1-6

A comparative evaluation: Different methods for simplifying the deep compositional features (Abstract)

Shuang Qiu , Institute of Information Science, Beijing Jiaotong University, Beijing, 100044, China
Shikui Wei , Institute of Information Science, Beijing Jiaotong University, Beijing, 100044, China
Yao Zhao , Institute of Information Science, Beijing Jiaotong University, Beijing, 100044, China
pp. 1-6

Exploring auditory network composition during free listening to audio excerpts via group-wise sparse representation (Abstract)

Shijie Zhao , School of Automation, Northwestern Polytechnical University, Xi'an, China
Junwei Han , School of Automation, Northwestern Polytechnical University, Xi'an, China
Xi Jiang , Cortical Architecture Imaging and Discovery Lab, Department of Computer Science and Bioimaging Research Center, The University of Georgia, GA, USA
Xintao Hu , School of Automation, Northwestern Polytechnical University, Xi'an, China
Jinglei Lv , School of Automation, Northwestern Polytechnical University, Xi'an, China
Shu Zhang , Cortical Architecture Imaging and Discovery Lab, Department of Computer Science and Bioimaging Research Center, The University of Georgia, GA, USA
Bao Ge , School of Physics & Information Technology, Shaanxi Normal University, Xi'an, China
Lei Guo , School of Automation, Northwestern Polytechnical University, Xi'an, China
Tianming Liu , Cortical Architecture Imaging and Discovery Lab, Department of Computer Science and Bioimaging Research Center, The University of Georgia, GA, USA
pp. 1-6

SATD-based joint decision algorithm for parallelized intra prediction encoder in H.265/HEVC (Abstract)

Yao-Jen Chang , Industrial Technology Research Institute (ITRI), Chutung, Hsinchu, Taiwan (R.O.C.)
Pei-Hsuan Tsat , Institute of Manufacturing Information and Systems, National Cheng Kung University, Tainan, Taiwan (R.O.C.)
Chun-Lung Lin , Industrial Technology Research Institute (ITRI), Chutung, Hsinchu, Taiwan (R.O.C.)
pp. 1-6

Compact and robust video fingerprinting using sparse represented features (Abstract)

Bo Wu , School of Biomedical Engineering, Capital Medical University, Beijing, China
Sridhar Sri Krishnan , Department of Electrical and Computer Engineering, Ryerson University, Toronto, ON, Canada
Nan Zhang , School of Biomedical Engineering, Capital Medical University, Beijing, China
Li Su , School of Computer and Control Engineering, University of CAS, Beijing, China
pp. 1-6

Blind quality assessment of compressed images via pseudo structural similarity (Abstract)

Xiongkuo Min , Institute of Image Communication and Network Engineering, Shanghai Jiao Tong University, China
Guangtao Zhai , Institute of Image Communication and Network Engineering, Shanghai Jiao Tong University, China
Ke Gu , Institute of Image Communication and Network Engineering, Shanghai Jiao Tong University, China
Yuming Fang , School of Information Technology, Jiangxi University of Finance and Economics, China
Xiaokang Yang , Institute of Image Communication and Network Engineering, Shanghai Jiao Tong University, China
Xiaolin Wu , Institute of Image Communication and Network Engineering, Shanghai Jiao Tong University, China
Jiantao Zhou , Department of Computer and Information Science, University of Macau, China
Xianming Liu , School of Computer Science and Technology, Harbin Institute of Technology, China
pp. 1-6

DBLSTM-based multi-scale fusion for dynamic emotion prediction in music (Abstract)

Xinxing Li , Key Laboratory of Pervasive Computing, Ministry of Education, Tsinghua National Laboratory for Information Science and Technology (TNList) Department of Computer Science and Technology, Tsinghua University, Beijing, China
Jiashen Tian , Key Laboratory of Pervasive Computing, Ministry of Education, Tsinghua National Laboratory for Information Science and Technology (TNList) Department of Computer Science and Technology, Tsinghua University, Beijing, China
Mingxing Xu , Key Laboratory of Pervasive Computing, Ministry of Education, Tsinghua National Laboratory for Information Science and Technology (TNList) Department of Computer Science and Technology, Tsinghua University, Beijing, China
Yishuang Ning , Key Laboratory of Pervasive Computing, Ministry of Education, Tsinghua National Laboratory for Information Science and Technology (TNList) Department of Computer Science and Technology, Tsinghua University, Beijing, China
Lianhong Cai , Key Laboratory of Pervasive Computing, Ministry of Education, Tsinghua National Laboratory for Information Science and Technology (TNList) Department of Computer Science and Technology, Tsinghua University, Beijing, China
pp. 1-6

Joint optimization of audible noise suppression and deep neural networks for single-channel speech enhancement (Abstract)

Wei Han , Lab of Intelligent Information Processing, PLAUST, Nanjing, China
Xiongwei Zhang , Lab of Intelligent Information Processing, PLAUST, Nanjing, China
Gang Min , Lab of Intelligent Information Processing, PLAUST, Nanjing, China
Meng Sun , Lab of Intelligent Information Processing, PLAUST, Nanjing, China
Jibin Yang , Lab of Intelligent Information Processing, PLAUST, Nanjing, China
pp. 1-6

Quality assessment of image patches distorted by image compression using crowdsourcing (Abstract)

Sebastian Bosse , Fraunhofer Institute for Telecommunications, Heinrich Hertz Institute, Berlin, Germany
Mischa Siekmann , Fraunhofer Institute for Telecommunications, Heinrich Hertz Institute, Berlin, Germany
Jennifer Rasch , Fraunhofer Institute for Telecommunications, Heinrich Hertz Institute, Berlin, Germany
Thomas Wiegand , Fraunhofer Institute for Telecommunications, Heinrich Hertz Institute, Berlin, Germany
Wojciech Samek , Fraunhofer Institute for Telecommunications, Heinrich Hertz Institute, Berlin, Germany
pp. 1-6

Hand gesture recognition based on canonical formed superpixel earth mover's distance (Abstract)

Chong Wang , Ningbo University
Zhong Liu , The University of Hong Kong
Jieyu Zhao , Ningbo University
pp. 1-6

Online self-organizing hashing (Abstract)

Junxuan Chen , Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Department of Computer Science and Engineering, Shanghai Jiao Tong University, P.R. China
Yaoyi Li , Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Department of Computer Science and Engineering, Shanghai Jiao Tong University, P.R. China
Hongtao Lu , Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Department of Computer Science and Engineering, Shanghai Jiao Tong University, P.R. China
pp. 1-6

Discovering affective regions in deep convolutional neural networks for visual sentiment prediction (Abstract)

Ming Sun , College of Computer and Control Engineering, Nankai University, China
Jufeng Yang , College of Computer and Control Engineering, Nankai University, China
Kai Wang , College of Computer and Control Engineering, Nankai University, China
Hui Shen , College of Computer and Control Engineering, Nankai University, China
pp. 1-6

Shape-guided segmentation for fine-grained visual categorization (Abstract)

Ming Sun , College of Computer and Control Engineering, Nankai University, China
Jufeng Yang , College of Computer and Control Engineering, Nankai University, China
Bo Sun , College of Computer and Control Engineering, Nankai University, China
Kai Wang , College of Computer and Control Engineering, Nankai University, China
pp. 1-6

Multimedia transmission over device-to-device wireless links (Abstract)

Chuang Ye , Department of Electrical Engineering and Computer Science, Syracuse University, Syracuse, New York 13244
M. Cenk Gursoy , Department of Electrical Engineering and Computer Science, Syracuse University, Syracuse, New York 13244
Senem Velipasalar , Department of Electrical Engineering and Computer Science, Syracuse University, Syracuse, New York 13244
pp. 1-6

Decoder energy-aware intra-coded HEVC bit stream generation (Abstract)

Thanuja Mallikarachchi , Centre for Vision Speech and Signal Processing, University of Surrey, United Kingdom
Dumidu S. Talagala , Centre for Vision Speech and Signal Processing, University of Surrey, United Kingdom
Hemantha Kodikara Arachchi , Centre for Vision Speech and Signal Processing, University of Surrey, United Kingdom
Anil Fernando , Centre for Vision Speech and Signal Processing, University of Surrey, United Kingdom
pp. 1-6

Content-adaptive focus configuration for near-eye multi-focal displays (Abstract)

Wanmin Wu , Ricoh Innovations Corp., 10050 N. Wolfe Road, Suite SW2-260, Cupertino, California 95014
Patrick Llull , Duke University, 10050 N. Wolfe Road, Suite SW2-260, Cupertino, California 95014
Ivana Tosic , Ricoh Innovations Corp., 10050 N. Wolfe Road, Suite SW2-260, Cupertino, California 95014
Noah Bedard , Ricoh Innovations Corp., 10050 N. Wolfe Road, Suite SW2-260, Cupertino, California 95014
Kathrin Berkner , Ricoh Innovations Corp., 10050 N. Wolfe Road, Suite SW2-260, Cupertino, California 95014
Nikhil Balram , Ricoh Innovations Corp., 10050 N. Wolfe Road, Suite SW2-260, Cupertino, California 95014
pp. 1-6

Driver confusion status detection using recurrent neural networks (Abstract)

Chiori Hori , Mitsubishi Electric Research Laboratories, Mitsubishi Electric Corporation
Shinji Watanabe , Mitsubishi Electric Research Laboratories, Mitsubishi Electric Corporation
Takaaki Hori , Mitsubishi Electric Research Laboratories, Mitsubishi Electric Corporation
Bret A. Harsham , Mitsubishi Electric Research Laboratories, Mitsubishi Electric Corporation
JohnR. Hershey , Mitsubishi Electric Research Laboratories, Mitsubishi Electric Corporation
Yusuke Koji , Information Technology R&D Center
Yoichi Fujii , Information Technology R&D Center
Yuki Furumoto , Automotive Electronics Development Center
pp. 1-6

Client-side cache management for scalable and interactive video streaming (Abstract)

Kamal K. Nayfeh , Electrical and Computer Engineering, Wayne State University, Detroit, Michigan 48202
Nabil J. Sarhan , Electrical and Computer Engineering, Wayne State University, Detroit, Michigan 48202
pp. 1-6

Robust online visual tracking via a temporal ensemble framework (Abstract)

Hao Guan , Shanghai Key Laboratory of Intelligent Information Processing, School of Computer Science, Fudan University, Shanghai, China
Xiangyang Xue , Shanghai Key Laboratory of Intelligent Information Processing, School of Computer Science, Fudan University, Shanghai, China
pp. 1-6

Effective HEVC intra coding unit size decision based on online progressive Bayesian classification (Abstract)

Jiawei Chen , Zhejiang Provincial Key Laboratory of Information Processing, Communication and Networking (IPCAN) Institute of Information and Communication Engineering, Zhejiang University, Hangzhou, 310027, China
Lu Yu , Zhejiang Provincial Key Laboratory of Information Processing, Communication and Networking (IPCAN) Institute of Information and Communication Engineering, Zhejiang University, Hangzhou, 310027, China
pp. 1-6

Recurrent convolutional neural network for video classification (Abstract)

Zhenqi Xu , Beijing University of Posts and Telecommunications, No. 10, Xitu Cheng Road, Haidian District, Beijing, China, 100876
Jiani Hu , Beijing University of Posts and Telecommunications, No. 10, Xitu Cheng Road, Haidian District, Beijing, China, 100876
Weihong Deng , Beijing University of Posts and Telecommunications, No. 10, Xitu Cheng Road, Haidian District, Beijing, China, 100876
pp. 1-6

An improved sparse reconstruction algorithm for speech compressive sensing using structured priors (Abstract)

Xiaobo Jiang , Shanghai Key Laboratory of Navigation and Location-based Services, Shanghai Jiao Tong University, Shanghai, China
Rendong Ying , Shanghai Key Laboratory of Navigation and Location-based Services, Shanghai Jiao Tong University, Shanghai, China
Fei Wen , Shanghai Key Laboratory of Navigation and Location-based Services, Shanghai Jiao Tong University, Shanghai, China
Sumxin Jiang , Shanghai Key Laboratory of Navigation and Location-based Services, Shanghai Jiao Tong University, Shanghai, China
Peilin Liu , Shanghai Key Laboratory of Navigation and Location-based Services, Shanghai Jiao Tong University, Shanghai, China
pp. 1-6

Kernelized learning in deep scattering convolution networks (Abstract)

Yuehan Xiong , Department of Electronic Engineering, Shanghai Jiao Tong University, China
Can Xu , Department of Electronic Engineering, Shanghai Jiao Tong University, China
Hongkai Xiong , Department of Electronic Engineering, Shanghai Jiao Tong University, China
pp. 1-6

Automatic suggestion of presentation image for storytelling (Abstract)

Yu Liu , State University of New York at Buffalo, NY, USA
Tao Mei , Microsoft Research Asia, Beijing, P. R. China
Chang Wen Chen , State University of New York at Buffalo, NY, USA
pp. 1-6

Patch-based face hallucination with multitask deep neural network (Abstract)

Wei-Jen Ko , Department of Electrical Engineering, National Taiwan University
Shao-Yi Chien , Department of Electrical Engineering, National Taiwan University
pp. 1-6

Binocular rivalry detection in natural image pairs (Abstract)

Yapeng Xue , Zhejiang Provincial Key Laboratory of Information Processing, Communication and Networking(IPCAN), College of Information Science and Eletronic Engineering, Zhejiang University, Hangzhou 310027, China
Wenhao Hong , Zhejiang Provincial Key Laboratory of Information Processing, Communication and Networking(IPCAN), College of Information Science and Eletronic Engineering, Zhejiang University, Hangzhou 310027, China
Yu Cao , Zhejiang Provincial Key Laboratory of Information Processing, Communication and Networking(IPCAN), College of Information Science and Eletronic Engineering, Zhejiang University, Hangzhou 310027, China
Lu Yu , Zhejiang Provincial Key Laboratory of Information Processing, Communication and Networking(IPCAN), College of Information Science and Eletronic Engineering, Zhejiang University, Hangzhou 310027, China
pp. 1-6

Extracting and describing liver capsule contour in high-frequency ultrasound image for early HBV cirrhosis diagnosis (Abstract)

Xiang Liu , School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, Shanghai, China
Jia Lin Song , Department of ultrasound, Changzheng Hospital, Second Military Medical University, Shanghai, China
Jing Wen Zhao , School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, Shanghai, China
Yan Qiu Chen , School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, Shanghai, China
Jian Quan Zhang , Department of ultrasound, Changzheng Hospital, Second Military Medical University, Shanghai, China
pp. 1-6

Full-reference perceptual quality assessment for stereoscopic images based on primary visual processing mechanism (Abstract)

Yu Cao , Zhejiang Provincial Key Laboratory of Information Processing, Communication and Networking (IPCAN) College of Information Science and Eletronic Engineering, Zhejiang University, Hangzhou 310027, China
Wenhao Hong , Zhejiang Provincial Key Laboratory of Information Processing, Communication and Networking (IPCAN) College of Information Science and Eletronic Engineering, Zhejiang University, Hangzhou 310027, China
Lu Yu , Zhejiang Provincial Key Laboratory of Information Processing, Communication and Networking (IPCAN) College of Information Science and Eletronic Engineering, Zhejiang University, Hangzhou 310027, China
pp. 1-6

Adaptive multi-dimension sparsity based coefficient estimation for compression artifact reduction (Abstract)

Jing Mu , Institute of Digital Media, Peking University, Beijing 100871, China
Xinfeng Zhang , Rapid-Rich Object Search (ROSE) Lab, Nanyang Technological University, Singapore
Ruiqin Xiong , Institute of Digital Media, Peking University, Beijing 100871, China
Siwei Ma , Institute of Digital Media, Peking University, Beijing 100871, China
Wen Gao , Institute of Digital Media, Peking University, Beijing 100871, China
pp. 1-6

Heterogeneity-entropy based unsupervised feature learning for personality prediction with cross-media data (Abstract)

Haishu Xianyu , Key Laboratory of Pervasive Computing, Ministry of Education, Tsinghua National Laboratory for Information Science and Technology (TNList) Department of Computer Science and Technology, Tsinghua University, Beijing, China
Mingxing Xu , Key Laboratory of Pervasive Computing, Ministry of Education, Tsinghua National Laboratory for Information Science and Technology (TNList) Department of Computer Science and Technology, Tsinghua University, Beijing, China
Zhiyong Wu , Key Laboratory of Pervasive Computing, Ministry of Education, Tsinghua National Laboratory for Information Science and Technology (TNList) Department of Computer Science and Technology, Tsinghua University, Beijing, China
Lianhong Cai , Key Laboratory of Pervasive Computing, Ministry of Education, Tsinghua National Laboratory for Information Science and Technology (TNList) Department of Computer Science and Technology, Tsinghua University, Beijing, China
pp. 1-6

Multimedia event detection via deep spatial-temporal neural networks (Abstract)

Jingyi Hou , Beijing Laboratory of Intelligent Information Technology, School of Computer Science, Beijing Institute of Technology, Beijing 100081, P.R. China
Xinxiao Wu , Beijing Laboratory of Intelligent Information Technology, School of Computer Science, Beijing Institute of Technology, Beijing 100081, P.R. China
Feiwu Yu , Beijing Laboratory of Intelligent Information Technology, School of Computer Science, Beijing Institute of Technology, Beijing 100081, P.R. China
Yunde Jia , Beijing Laboratory of Intelligent Information Technology, School of Computer Science, Beijing Institute of Technology, Beijing 100081, P.R. China
pp. 1-6

Occlusion pattern-based dictionary for robust face recognition (Abstract)

Cho-Ying Wu , Graduate Institute of Communication Engineering, National Taiwan University, Taipei, Taiwan, R.O.C.
Jian-Jiun Ding , Graduate Institute of Communication Engineering, National Taiwan University, Taipei, Taiwan, R.O.C.
pp. 1-6

Weakly-supervised deep self-learning for face recognition (Abstract)

Binghui Chen , Beijing University of Posts and Telecommunication, No 10, Xitucheng Road, Haidian District, Beijing, PR China
Weihong Deng , Beijing University of Posts and Telecommunication, No 10, Xitucheng Road, Haidian District, Beijing, PR China
pp. 1-6

Lossless depth map coding using binary tree based decomposition and context-based arithmetic coding (Abstract)

Shampa Shahriyar , Faculty of Information Technology
Manzur Murshed , Faculty of Science and Technology
Mortuza Ali , School of Computing and Mathematics
Manoranjan Paul , Faculty of Science and Technology
pp. 1-6

Nonlinear metric learning for visual tracking (Abstract)

Jiwen Lu , Department of Automation, Tsinghua University, Beijing, China
Junlin Hu , School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore
Yap-Peng Tan , School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore
pp. 1-6

Simultaneous estimation of gaze direction and visual focus of attention for multi-person-to-robot interaction (Abstract)

Benoit Masse , INRIA Grenoble Rhône-Alpes, France
Sileye Ba , INRIA Grenoble Rhône-Alpes, France
Radu Horaud , INRIA Grenoble Rhône-Alpes, France
pp. 1-6

Reducing manual labeling in singing voice detection: An active learning approach (Abstract)

Wei Li , School of Computer Science, Fudan University, Shanghai, China
Xiangyi Feng , School of Computer Science, Fudan University, Shanghai, China
Min Xue , School of Computer Science, Fudan University, Shanghai, China
pp. 1-5

Automatic image dataset construction with multiple textual metadata (Abstract)

Yazhou Yao , University of Technology Sydney, Australia
Jian Zhang , University of Technology Sydney, Australia
Fumin Shen , University of Electronic Science and Technology of China
Xiansheng Hua , Alibaba Group, Hangzhou, China
Jingsong Xu , University of Technology Sydney, Australia
Zhenmin Tang , Nanjing University of Science and Technology, China
pp. 1-6

Weakly supervised scalable audio content analysis (Abstract)

Anurag Kumar , Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA - USA
Bhiksha Raj , Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA - USA
pp. 1-6

A superpixel segmentation algorithm based on differential evolution (Abstract)

Yue-Jiao Gong , Department of Computer and Information Science, University of Macau, Macau, China
Yicong Zhou , Department of Computer and Information Science, University of Macau, Macau, China
Xinglin Zhang , School of Computer Science and Engineering, South China University of Technology, Guangzhou, China
pp. 1-6

Approximate convex decomposition for 2D shapes based on visibility range (Abstract)

Zhiyang Li , Dalian Maritime University, China
Wenyu Qu , Tianjin University, China
Heng Qi , Dalian University of Technology, China
Milos Stojmenovic , Singidunum University, Serbia
pp. 1-6

3D tracking targets via kinematic model weighted particle filter (Abstract)

Xi En Cheng , School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, Shanghai, China
Shuo Hong Wang , School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, Shanghai, China
Yan Qiu Chen , School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, Shanghai, China
pp. 1-6

New results in free-viewpoint television systems for horizontal virtual navigation (Abstract)

Marek Domanski , Poznań University of Technology, Chair of Multimedia Telecommunications and Microelectronics, Poznań, Poland
Maciej Bartkowiak , Poznań University of Technology, Chair of Multimedia Telecommunications and Microelectronics, Poznań, Poland
Adrian Dziembowski , Poznań University of Technology, Chair of Multimedia Telecommunications and Microelectronics, Poznań, Poland
Tomasz Grajek , Poznań University of Technology, Chair of Multimedia Telecommunications and Microelectronics, Poznań, Poland
Adam Grzelka , Poznań University of Technology, Chair of Multimedia Telecommunications and Microelectronics, Poznań, Poland
Adam Luczak , Poznań University of Technology, Chair of Multimedia Telecommunications and Microelectronics, Poznań, Poland
Dawid Mieloch , Poznań University of Technology, Chair of Multimedia Telecommunications and Microelectronics, Poznań, Poland
Jaroslaw Samelak , Poznań University of Technology, Chair of Multimedia Telecommunications and Microelectronics, Poznań, Poland
Olgierd Stankiewicz , Poznań University of Technology, Chair of Multimedia Telecommunications and Microelectronics, Poznań, Poland
Jakub Stankowski , Poznań University of Technology, Chair of Multimedia Telecommunications and Microelectronics, Poznań, Poland
Krzysztof Wegner , Poznań University of Technology, Chair of Multimedia Telecommunications and Microelectronics, Poznań, Poland
pp. 1-6

A novel trignometric energy functional for image segmentation in the presence of intensity in-homogeneity (Abstract)

Sajid Hussain , School of Electronics and Information Engineering, Xi'an Jiaotong University, Shaanxi, China
Qi Chun , School of Electronics and Information Engineering, Xi'an Jiaotong University, Shaanxi, China
Muhammad Rizwan Asif , School of Electronics and Information Engineering, Xi'an Jiaotong University, Shaanxi, China
Muhammad Sohrab Khan , Roshni Re-Cycle, Institute of Research and Technology Kashrote, Gilgit-Baltistan, Pakistan
Zhang Zhaoqiang , School of Electronics and Information Engineering, Xi'an Jiaotong University, Shaanxi, China
Muhammad Sadiq Fareed , School of Electronics and Information Engineering, Xi'an Jiaotong University, Shaanxi, China
Zhang Zhe , School of Electronics and Information Engineering, Xi'an Jiaotong University, Shaanxi, China
pp. 1-6

Multi-view distributed coding and selection of local binary features (Abstract)

Nuno Monteiro , Instituto Superior Técnico - Instituto de Telecomunicações, Portugal
Catarina Brites , Instituto Superior Técnico - Instituto de Telecomunicações, Portugal
Fernando Pereira , Instituto Superior Técnico - Instituto de Telecomunicações, Portugal
Joao Ascenso , Instituto Superior Técnico - Instituto de Telecomunicações, Portugal
pp. 1-6

Collaborative multi-view metric learning for visual classification (Abstract)

Junlin Hu , School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore
Jiwen Lu , Department of Automation, Tsinghua University, Beijing, China
Junsong Yuan , School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore
Yap-Peng Tan , School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore
pp. 1-6

No-reference image quality assessment based on high order derivatives (Abstract)

Qiaohong Li , School of Computer Engineering, Nanyang Technological University, Singapore
Weisi Lin , School of Computer Engineering, Nanyang Technological University, Singapore
Yuming Fang , School of Information Technology, Jiangxi University of Finance and Economics, Nanchang, China
pp. 1-6

Visual attention analysis on stereoscopic images for subjective discomfort evaluation (Abstract)

Sewoong Ahn , Department of Electrical and Electronic Engineering, Yonsei University, Seoul, Korea, 120-749
Junghwan Kim , Department of Electrical and Electronic Engineering, Yonsei University, Seoul, Korea, 120-749
Haksub Kim , Department of Electrical and Electronic Engineering, Yonsei University, Seoul, Korea, 120-749
Sanghoon Lee , Department of Electrical and Electronic Engineering, Yonsei University, Seoul, Korea, 120-749
pp. 1-6

Efficient MRF-based disocclusion inpainting in multiview video (Abstract)

Beerend Ceulemans , iMinds VZW, Ghent, Belgium
Shao-Ping Lu , iMinds VZW, Ghent, Belgium
Gauthier Lafruit , LISA (Laboratories of Image, Signal processing and Acoustics), Université Libre de Bruxelles
Peter Schelkens , iMinds VZW, Ghent, Belgium
Adrian Munteanu , iMinds VZW, Ghent, Belgium
pp. 1-6

Guitar solos as networks (Abstract)

Stefano Ferretti , Department of Computer Science and Engineering, University of Bologna, Mura A. Zamboni 7, I-40127 Bologna, Italy
pp. 1-6

Large-scale vehicle re-identification in urban surveillance videos (Abstract)

Xinchen Liu , Beijing Key Laboratory of Intelligent Telecommunication Software and Multimedia, Beijing University of Posts and Telecommunications, Beijing 100876, China
Wu Liu , Beijing Key Laboratory of Intelligent Telecommunication Software and Multimedia, Beijing University of Posts and Telecommunications, Beijing 100876, China
Huadong Ma , Beijing Key Laboratory of Intelligent Telecommunication Software and Multimedia, Beijing University of Posts and Telecommunications, Beijing 100876, China
Huiyuan Fu , Beijing Key Laboratory of Intelligent Telecommunication Software and Multimedia, Beijing University of Posts and Telecommunications, Beijing 100876, China
pp. 1-6

A semi-automatic brain tumor segmentation algorithm (Abstract)

Xiaoli Zhang , College of Computer Science and Technology Jilin University, Changchun, China
Xiongfei Li , College of Computer Science and Technology Jilin University, Changchun, China
Hongpeng Li , The Second Hospital Jilin University, Changchun, China
Yuncong Feng , College of Computer Science and Technology Jilin University, Changchun, China
pp. 1-6

Tracking undulatory body motion of multiple fish based on midline dynamics modeling (Abstract)

Shuo Hong Wang , School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, Shanghai, China
Xi En Cheng , School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, Shanghai, China
Yan Qiu Chen , School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, Shanghai, China
pp. 1-6

Table of contents (PDF)

pp. 1-19
100 ms
(Ver 3.3 (11022016))