3D Imaging Techniques and Multimedia Applications [Guest editor's introduction]
Found in: IEEE MultiMedia
By Ruzena Bajcsy, Ruigang Yang, Pietro Zanuttigh, Cha Zhang
Issue Date:January 2013
pp. 14-16
With the advances in sensing, transmission, and visualization technology, 3D information has become increasingly incorporated into real-world applications, from architecture, entertainment, and manufacturing to security. One of the fundamental requirements...
 
Viewport: A Distributed, Immersive Teleconferencing System with Infrared Dot Pattern
Found in: IEEE MultiMedia
By Cha Zhang, Qin Cai, Philip A. Chou, Zhengyou Zhang, Ricardo Martin-Brualla
Issue Date:January 2013
pp. 17-27
The Viewport immersive teleconferencing system reconstructs sparse 3D representations for each user and applies virtual seating to maintain the same seating geometry as face-to-face meetings.
 
Virtual View Reconstruction Using Temporal Information
Found in: 2012 IEEE International Conference on Multimedia and Expo (ICME)
By Shujie Liu, Philip A. Chou, Cha Zhang, Zhengyou Zhang, Chang Wen Chen
Issue Date:July 2012
pp. 115-120
The most significant problem in generating virtual views from a limited number of video camera views is handling areas that have become dis-occluded by shifting the virtual view away from the camera view. We propose using temporal information to address th...
 
Calibration between depth and color sensors for commodity depth cameras
Found in: Multimedia and Expo, IEEE International Conference on
By Cha Zhang, Zhengyou Zhang
Issue Date:July 2011
pp. 1-6
Commodity depth cameras have created many interesting new applications in the research community recently. These applications often require the calibration information between the color and the depth cameras. Traditional checkerboard based calibration sche...
 
A novel see-through screen based on weave fabrics
Found in: Multimedia and Expo, IEEE International Conference on
By Cha Zhang, Ruigang Yang, Tim Large, Zhengyou Zhang
Issue Date:July 2011
pp. 1-6
See-through screens (STS) have found important applications in remote collaboration systems to enhance non-verbal communication and gaze awareness. Existing STS designs often sacrifice the display quality significantly, rendering low-contrast images that d...
 
Boosted multi-task learning for face verification with applications to web image and video search
Found in: Computer Vision and Pattern Recognition, IEEE Computer Society Conference on
By Xiaogang Wang, Cha Zhang, Zhengyou Zhang
Issue Date:June 2009
pp. 142-149
Face verification has many potential applications including filtering and ranking image/video search results on celebrities. Since these images/videos are taken under uncontrolled environments, the problem is very challenging due to dramatic lighting and p...
 
Taylor expansion based classifier adaptation: Application to person detection
Found in: Computer Vision and Pattern Recognition, IEEE Computer Society Conference on
By Cha Zhang, Raffay Hamid, Zhengyou Zhang
Issue Date:June 2008
pp. 1-8
Because of the large variation across different environments, a generic classifier trained on extensive data-sets may perform sub-optimally in a particular test environment. In this paper, we present a general framework for classifier adaptation, which imp...
 
See-through Image Enhancement through Sensor Fusion
Found in: 2012 IEEE International Conference on Multimedia and Expo (ICME)
By Bo Fu, Mao Ye, Ruigang Yang, Cha Zhang
Issue Date:July 2012
pp. 687-692
Many hardware designs have been developed to allow a camera to be placed optically directly behind the screen. The purpose of such setups is to enable two-way video teleconferencing that maintains eye-contact. However, the image from the see-through camera...
 
Personal 3D audio system with loudspeakers
Found in: Multimedia and Expo, IEEE International Conference on
By Myung-Suk Song, Cha Zhang, Dinei Florencio, Hong-Goo Kang
Issue Date:July 2010
pp. 1600-1605
Traditional 3D audio systems often have a limited sweet spot for the user to perceive 3D effects successfully. In this paper, we present a personal 3D audio system with loudspeakers that has unlimited sweet spots. The idea is to have a camera track the use...
 
Turning enemies into friends: Using reflections to improve sound source localization
Found in: Multimedia and Expo, IEEE International Conference on
By Flavio Ribeiro, Demba Ba, Cha Zhang, Dinei Florencio
Issue Date:July 2010
pp. 731-736
Sound Source Localization (SSL) based on microphone arrays has numerous applications, and has received significant research attention. Common to all published research is the observation that the accuracy of SSL degrades with reverberation. Indeed, early (...
 
Multiview video compression and streaming based on predicted viewer position
Found in: Acoustics, Speech, and Signal Processing, IEEE International Conference on
By Dinei Florencio, Cha Zhang
Issue Date:April 2009
pp. 657-660
Recent technological advances have made possible a number of new applications in the area of 3D video. One of the enabling technologies for many of these 3D applications is multiview video coding, which has received significant attention in the last severa...
 
Monocular Video Foreground/Background Segmentation by Tracking Spatial-Color Gaussian Mixture Models
Found in: Motion and Video Computing, IEEE Workshop on
By Ting Yu, Cha Zhang, Michael Cohen, Yong Rui, Ying Wu
Issue Date:February 2007
pp. 5
This paper presents a new approach to segmenting monocular videos captured by static or hand-held cameras filming large moving non-rigid foreground objects. The foreground and background objects are modeled using spatial-color Gaussian mixture models (SCGMM...
 
Robust Visual Tracking via Pixel Classification and Integration
Found in: Pattern Recognition, International Conference on
By Cha Zhang, Yong Rui
Issue Date:August 2006
pp. 37-42
We propose a novel framework for tracking non-rigid objects via pixel classification and integration (PCI). Given a new input frame, the tracker first performs object classification on each pixel and then finds the region that has the highest inte...
 
A Three-Layer Virtual Director Model for Supporting Automated Multi-Site Distributed Education
Found in: Multimedia and Expo, IEEE International Conference on
By Bin Yu, Cha Zhang, Yong Rui, Klara Nahrstedt
Issue Date:July 2006
pp. 637-640
In multi-site distributed education (MSDE), video streams from multiple sites are available. To best utilize the limited screen space at each site, we develop a customizable, automated display management system in this paper, i.e., only user-preferred stre...
 
Multi-View Imaging: Capturing and Rendering Interactive Environments
Found in: Computer Vision for Interactive and Intelligent Environment
By Cha Zhang, Tsuhan Chen
Issue Date:November 2005
pp. 51-67
This paper presents a self-reconfigurable camera array system that captures and renders 3D virtual scenes interactively. It is composed of an array of 48 cameras mounted on mobile platforms. We propose an efficient algorithm that is capable of rendering hi...
 
Non-Uniform Sampling for Image-Based Rendering: Convergence of Image, Vision, and Graphics
Found in: Multi-Media Modeling Conference, International
By Cha Zhang, Tsuhan Chen
Issue Date:January 2004
pp. 2
Recent convergence of image processing, computer vision, and computer graphics has resulted in an exciting research topic referred to as image-based rendering (IBR). Widely used in applications ranging from movie special effects (e.g.,
 
A system for active image-based rendering
Found in: Multimedia and Expo, IEEE International Conference on
By Cha Zhang, Tsuhan Chen
Issue Date:July 2003
pp. 233-236
In this paper, we develop a system for active image-based rendering (IBR). Active IBR is a framework that is capable of estimating the final rendering quality and capturing the next view at the position where the rendering quality can be improved the most....
 
Compression of Lumigraph with Multiple Reference Frame (MRF) Prediction and Just-in-Time Rendering
Found in: Data Compression Conference
By Cha Zhang, Jin Li
Issue Date:March 2000
pp. 253
In the form of 2D image array, Lumigraph captures the complete appearance of an object or a scene, and is able to quickly render a novel view independent of the scene/object complexity. Since the data amount of Lumigraph is huge, the efficient storage and ...
 
Improving multiview face detection with multi-task deep convolutional neural networks
Found in: 2014 IEEE Winter Conference on Applications of Computer Vision (WACV)
By Cha Zhang, Zhengyou Zhang
Issue Date:March 2014
pp. 1036-1041
Multiview face detection is a challenging problem due to dramatic appearance changes under various pose, illumination and expression conditions. In this paper, we present a multi-task deep learning scheme to enhance the detection performance. More specific...
   
Wide-Baseline Hair Capture Using Strand-Based Refinement
Found in: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
By Linjie Luo, Cha Zhang, Zhengyou Zhang, Szymon Rusinkiewicz
Issue Date:June 2013
pp. 265-272
We propose a novel algorithm to reconstruct the 3D geometry of human hairs in wide-baseline setups using strand-based refinement. The hair strands are first extracted in each 2D view, and projected onto the 3D visual hull for initialization. The 3D positio...
 
Robust part-based face matching with multiple templates
Found in: 2013 10th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2013)
By Kye-Hyeon Kim, Cha Zhang, Zhengyou Zhang, Seungjin Choi
Issue Date:April 2013
pp. 1-7
Part-based matching can handle significant variations in facial expression and partial occlusion. However, under large pose variations, it is sometimes difficult to find corresponding local parts in a pair of images. In this paper, we propose a novel part-...
   
Video Enhancement of People Wearing Polarized Glasses: Darkening Reversal and Reflection Reduction
Found in: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
By Mao Ye, Cha Zhang, Ruigang Yang
Issue Date:June 2013
pp. 1179-1186
With the wide-spread of consumer 3D-TV technology, stereoscopic videoconferencing systems are emerging. However, the special glasses participants wear to see 3D can create distracting images. This paper presents a computational framework to reduce undesira...
 
Requirements and recommendations for an enhanced meeting viewing experience
Found in: Proceeding of the 16th ACM international conference on Multimedia (MM '08)
By Cha Zhang, Phil A. Chou, Rajesh Hegde, Sasa Junuzovic, Zhengyou Zhang, Zicheng Liu
Issue Date:October 2008
pp. 40-42
We have found that viewing recorded meetings using traditional meeting viewers whose interfaces consist of an automatic speaker and a fixed context view does not provide sufficient information and control to the users. In particular, a survey of users who ...
     
An automated end-to-end lecture capturing and broadcasting system
Found in: Proceedings of the 13th annual ACM international conference on Multimedia (MULTIMEDIA '05)
By Cha Zhang, Jim Crawford, Li-wei He, Yong Rui
Issue Date:November 2005
pp. 808-809
We present a complete end-to-end system that is fully automated and supports capturing, broadcasting, viewing, archiving and search. Specifically, we describe a system architecture that minimizes the pre- and post-production time, and a fully automated lec...
     
Indexing and retrieval of 3D models aided by active learning
Found in: Proceedings of the ninth ACM international conference on Multimedia (MULTIMEDIA '01)
By Cha Zhang, Tsuhan Chen
Issue Date:September 2001
pp. 615-616
We demonstrate a system for indexing and retrieval of 3D models aided by active learning. We propose a new set of region-based features for 3D models. Each model is treated as a solid volume with a uniform density. Features such as the volume-surface ratio...
     
Smart rebinning for compression of concentric mosaics
Found in: Proceedings of the eighth ACM international conference on Multimedia (MULTIMEDIA '00)
By Cha Zhang, Jin Li, Jizheng Xu, Yunnan Wu
Issue Date:October 2000
pp. 201-209
Concentric mosaics offer a quick solution to construct a virtual copy of a real environment, and navigate in the virtual environment. However, the huge amount of data associated with concentric mosaics is a heavy burden for its application. A 3D wavelet tr...
     