2006 IEEE International Conference on Multimedia and Expo
Automatic Extraction of Geometric Lip Features with Application to Multi-Modal Speaker Identification
Toronto, ON, Canada
July 09-July 12
ISBN: 1-4244-0366-7
In this paper we consider the problem of automatic extraction of geometric lip features for the purposes of multi-modal speaker identification. The use of visual information from the mouth region can be of great importance for improving speaker identification system performance in noisy conditions. We propose a novel method for automated lip feature extraction that utilizes a color space transformation and a fuzzy-based c-means clustering technique. Using the obtained visual cues, closed-set audio-visual speaker identification experiments are performed on the CUAVE database [1], showing promising results.
Citation:
Ivana Arsic, Roger Vilagut, Jean-Philippe Thiran, "Automatic Extraction of Geometric Lip Features with Application to Multi-Modal Speaker Identification," icme, pp.161-164, 2006 IEEE International Conference on Multimedia and Expo, 2006