loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
16th International Conference on Pattern Recognition (ICPR'02) - Volume 3
Boosting and Structure Learning in Dynamic Bayesian Networks for Audio-Visual Speaker Detection
Quebec City, QC, Canada
August 11-August 15
ISBN: 0-7695-1695-X
Tanzeem Choudhury, Massachusetts Institute of Technology
James M. Rehg, Georgia Institute of Technology
Vladimir Pavlović, Boston University
Alex Pentland, Massachusetts Institute of Technology
Bayesian networks are an attractive modeling tool for human sensing, as they combine an intuitive graphical representation with efficient algorithms for inference and learning. Earlier work has demonstrated that boosted parameter learning could be used to improve the performance of Bayesian network classifiers for complex multi-modal inference problems such as speaker detection. In speaker detection, the goal is to use video and audio cues to infer when a person is speaking to a user interface. In this paper we introduce a new boosted structure learning algorithm based on AdaBoost. Given labeled data, our algorithm modifies both the network structure and parameters so as to improve classification accuracy. We compare its performance to both standard structure learning and boosted parameter learning on a fixed structure. We present results for speaker detection and for the UCI "chess" dataset.
Citation:
Tanzeem Choudhury, James M. Rehg, Vladimir Pavlović, Alex Pentland, "Boosting and Structure Learning in Dynamic Bayesian Networks for Audio-Visual Speaker Detection," icpr, vol. 3, pp.30789, 16th International Conference on Pattern Recognition (ICPR'02) - Volume 3, 2002
Usage of this product signifies your acceptance of the Terms of Use.