loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
16th International Conference on Pattern Recognition (ICPR'02) - Volume 4
The Performance Analysis of a Chi-square Similarity Measure for Topic Related Clustering of Noisy Transcripts
Quebec City, QC, Canada
August 11-August 15
ISBN: 0-7695-1695-X
Oktay lbrahimov, Oakland University
lshwar Sethi, Oakland University
Nevenka Dimitrova, Philips Research

The goal of the paper is to present a novel Chi-square similarity measure and assess its performance through comparison with well-known similarity measures such as Cosine, Dice, and Jaccard.

The Chi-square similarity measure has been designed to withstand the imperfections of transcribed spoken documents. The major difference of our similarity measure from others consists in the fact that in addition to searching for co-occurring words in documents, we also match informative closeness of common words. We assume that co-occurring words, which had been employed to convey the same information, should have the compatible significance in matching documents. To test it we apply the Chi-Square method.

Experimental results obtained via using an archive of transcribed news broadcasts demonstrate the high efficacy of the proposed methodology.

Citation:
Oktay lbrahimov, lshwar Sethi, Nevenka Dimitrova, "The Performance Analysis of a Chi-square Similarity Measure for Topic Related Clustering of Noisy Transcripts," icpr, vol. 4, pp.40285, 16th International Conference on Pattern Recognition (ICPR'02) - Volume 4, 2002
Usage of this product signifies your acceptance of the Terms of Use.