loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06)
Clustering Workflow Requirements Using Compression Dissimilarity Measure
Hong Kong, China
December 18-December 22
ISBN: 0-7695-2702-7
Li Wei, University of California, Riverside, CA
John Handley, Xerox Corporation
Nathaniel Martin, Xerox Corporation
Tong Sun, Xerox Corporation
Eamonn Keogh, University of California, Riverside
Xerox offers a bewildering array of printers and software configurations to satisfy the needs of production print shops. A configuration tool in the hands of sales analysts elicits requirements from customers and recommends a list of product configurations. This tool generates special question and answer case logs that provide useful historical data. Given the unusual semi-structured question and answer format, this data is not amenable to any standard document clustering method. We discovered that a hierarchical agglomerative approach using a compression-based dissimilarity measure (CDM) provided readily interpretable clusters. We compare this method empirically to two reasonable alternatives, latent semantic analysis and probabilistic latent semantic analysis, and conclude that CDM offers an accurate and easily implemented approach to validate and augment our configuration tool.
Citation:
Li Wei, John Handley, Nathaniel Martin, Tong Sun, Eamonn Keogh, "Clustering Workflow Requirements Using Compression Dissimilarity Measure," icdmw, pp.50-54, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.