10th IEEE International Symposium on Software Metrics (METRICS'04)
Measuring XML Document Similarity: A Case Study for Evaluating Information Extraction Systems
Chicago, Illinois
September 11-September 17
ISBN: 0-7695-2129-0
Measuring similarity between trees, such as XML structured information, has an important role in many applications, and in particular in the evaluation of the effectiveness of Information Extraction Systems (IES). In this paper we present an experience in evaluating the effectiveness of IES in terms of extraction and adaptation effectiveness. In the first part of the paper a similarity measure between XML trees based on a common sub tree detection algorithm is introduced; then, a case study aimed at the evaluation of the effectiveness of a group of IES is presented as an example of application.
Citation:
Gerardo Canfora, Luigi Cerulo, Rita Scognamiglio, "Measuring XML Document Similarity: A Case Study for Evaluating Information Extraction Systems," metrics, pp.36-45, 10th IEEE International Symposium on Software Metrics (METRICS'04), 2004