San Jose, California
Feb. 24, 2002 to Feb. 25, 2002
ISBN: 0-7695-1480-4
pp: 0034
Xiao-Ling Wang, Southeast University
Yi-Sheng Dong, Southeast University
Ji-Rong Wen, Microsoft Research Asia
Liu Wen-Yin, Microsoft Research Asia
Structured documents, especially XML documents, are made up of logical components such as titles, sections, subsections, and paragraphs. The components of each structured document can be represented by an ordered tree model, which can also be viewed as a hierarchical relationship among concepts. To meet users' requirements for more precise and focused search results, retrieval techniques should allow the user to retrieve document components at varying granularity. Enabling this requires suitable querying strategies and, in particular, specialized indexing techniques. This paper puts forward an enhancive indexing technique for approximate search that takes advantage of the natural hierarchical structure of text documents. Experiments show that this enhancive indexing technique yields improved performance.
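As a rough illustration of the ordered tree model and of retrieval at varying granularity mentioned in the abstract, the following minimal Python sketch may help. It is not the paper's enhancive index and does only exact term matching rather than approximate search. The names `Component`, `build_index`, and `smallest_components`, as well as the sample document, are hypothetical and introduced here purely for illustration.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Component:
    """One logical component (e.g. section, paragraph) of an ordered tree."""
    kind: str
    text: str = ""
    children: list = field(default_factory=list)  # order of children is significant


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric terms."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def build_index(root):
    """Map each term to the set of component paths whose subtree contains it.

    A path is a tuple of child positions from the root: () is the whole
    document, (0, 1) is the second child of the first child.
    """
    index = {}

    def visit(node, path):
        terms = tokenize(node.text)
        for i, child in enumerate(node.children):
            terms |= visit(child, path + (i,))
        for term in terms:
            index.setdefault(term, set()).add(path)
        return terms

    visit(root, ())
    return index


def smallest_components(index, query):
    """Return the paths of the deepest components containing every query term."""
    terms = tokenize(query)
    if not terms:
        return []
    hits = set.intersection(*(index.get(t, set()) for t in terms))
    # Keep a hit only if none of its proper descendants is also a hit.
    return sorted(p for p in hits
                  if not any(q != p and q[:len(p)] == p for q in hits))


# Hypothetical sample document (assumed content, not taken from the paper).
doc = Component("article", "Sample document", [
    Component("section", "Indexing", [
        Component("paragraph", "An index maps terms to tree components."),
        Component("paragraph", "Ordered tree models keep sibling order."),
    ]),
    Component("section", "Retrieval", [
        Component("paragraph", "Queries return components of varying granularity."),
    ]),
])
index = build_index(doc)
```

In this sketch, a query whose terms all occur in one paragraph resolves to that paragraph, while a query whose terms are spread over different sections resolves to the smallest enclosing component, which may be the whole document. That is one simple way to read "varying granularity"; the paper's own technique may differ.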
Structured document retrieval, document indexing, XML, ordered tree model
Xiao-Ling Wang, Yi-Sheng Dong, Ji-Rong Wen, Liu Wen-Yin, "Enhancive Index for Structured Document Retrieval", RIDE, 2002, Research Issues in Data Engineering, International Workshop on 2002, pp. 0034, doi:10.1109/RIDE.2002.995096
