The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.12 - December (2010 vol.22)
pp: 1781-1796
Kamal Taha , University of Texas at Arlington, Arlington
Ramez Elmasri , University of Texas at Arlington, Arlington
ABSTRACT
We present in this paper, a context-driven search engine called XCDSearch for answering XML Keyword-based queries as well as Loosely Structured queries, using a stack-based sort-merge algorithm. Most current research is focused on building relationships between data elements based solely on their labels and proximity to one another, while overlooking the contexts of the elements, which may lead to erroneous results. Since a data element is generally a characteristic of its parent, its context is determined by its parent. We observe that we could treat each set of elements consisting of a parent and its children data elements as one unified entity, and then use a stack-based sort-merge algorithm employing context-driven search techniques for determining the relationships between the different unified entities. We evaluated XCDSearch experimentally and compared it with five other search engines. The results showed marked improvement.
INDEX TERMS
XML keyword search, keyword search, keyword-based querying, XML search engine.
CITATION
Kamal Taha, Ramez Elmasri, "XCDSearch: An XML Context-Driven Search Engine", IEEE Transactions on Knowledge & Data Engineering, vol.22, no. 12, pp. 1781-1796, December 2010, doi:10.1109/TKDE.2009.210
REFERENCES
[1] S. Amer-Yahia, E. Curtmola, and A. Deutsch, "Flexible and Efficient XML Search with Complex Full-Text Predicates," Proc. ACM SIGMOD '06, 2006.
[2] D. Alorescu and I. Manolescu, "Integrating Keyword Search in XML Query Processing," Computer Networks, vol. 33, pp. 119-135, 2000.
[3] C. Agrawal and G. Das, "DBXplorer: A System for Keyword- Based Search over Relational Databases," Proc. Int'l Conf. Data Eng. (ICDE '02), 2002.
[4] B. Aditya and S. Sudarshan, "BANKS: Browsing and Keyword Searching in Relational Databases," Proc. Int'l Conf. Very Large Data Bases (VLDB '02), 2002.
[5] B. Balmin, V. Hristidis, and N. Koudas, "A System for Keyword Proximity Search on XML Databases," Proc. Int'l Conf. Very Large Data Bases (VLDB '03), 2003.
[6] B. Balmin, V. Hristidis, and Y. Papakonstantinon, "Keyword Proximity Search on XML Graphs," Proc. Int'l Conf. Data Eng. (ICDE '03), 2003.
[7] B. Balmin and V. Hristidis, "ObjectRank: Authority-Based Keyword Search in Databases," Proc. Int'l Conf. Very Large Data Bases (VLDB '04), 2004.
[8] C. Botev, L. Guo, and F. Shao, "XRANK: Ranked Keyword Search over XML Documents," Proc. ACM SIGMOD '03, 2003.
[9] Books24×7, http://www.books24x7.combooks24x7.asp, 2010.
[10] S. Cohen, J. Mamou, and Y. Sagiv, "XSEarch: A Semantic Search Engine for XML," Proc. Int'l Conf. Very Large Data Bases (VLDB '03), 2003.
[11] S. Cohen and Y. Kanza, "Interconnection Semantics for Keyword Search in XML," Proc. Int'l Conf. Information and Knowledge Management (CIKM '05), 2005.
[12] R. Conrad, D. Scheffner, and C. Freytag, "XML Conceptual Modeling Using UML," Proc. Int'l Conf. Conceptual Modeling (ER '00), 2000.
[13] R. Elmasri and S. Navathe, Fundamentals of Database Systems. Addison-Wesley, 2007.
[14] L. Guo, F. Shao, and C. Botev, "XRANK: Ranked Keyword Search over XML Documents," Proc. ACM SIGMOD '03, 2003.
[15] V. Hristidis and Y. Papakonstantinou, "DISCOVER: Keyword Search in Relational Databases," Proc. Int'l Conf. Very Large Data Bases (VLDB '02), 2002.
[16] Initiative for the Evaluation of XML Retrieval (INEX), http://inex.is.informatik.uni-duisburg.de 2005/, 2005.
[17] Initiative for the Evaluation of XML Retrieval (INEX), http://inex.is.informatik.uni-duisburg.de 2006/, 2006.
[18] J. Kamps, M. Marx, M. Rijke, and B. Sigurbjornsson, "Structured Queries in XML Retrieval," Proc. Int'l Conf. Information and Knowledge Management (CIKM '05), 2005.
[19] Y. Li, C. Yu, and H. Jagadish, "Schema-Free XQuery," Proc. Int'l Conf. Very Large Data Bases (VLDB '04), 2004.
[20] H. Leung, F. Chung, and C. Chan, "On the Use of Hierarchical Information in Sequential Mining Based XML Document Similarity Computation," Knowledge and Information Systems, vol. 4, no. 7, pp. 476-498, 2004.
[21] G. Li, S. Bressan, G. Dobbie, and B. Wadhwa, "XOO7: Applying O7 Benchmark to XML Query Processing Tools," Proc. Int'l Conf. Information and Knowledge Management (CIKM '01), 2001.
[22] http://www.xml.com/2002/11/06Ontology_Editor_Survey. html , 2010.
[23] mydeco, http:/mydeco.com/, 2010.
[24] Protégé Ontology Editor, http:/protege.stanford.edu/, 2010.
[25] E. Pardede, J. Rahayu, and D. Taniar, "On Using Collection for Aggregation and Association Relationships in XML Object-Relational Storage," Proc. ACM Symp. Applied Computing (SAC '04), 2004.
[26] K. Taha and R. Elmasri, "OOXSearch: A Search Engine for Answering Loosely Structured XML Queries Using OO Programming," Proc. 24th British Nat'l Conf. Databases (BNCOD '07), 2007.
[27] M. Snoeck and G. Dedene, "Existence Dependency: The Key to Semantic Integrity between Structural and Behavioral Aspects of Object Types," IEEE Trans. Software Eng., vol. 24, no. 24, pp. 233-251, Apr. 1998.
[28] K. Taha and R. Elmasri, "CXLEngine: A Comprehensive XML Loosely Structured Search Engine," Proc. Int'l Conf. Extending Database Technology (EDBT) Workshop Database Technologies for Handling XML Information on the Web (DataX '08), 2008.
[29] ToXgene, a Template-Based Generator for Large XML Documents, http://www.cs.toronto.edu/toxtoxgene/, 2010.
[30] TIMBER, http://www.eecs.umich.edu/dbtimber/, 2010.
[31] Wiley InterScience, http://www3.interscience.wiley.com/ cgi-bin home, 2010.
[32] N. Widjaya, D. Taniar, and W. Rahayu, "Aggregation Transformation of XML Schema to Object-Relational Databases," Proc. Int'l Workshop Innovative Internet Community Systems, pp. 251-262, 2003.
[33] N. Widjaya and W. Rahayu, "Association Relationship Transformation of XML Schemas to Object-Relational Databases," Proc. Int'l Conf. Information Integration and Web-Based Applications and Services (iiWAS '02), 2002.
[34] R. Warner, Applied Statistics: From Bivariate through Multivariate Techniques. Sage Publications, 2007.
[35] Y. Xu and Y. Papakonstantinou, "Efficient Keyword Search for Smallest LCAs in XML Databases," Proc. ACM SIGMOD '05, 2005.
[36] R. Xiaou, T. Dillon, and L. Feng, "Modeling and Transformation of Object-Oriented Conceptual Models into XML Schema," Proc. Int'l Conf. Database and Expert Systems Applications (DEXA '01), 2001.
[37] XML Query Use Cases, W3C Working Draft, 2007.
17 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool