The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.03 - March (2012 vol.24)
pp: 426-439
Elena Demidova , L3S Research Center and Leibniz Universitä t Hannover, Germany
Xuan Zhou , Renmin University of China, Beijing
Wolfgang Nejdl , L3S Research Center and Leibniz Universitä t Hannover, Germany
ABSTRACT
Databases enable users to precisely express their informational needs using structured queries. However, database query construction is a laborious and error-prone process, which cannot be performed well by most end users. Keyword search alleviates the usability problem at the price of query expressiveness. As keyword search algorithms do not differentiate between the possible informational needs represented by a keyword query, users may not receive adequate results. This paper presents IQ^P—a novel approach to bridge the gap between usability of keyword search and expressiveness of database queries. IQ^P enables a user to start with an arbitrary keyword query and incrementally refine it into a structured query through an interactive interface. The enabling techniques of IQ^P include: 1) a probabilistic framework for incremental query construction; 2) a probabilistic model to assess the possible informational needs represented by a keyword query; 3) an algorithm to obtain the optimal query construction process. This paper presents the detailed design of IQ^P, and demonstrates its effectiveness and scalability through experiments over real-world data and a user study.
INDEX TERMS
Query formulation, search process.
CITATION
Elena Demidova, Xuan Zhou, Wolfgang Nejdl, "A Probabilistic Scheme for Keyword-Based Incremental Query Construction", IEEE Transactions on Knowledge & Data Engineering, vol.24, no. 3, pp. 426-439, March 2012, doi:10.1109/TKDE.2011.40
REFERENCES
[1] I. Androutsopoulos, G.D. Ritchie, and P. Thanisch, "Natural Language Interfaces to Databases—An Introduction," J. Language Eng., vol. 1, no. 1, pp. 29-81, 1995.
[2] S. Agrawal, S. Chaudhuri, and G. Das, "DBXplorer: A System for Keyword-Based Search over Relational Databases," Proc. Int'l Conf. Data Eng. (ICDE), 2002.
[3] M. Al-Muhammed and D.W. Embley, "Ontology-Based Constraint Recognition for Free-Form Service Requests," Proc. Int'l Conf. Data Eng. (ICDE '07), 2007.
[4] H. Bast, A. Chitea, F. Suchanek, and I. Weber, "ESTER: Efficient Search on Text, Entities, and Relations," Proc. 30th Ann. Int'l ACM SIGIR Conf. Research and Development in Information Retrieval (SIGIR), 2007.
[5] D. Braga, A. Campi, and S. Ceri, "XQBE (XQuery By Example): A Visual Interface to the Standard XML Query Language," ACM Trans. Database Systems, vol. 30, no. 2, pp. 398-443, 2005.
[6] A. Blum, "Microsoft English Query 7.5: Automatic Extraction of Semantics from Relational Databases and OLAP Cubes," Proc. 25th Int'l Conf. Very Large Data Bases (VLDB), 1999.
[7] V.T. Chakaravarthy, H. Gupta, P. Roy, and M. Mohania, "Efficiently Linking Text Documents with Relevant Structured Information," Proc. Int'l Conf. Very Large Data Bases (VLDB), 2006.
[8] J. Chambers, W. Cleveland, B. Kleiner, and P. Tukey, Graphical Methods for Data Analysis. Wadsworth, 1983.
[9] Clusty, http:/clusty.com/, 2011.
[10] E. Demidova, X. Zhou, and W. Nejdl, "IQP: Incremental Query Construction, a Probabilistic Approach," Proc. 26th IEEE Int'l Conf. Data Eng. (ICDE), 2010.
[11] E. Demidova, X. Zhou, G. Zenz, and W. Nejdl, "SUITS: Faceted User Interface for Constructing Structured Queries from Keywords," Proc. 14th Int'l Conf. Database Systems for Advanced Applications (DASFAA), 2009.
[12] H. He, H. Wang, J. Yang, and P.S. Yu, "BLINKS: Ranked Keyword Searches on Graphs," Proc. ACM SIGMOD Int'l Conf. Management of Data (SIGMOD), 2007.
[13] M.A. Hearst, "Clustering versus Faceted Categories for Information Exploration," Comm. ACM, vol. 49, no. 4, pp. 59-61, Apr. 2006.
[14] V. Hristidis, L. Gravano, and Y. Papakonstantinou, "Efficient IR-Style Keyword Search over Relational Databases," Proc. Int'l Conf. Very Large Data Bases (VLDB), 2003.
[15] V. Hristidis and Y. Papakonstantinou, "DISCOVER: Keyword Search in Relational Databases," Proc. Int'l Conf. Very Large Data Bases (VLDB), 2002.
[16] H.V. Jagadish, "Making Database Systems Usable," Proc. ACM SIGMOD Int'l Conf. Management of Data (SIGMOD), 2007.
[17] M. Jayapandian and H.V. Jagadish, "Expressive Query Specification through Form Customization," Proc. 11th Int'l Conf. Extending Database Technology: Advances in Database Technology (EDBT), 2008.
[18] E. Kandogan, R. Krishnamurthy, S. Raghavan, S. Vaithyanathan, and H. Zhu, "Avatar Semantic Search: A Database Approach to Information Retrieval," Proc. ACM SIGMOD Int'l Conf. Management of Data (SIGMOD), 2006.
[19] G. Koutrika, A. Simitsis, and Y.E. Ioannidis, "Explaining Structured Queries in Natural Language," Proc. Int'l Conf. Data Eng. (ICDE), 2010.
[20] M. Käki, "Findex: Search Result Categories Help Users when Document Ranking Fails," Proc. SIGCHI Conf. Human Factors in Computing Systems, 2005.
[21] Y. Li, H. Yang, and H.V. Jagadish, "Constructing a Generic Natural Language Interface for an XML Database," Proc. Int'l Conf. Extending Database Technology (EDBT), 2006.
[22] F. Liu, C. Yu, W. Meng, and A. Chowdhury, "Effective Keyword Search in Relational Databases," Proc. ACM SIGMOD Int'l Conf. Management of Data (SIGMOD), 2006.
[23] Z. Liu and Y. Chen, "Identifying Meaningful Return Information for XML Keyword Search," Proc. ACM SIGMOD Int'l Conf. Management of Data (SIGMOD), 2007.
[24] Y. Luo, X. Lin, W. Wang, and X. Zhou, "SPARK: Top-k Keyword Query in Relational Databases," Proc. ACM SIGMOD Int'l Conf. Management of Data (SIGMOD), 2007.
[25] C.D. Manning, P. Raghavan, and H. Schütze, Introduction to Information Retrieval. Cambridge Univ. Press, 2008.
[26] M. McCandless, E. Hatcher, and O. Gospodnetic, Lucene in Action, second ed. Manning, 2008.
[27] F. Mesquita et al., "LABRADOR: Efficiently Publishing Relational Databases on the Web by Using Keyword-Based Query Interfaces," Information Processing and Management: An Int'l J., vol. 43, no. 4, pp. 983-1004, 2007.
[28] A. Nandi and H.V. Jagadish, "Assisted Querying Using Instant-Response Interfaces," Proc. ACM SIGMOD Int'l Conf. Management of Data (SIGMOD), 2007.
[29] G. Pass, A. Chowdhury, and C. Torgeson, "A Picture of Search," Proc. First Int'l Conf. Scalable Information Systems (InfoScale '06), 2006.
[30] J.R. Quinlan, "Induction of Decision Trees," Machine Learning, vol. 1, no. 1, pp. 81-106, Mar. 1986.
[31] S. Roy, H. Wang, G. Das, U. Nambiar, and M.K. Mohania, "Minimum Effort Driven Dynamic Faceted Search in Structured Databases," Proc. ACM Conf. Information and Knowledge Management (CIKM), 2008.
[32] S. Tata and G.M. Lohman, "SQAK: Doing More with Keywords," Proc. ACM SIGMOD Int'l Conf. Management of Data (SIGMOD), 2008.
[33] T. Tran, P. Cimiano, S. Rudolph, and R. Studer, "Ontology-Based Interpretation of Keywords for Semantic Search," Proc. Sixth Int'l the Semantic Web and Second Asian Conf. Asian Semantic Web Conf. (ISWC), 2007.
[34] P. Wu, Y. Sismanis, and B. Reinwald, "Towards Keyword-Driven Analytical Processing," Proc. ACM SIGMOD Int'l Conf. Management of Data (SIGMOD), 2007.
[35] G. Zenz, X. Zhou, E. Minack, W. Siberski, and W. Nejdl, "From Keywords to Semantic Queries—Incremental Query Construction on the Semantic Web," J. Web Semantics, vol. 7, no. 3, pp. 166-176, 2009.
[36] M.M. Zloof, "Query-by-Example: A Data Base Language," IBM Systems J., vol. 16, no. 4, pp. 324-343, 1977.
[37] Q. Zhou, C. Wang, M. Xiong, H. Wang, and Y. Yu, "SPARK: Adapting Keyword Query to Semantic Search," Proc. Sixth Int'l the Semantic Web and Second Asian Conf. Asian Semantic Web Conf. (ISWC), 2007.
[38] X. Zhou, G. Zenz, E. Demidova, and W. Nejdl, "SUITS: Constructing Structured Queries from Keywords," technical report, L3S Research Center, Hannover, Germany, http://www.l3s.de/~demidovasuits-TR.pdf, 2008.
24 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool