Issue No.04 - April (2011 vol.23)
Lei Chen, Hong Kong University of Science and Technology, Hong Kong
Jeffrey Xu Yu, The Chinese University of Hong Kong, Hong Kong
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TKDE.2010.146
Existing solutions for probabilistic inference queries mainly focus on answering a single inference query and seldom address how to efficiently return results for a sequence of frequent queries, a scenario that is more common and practical in many real applications. In this paper, we study computation caching and sharing among a sequence of inference queries in databases. We first introduce the clique tree propagation (CTP) algorithm into databases for probabilistic inference queries. We use materialized views to cache the intermediate results of previous inference queries, which may be shared with subsequent queries and consequently reduce the time cost. Moreover, we take the query workload into account to identify the frequently queried variables. To optimize probabilistic inference queries with CTP, we cache these frequently queried variables in the materialized views to maximize reuse. Because different query plans exist, we present heuristics to estimate their costs and select the optimal query plan. Finally, we present an experimental evaluation in relational databases to illustrate the validity and superiority of our approaches in answering frequent probabilistic inference queries.
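The idea of sharing intermediate results between inference queries can be illustrated with a minimal sketch. It is not the paper's CTP algorithm or its relational implementation: it runs plain variable elimination on a hypothetical three-variable chain network A -> B -> C, and an in-memory dictionary stands in for a materialized view. All names and probability values (`P_A`, `P_B_A`, `P_C_B`, `cached`, `msg_A_to_B`) are assumptions made for illustration.

```python
from itertools import product

# Hypothetical chain network A -> B -> C over binary variables.
# A factor is (variable names, table mapping an assignment tuple to a value).
P_A = (("A",), {(0,): 0.6, (1,): 0.4})
P_B_A = (("A", "B"), {(0, 0): 0.7, (0, 1): 0.3, (1, 0): 0.2, (1, 1): 0.8})
P_C_B = (("B", "C"), {(0, 0): 0.9, (0, 1): 0.1, (1, 0): 0.5, (1, 1): 0.5})


def multiply(f, g):
    """Pointwise product of two factors over the union of their variables."""
    fv, ft = f
    gv, gt = g
    vars_ = fv + tuple(v for v in gv if v not in fv)
    table = {}
    for assign in product((0, 1), repeat=len(vars_)):
        env = dict(zip(vars_, assign))
        table[assign] = (ft[tuple(env[v] for v in fv)]
                         * gt[tuple(env[v] for v in gv)])
    return vars_, table


def sum_out(f, var):
    """Marginalize one variable out of a factor."""
    fv, ft = f
    keep = tuple(v for v in fv if v != var)
    table = {}
    for assign, val in ft.items():
        key = tuple(a for v, a in zip(fv, assign) if v != var)
        table[key] = table.get(key, 0.0) + val
    return keep, table


# Stand-in for a materialized view: intermediate factors keyed by name.
cache = {}
stats = {"hits": 0, "misses": 0}


def cached(name, compute):
    """Return the cached factor if present; otherwise compute and store it."""
    if name in cache:
        stats["hits"] += 1
    else:
        stats["misses"] += 1
        cache[name] = compute()
    return cache[name]


def marginal_B():
    # Eliminating A yields P(B); this intermediate result is cached.
    return cached("msg_A_to_B", lambda: sum_out(multiply(P_A, P_B_A), "A"))


def marginal_C():
    # Reuses the cached factor over B instead of eliminating A again.
    return sum_out(multiply(marginal_B(), P_C_B), "B")
```

Calling `marginal_B()` followed by `marginal_C()` computes the factor over B once and reuses it for the second query, which is the kind of sharing across a query sequence that the abstract describes. Which intermediate results are worth caching would depend on the query workload.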
Probabilistic inference, variable elimination, clique tree propagation.
Lei Chen, Jeffrey Xu Yu, "Answering Frequent Probabilistic Inference Queries in Databases", IEEE Transactions on Knowledge & Data Engineering, vol. 23, no. 4, pp. 512-526, April 2011, doi:10.1109/TKDE.2010.146