|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
2009 Ninth IEEE International Conference on Data Mining
Efficient Discovery of Confounders in Large Data Sets
Miami, Florida
December 06-December 09
ISBN: 978-0-7695-3895-2
| ASCII Text | x | ||
| Wenjun Zhou, Hui Xiong, "Efficient Discovery of Confounders in Large Data Sets," Data Mining, IEEE International Conference on, pp. 647-656, 2009 Ninth IEEE International Conference on Data Mining, 2009. | |||
| BibTex | x | ||
| @article{ 10.1109/ICDM.2009.77, author = {Wenjun Zhou and Hui Xiong}, title = {Efficient Discovery of Confounders in Large Data Sets}, journal ={Data Mining, IEEE International Conference on}, volume = {0}, year = {2009}, issn = {1550-4786}, pages = {647-656}, doi = {http://doi.ieeecomputersociety.org/10.1109/ICDM.2009.77}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - Data Mining, IEEE International Conference on TI - Efficient Discovery of Confounders in Large Data Sets SN - 1550-4786 SP647 EP656 A1 - Wenjun Zhou, A1 - Hui Xiong, PY - 2009 KW - Phi Correlation coefficient KW - Correlation KW - Partial Correlation KW - Local Association KW - Confounder VL - 0 JA - Data Mining, IEEE International Conference on ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDM.2009.77
Given a large transaction database, association analysis is concerned with efficiently finding strongly related objects. Unlike traditional associate analysis, where relationships among variables are searched at a global level, we examine confounding factors at a local level. Indeed, many real-world phenomena are localized to specific regions and times. These relationships may not be visible when the entire data set is analyzed. Specially, confounding effects that change the direction of correlation is the most significant. Along this line, we propose to efficiently find confounding effects attributable to local associations. Specifically, we derive an upper bound by a necessary condition of confounders, which can help us prune the search space and efficiently identify confounders. Experimental results show that the proposed CONFOUND algorithm can effectively identify confounders and the computational performance is an order of magnitude faster than benchmark methods.
Index Terms:
Phi Correlation coefficient, Correlation, Partial Correlation, Local Association, Confounder
Citation:
Wenjun Zhou, Hui Xiong, "Efficient Discovery of Confounders in Large Data Sets," icdm, pp.647-656, 2009 Ninth IEEE International Conference on Data Mining, 2009
Usage of this product signifies your acceptance of the Terms of Use.
