2012 IEEE 12th International Conference on Data Mining Workshops (2012)

Brussels, Belgium

Dec. 10, 2012

ISBN: 978-1-4673-5164-5

pp: 659-668

ABSTRACT

We consider the problem of identifying statistically significant connection strengths between variables in a linear non-Gaussian causal model called LiNGAM. In our previous work, bootstrap confidence intervals of the connection strengths were computed simultaneously in order to test their statistical significance. However, the distribution of the estimated elements of the adjacency matrix obtained by the bootstrap method was not close enough to the real distribution, even when the number of bootstrap replications was increased. Moreover, such a naive approach raised the multiple comparison problem, in which many directed edges are likely to be falsely found significant. In this study, we propose a new approach to correct the distribution obtained by the bootstrap method. We also apply a representative multiple comparison technique, the Bonferroni correction, and evaluate its performance. The results show that the new distribution is more stable and closer to the real distribution. In addition, the number of edges falsely found significant is smaller than with the previous approach.
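To make the two ingredients named in the abstract concrete, the following is a minimal sketch of percentile bootstrap confidence intervals combined with a Bonferroni-adjusted significance level. It is not the authors' procedure: ordinary least squares stands in for DirectLiNGAM as the coefficient estimator, the distribution correction proposed in the paper is not reproduced, and the function name `bonferroni_bootstrap_intervals` and its parameters are hypothetical choices made for illustration.

```python
import numpy as np


def bonferroni_bootstrap_intervals(X, y, n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap intervals for linear coefficients.

    Assumption: OLS is used as a stand-in estimator; the paper estimates
    connection strengths with DirectLiNGAM instead.
    """
    rng = np.random.default_rng(seed)
    n, p = X.shape
    coefs = np.empty((n_boot, p))
    for b in range(n_boot):
        # Resample rows with replacement and refit on the resampled data.
        idx = rng.integers(0, n, size=n)
        coefs[b] = np.linalg.lstsq(X[idx], y[idx], rcond=None)[0]

    # Bonferroni correction: split the overall level alpha across p tests.
    alpha_adj = alpha / p
    lower = np.quantile(coefs, alpha_adj / 2, axis=0)
    upper = np.quantile(coefs, 1 - alpha_adj / 2, axis=0)

    # A coefficient is flagged significant if its interval excludes zero.
    significant = (lower > 0) | (upper < 0)
    return lower, upper, significant
```

With `p` coefficients tested at once, each interval is built at level `alpha / p` rather than `alpha`, which widens the intervals and lowers the chance that an edge with zero strength is flagged as significant.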

INDEX TERMS

Vectors, Niobium, Mathematical model, Adaptation models, Equations, Bayesian methods, Data models, bootstrap method, Structural equation models, Bayesian networks, non-Gaussianity, causal discovery, Bayesian information criteria, adaptive Lasso

CITATION

K. Thamvitayakul, S. Shimizu, T. Ueno, T. Washio and T. Tashiro, "Bootstrap Confidence Intervals in DirectLiNGAM,"

*2012 IEEE 12th International Conference on Data Mining Workshops (ICDMW)*, Brussels, Belgium, 2012, pp. 659-668.

doi:10.1109/ICDMW.2012.134