2011 IEEE 11th International Conference on Data Mining Workshops (2011)
Dec. 11, 2011 to Dec. 11, 2011
Discovering infrequent causal relationships can help us prevent or correct negative outcomes caused by their antecedents. In this paper, we propose an innovative data mining framework and apply it to mine potential causal associations in electronic patient datasets where the drug-related events of interest occur infrequently. Specifically, we created a novel interestingness measure, exclusive causal-leverage, based on a computational, fuzzy recognition-primed decision (RPD) model that we previously developed. On the basis of this new measure, a data mining algorithm was developed to mine the causal relationship between drugs and their associated adverse drug reactions (ADRs). The algorithm was tested on real patient data retrieved from the Veterans Affairs Medical Center in Detroit, Michigan. The exclusive causal-leverage was employed to rank the potential causal associations between each of the two selected drugs (i.e., enalapril and pravastatin) and 3,954 recorded symptoms, each of which corresponds to a potential ADR. The top 10 drug-symptom pairs for each drug were evaluated by our physicians on the project team. The results showed that the number of symptoms considered as real ADRs for enalapril and pravastatin was 8 and 7 out of 10, respectively.
Causal association rules, electronic health database, adverse drug reactions, fuzzy recognition-primed decision model
Yanqing Ji, Hao Ying, John Tran, Peter Dews, Ayman Mansour, R. Michael Massanari, "Mining Infrequent Causal Associations in Electronic Health Databases", 2011 IEEE 11th International Conference on Data Mining Workshops, vol. 00, no. , pp. 421-428, 2011, doi:10.1109/ICDMW.2011.120