2012 IEEE 12th International Conference on Data Mining Workshops (2012)

Brussels, Belgium Belgium

Dec. 10, 2012 to Dec. 10, 2012

ISBN: 978-1-4673-5164-5

pp: 953-960

DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDMW.2012.68

ABSTRACT

Data mining tools often include a workbench of algorithms to model a given dataset but lack sufficient guidance to select the most accurate algorithm according to the nature of the dataset. The most accurate algorithm is not known in advance and no single model format is superior for all datasets. An a priori comparison experiment to determine which algorithm leads to the most optimal model fit could offer relief but is rather time consuming. A meta model which is able to predict and explain which model would fit best could address this problem. The relevance of such a meta model increases with the number and runtime of algorithms under consideration and the size of the dataset. In this paper a novel meta model is proposed to automatically provide support to the user about whether to use a linear, spline, tree, linear tree or linear spline model, given a particular dataset. This study distinguishes itself from previous meta learning studies by focusing on comprehensible regression models, regression specific dataset characteristics, artificially generated datasets and a score based user recommendation.

INDEX TERMS

Splines (mathematics), Prediction algorithms, Correlation, Accuracy, Data models, Data mining, Feature extraction, meta learning, regression, accuracy, comprehensibility

CITATION

G. Loterman and C. Mues, "Selecting Accurate and Comprehensible Regression Algorithms through Meta Learning,"

*2012 IEEE 12th International Conference on Data Mining Workshops(ICDMW)*, Brussels, Belgium Belgium, 2012, pp. 953-960.

doi:10.1109/ICDMW.2012.68

CITATIONS