19th IEEE International Conference on Tools with Artificial Intelligence - Vol.2 (ICTAI 2007) An Empirical Study of Learning from Imbalanced Data Using Random Forest Paris, France October 29-October 31 ISBN: 0-7695-3015-X
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICTAI.2007.46
This paper discusses a comprehensive suite of experi- ments that analyze the performance of the random forest (RF) learner implemented in Weka. RF is a relatively new learner, and to the best of our knowledge, only preliminary experimentation on the construction of random forest clas- sifiers in the context of imbalanced data has been reported in previous work. Therefore, the contribution of this study is to provide an extensive empirical evaluation of RF learn- ers built from imbalanced data. What should be the rec- ommended default number of trees in the ensemble? What should the recommended value be for the number of at- tributes? How does the RF learner perform on imbalanced data when compared with other commonly-used learners? We address these and other related issues in this work.
Citation:
Taghi M. Khoshgoftaar, Moiz Golawala, Jason Van Hulse, "An Empirical Study of Learning from Imbalanced Data Using Random Forest," ictai, vol. 2, pp.310-317, 19th IEEE International Conference on Tools with Artificial Intelligence - Vol.2 (ICTAI 2007), 2007 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||