This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2012 IEEE Fifth International Conference on Cloud Computing
Cloud Guided Stream Classification Using Class-Based Ensemble
Honolulu, HI, USA USA
June 24-June 29
ISBN: 978-1-4673-2892-0
We propose a novel class-based micro-classifier ensemble classification technique (MCE) for  classifying data streams. Traditional  ensemble-based data stream classification techniques build  a classification model from each data chunk and  keep an ensemble of such models. Due to the fixed length of  the ensemble, when a new model is trained, one existing model is discarded. This creates several problems. First, if a  class disappears from the stream and reappears after a long  time, it would be misclassified if a majority of the classifiers  in the ensemble does not contain any model of that class. Second,  discarding a model means discarding the corresponding data  chunk completely. However, knowledge obtained from some classes  might be still useful and if they are discarded, the overall error rate would increase. To address these problems, we propose an ensemble model where each class information is stored separately.  From each data chunk, we train a model for each class of data.  We call each such model a micro-classifier. This approach is more robust than existing chunk-based  ensembles in handling dynamic changes in the data stream.  To the best of our knowledge, this is the first attempt to  classify data streams using the class-based ensembles approach.  When the number of classes grow in the stream, class-based ensembles may degrade in performance (speed). Hence, we sketch a cloud-based solution of our class-based ensembles to handle a large number of classes effectively. We compare our technique with several state-of-the-art data  stream classification techniques on both synthetic and benchmark data streams, and obtain much higher accuracy.
Index Terms:
Data models,Training,Testing,Training data,Accuracy,Data mining,Cloud computing,classification MapReduce Ensemble cloud
Citation:
Tahseen M. Al-Khateeb, Mohammad M. Masud, Latifur Khan, Bhavani Thuraisingham, "Cloud Guided Stream Classification Using Class-Based Ensemble," cloud, pp.694-701, 2012 IEEE Fifth International Conference on Cloud Computing, 2012
Usage of this product signifies your acceptance of the Terms of Use.