|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
2011 IEEE 11th International Conference on Data Mining
Detecting Recurring and Novel Classes in Concept-Drifting Data Streams
Vancouver, Canada
December 11-December 14
ISBN: 978-0-7695-4408-3
| ASCII Text | x | ||
| Mohammad M. Masud, Tahseen M. Al-Khateeb, Latifur Khan, Charu Aggarwal, Jing Gao, Jiawei Han, Bhavani Thuraisingham, "Detecting Recurring and Novel Classes in Concept-Drifting Data Streams," Data Mining, IEEE International Conference on, pp. 1176-1181, 2011 IEEE 11th International Conference on Data Mining, 2011. | |||
| BibTex | x | ||
| @article{ 10.1109/ICDM.2011.49, author = {Mohammad M. Masud and Tahseen M. Al-Khateeb and Latifur Khan and Charu Aggarwal and Jing Gao and Jiawei Han and Bhavani Thuraisingham}, title = {Detecting Recurring and Novel Classes in Concept-Drifting Data Streams}, journal ={Data Mining, IEEE International Conference on}, volume = {0}, year = {2011}, issn = {1550-4786}, pages = {1176-1181}, doi = {http://doi.ieeecomputersociety.org/10.1109/ICDM.2011.49}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - Data Mining, IEEE International Conference on TI - Detecting Recurring and Novel Classes in Concept-Drifting Data Streams SN - 1550-4786 SP1176 EP1181 A1 - Mohammad M. Masud, A1 - Tahseen M. Al-Khateeb, A1 - Latifur Khan, A1 - Charu Aggarwal, A1 - Jing Gao, A1 - Jiawei Han, A1 - Bhavani Thuraisingham, PY - 2011 KW - stream classification KW - novel class KW - recurring class VL - 0 JA - Data Mining, IEEE International Conference on ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDM.2011.49
Concept-evolution is one of the major challenges in data stream classification, which occurs when a new class evolves in the stream. This problem remains unaddressed by most state-of-the-art techniques. A recurring class is a special case of concept-evolution. This special case takes place when a class appears in the stream, then disappears for a long time, and again appears. Existing data stream classification techniques that address the concept-evolution problem, wrongly detect the recurring classes as novel class. This creates two main problems. First, much resource is wasted in detecting a recurring class as novel class, because novel class detection is much more computationally- and memory-intensive, as compared to simply recognizing an existing class. Second, when a novel class is identified, human experts are involved in collecting and labeling the instances of that class for future modeling. If a recurrent class is reported as novel class, it will be only a waste of human effort to find out whether it is really a novel class. In this paper, we address the recurring issue, and propose a more realistic novel class detection technique, which remembers a class and identifies it as "not novel" when it reappears after a long disappearance. Our approach has shown significant reduction in classification error over state-of-the-art stream classification techniques on several benchmark data streams.
Index Terms:
stream classification, novel class, recurring class
Citation:
Mohammad M. Masud, Tahseen M. Al-Khateeb, Latifur Khan, Charu Aggarwal, Jing Gao, Jiawei Han, Bhavani Thuraisingham, "Detecting Recurring and Novel Classes in Concept-Drifting Data Streams," icdm, pp.1176-1181, 2011 IEEE 11th International Conference on Data Mining, 2011
Usage of this product signifies your acceptance of the Terms of Use.
