DEXA 2006: Seventeenth International Conference on Database and Expert Systems Applications
Sept. 4, 2006 to Sept. 8, 2006
Nittaya Kerdprasop, Suranaree University of Technology, Thailand
Kittisak Kerdprasop, Suranaree University of Technology, Thailand
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/DEXA.2006.49
Density estimation is an important pre-processing step in data stream classification, where the volume of data is overwhelming and the exact data distribution is unknown. We simplify the problem by employing a statistical sampling technique to obtain an approximate solution. With the proposed method, an unbounded data set can be sampled in a number of random configurations, and the samples can be used to describe the data set as a whole. The efficiency of the method depends largely on the ability to draw samples effectively, which in turn depends on how closely we can estimate the target density. We use finite mixture models to represent the probability density functions of the data stream, and we apply the EM algorithm twice to learn the model parameters. Experimental results demonstrate the efficiency of our estimation technique.
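The general idea of sampling a stream and fitting a finite mixture with EM can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the sampling scheme, the mixture family, or how the two EM passes are organized. The sketch assumes reservoir sampling, a one-dimensional Gaussian mixture, and a single EM run; the names `reservoir_sample`, `em_gmm_1d`, `mixture_density`, and `synthetic_stream`, as well as the synthetic data, are hypothetical.

```python
import math
import random


def reservoir_sample(stream, k, rng):
    """Keep a uniform random sample of k items from a stream of unknown length."""
    sample = []
    for i, x in enumerate(stream):
        if i < k:
            sample.append(x)
        else:
            j = rng.randint(0, i)  # inclusive on both ends
            if j < k:
                sample[j] = x
    return sample


def normal_pdf(x, mean, var):
    """Density of a univariate Gaussian with the given mean and variance."""
    return math.exp(-((x - mean) ** 2) / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def em_gmm_1d(data, k, iters=100, min_var=1e-6):
    """Fit a k-component 1-D Gaussian mixture with plain EM (assumed model family)."""
    srt = sorted(data)
    n = len(srt)
    # Initialise means at evenly spaced quantiles, with shared variance and equal weights.
    means = [srt[int((i + 0.5) * n / k)] for i in range(k)]
    overall_mean = sum(srt) / n
    overall_var = max(sum((x - overall_mean) ** 2 for x in srt) / n, min_var)
    variances = [overall_var] * k
    weights = [1.0 / k] * k
    for _ in range(iters):
        # E-step: posterior probability of each component for each point.
        resp = []
        for x in data:
            p = [w * normal_pdf(x, m, v) for w, m, v in zip(weights, means, variances)]
            total = sum(p) or 1e-300  # guard against division by zero on underflow
            resp.append([pj / total for pj in p])
        # M-step: re-estimate weights, means, and variances from the responsibilities.
        for j in range(k):
            nj = sum(r[j] for r in resp)
            if nj < 1e-12:
                continue  # leave an empty component unchanged
            weights[j] = nj / n
            means[j] = sum(r[j] * x for r, x in zip(resp, data)) / nj
            var_j = sum(r[j] * (x - means[j]) ** 2 for r, x in zip(resp, data)) / nj
            variances[j] = max(var_j, min_var)
    return weights, means, variances


def mixture_density(x, weights, means, variances):
    """Evaluate the fitted mixture density at x."""
    return sum(w * normal_pdf(x, m, v) for w, m, v in zip(weights, means, variances))


def synthetic_stream(n, rng):
    """Hypothetical stream: equal mix of N(0, 1) and N(8, 1)."""
    for _ in range(n):
        yield rng.gauss(0.0, 1.0) if rng.random() < 0.5 else rng.gauss(8.0, 1.0)


rng = random.Random(0)
sample = reservoir_sample(synthetic_stream(20000, rng), 500, rng)
weights, means, variances = em_gmm_1d(sample, k=2)
```

Here the mixture is fitted on a fixed-size sample rather than on the full stream, so the cost of each EM iteration depends on the sample size and not on the stream length. Calling `mixture_density(x, weights, means, variances)` then gives an approximate density value at any point `x`.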
Nittaya Kerdprasop, Kittisak Kerdprasop, "Density Estimation Technique for Data Stream Classification", Seventeenth International Conference on Database and Expert Systems Applications (DEXA), 2006, pp. 662-666, doi:10.1109/DEXA.2006.49