Fourth Asia-Pacific Software Engineering and International Computer Science Conference (APSEC'97 / ICSC'97)
Reducing Uncertainties in Data Mining
Clear Water Bay, HONG KONG
December 02-December 05
ISBN: 0-8186-8271-X
Data mining, which is also referred to as knowledge discovery in databases, has attracted much research interest in recent years. Data mining among independently developed databases often involves uncertain information. These uncertainties can be generated during both processes of combining relations and merging tuples. In this paper, we propose a framework in which uncertainties can be measured. The objective is to determine the best way to combine and merge tuples in multiple databases and avoid generating unexpected uncertainties. The Shannon entropy theory plays a key part in our approach to reduce uncertainties when merging related tuples in a combined relation. Detailed examples are provided in the paper to address key issues.
Index Terms:
Data mining, knowledge discovery, database, uncertainty, Dempster-Shafer theory, entropy
Citation:
Yuhe Li, Haihong Dai, "Reducing Uncertainties in Data Mining," apsec, pp.97, Fourth Asia-Pacific Software Engineering and International Computer Science Conference (APSEC'97 / ICSC'97), 1997