This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Extraction and Applications of Statistical Relationships in Relational Databases
December 1996 (vol. 8 no. 6)
pp. 939-945

Abstract—In this paper, we discuss modeling and extraction of statistical relationships among attributes. Different methods are used for extraction of different types of relationships. A complete methodology for extraction is developed by integrating widely accepted statistical methods. Statistical relationships manifest embedded relationships in data and thus lend themselves naturally to estimating unknown attribute values and detecting unlikely values. We will carefully examine these applications and evaluate the usefulness of statistical relationships in these applications using a real-life database.

[1] R. Agrawal, S. Ghosh, T. Imielinski, B. Iyer, and A. Swami, “An Interval Classifier for Database Mining Applications,” Proc. 18th Conf. Very Large Databases, pp. 560–573, 1992.
[2] R. Agrawal, T. Imielinski, and A. Swami, “Mining Association Rules Between Sets of Items in Large Databases,” Proc. 1993 ACM-SIGMOD Int'l Conf. Management of Data, pp. 207-216, May 1993.
[3] D. Barbara, H. Garcia-Molina, and D. Porter, "A Probabilistic Relational Data Model," Proc. 1990 EDBT Conf., pp. 60-74., 1990.
[4] J. Devore, Probability and Statistics for Eng. and the Sciences. Brooks/Cole, 1984.
[5] D. Freeman, Applied Categorical Data Analysis. Dekker, 1987.
[6] J. Han, Y. Cai, and N. Cercone, “Knowledge Discovery in Databases: an Attribute-Oriented Approach,” Proc. 18th Conf. Very Large Databases, pp. 547–559, 1992.
[7] W. Hou and G. Ozsoyoglu, “Processing Real-Time Aggregate Queries in CASE-DB,” ACM Trans. Database Systems, June 1993.
[8] R.A. Johnson and D.W. Wichern,Applied multivariate statistical analysis, Prentice Hall, 1988.
[9] S.-K. Lee,“Imprecise and uncertain information in databases: An evidentialapproach.” Proc. IEEE Int’l Conf. Data Engineering, pp. 614-621, 1992.
[10] A. Ola,“Relational databases with exclusive disjunctions,” Proc. IEEE Data Eng. Conf., pp. 328-336, 1992.
[11] Proc. IJCAI 89 Workshop Knowledge Discovery in Databases, G. Piatetsky-Shapiro and W. Frawley, eds., Aug. 1989.
[12] W. Frawley, G. Piatetsky-Shapiro, and C. Matheus, "Knowledge Discovery in Databases: An Overview," Knowledge Discovery in Databases, G. Piatetsky-Shapiro and W. Frawley, eds., pp. 1-27, AAAI/MIT Press, 1991.
[13] G. Piatetsky-Shapiro, "Discovery, Analysis, and Presentation of Strong Rules," Knowledge Discovery in Databases, G. Piatetsky-Shapiro and W. Frawely, eds., pp. 229-248, AAAI/MIT Press, 1991.
[14] "SAS/STAT User's Guide," Release 6.03 edition, SAS Inst., North Carolina, 1991.
[15] M. Tatsuoka, Multivariate Analysis. Macmillan, 1988.

Index Terms:
Data mining, estimating unknown attribute values, integration of data mining techniques, integrity constraints, knowledge discovery in databases, statistical relationships.
Citation:
Wen-Chi Hou, "Extraction and Applications of Statistical Relationships in Relational Databases," IEEE Transactions on Knowledge and Data Engineering, vol. 8, no. 6, pp. 939-945, Dec. 1996, doi:10.1109/69.553160
Usage of this product signifies your acceptance of the Terms of Use.