
This Article  
 
Share  
Bibliographic References  
Add to:  
Digg Furl Spurl Blink Simpy Del.icio.us Y!MyWeb  
Search  
 
ASCII Text  x  
Xiaofeng Zhu, Shichao Zhang, Zhi Jin, Zili Zhang, Zhuoming Xu, "Missing Value Estimation for MixedAttribute Data Sets," IEEE Transactions on Knowledge and Data Engineering, vol. 23, no. 1, pp. 110121, January, 2011.  
BibTex  x  
@article{ 10.1109/TKDE.2010.99, author = {Xiaofeng Zhu and Shichao Zhang and Zhi Jin and Zili Zhang and Zhuoming Xu}, title = {Missing Value Estimation for MixedAttribute Data Sets}, journal ={IEEE Transactions on Knowledge and Data Engineering}, volume = {23}, number = {1}, issn = {10414347}, year = {2011}, pages = {110121}, doi = {http://doi.ieeecomputersociety.org/10.1109/TKDE.2010.99}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, }  
RefWorks Procite/RefMan/Endnote  x  
TY  JOUR JO  IEEE Transactions on Knowledge and Data Engineering TI  Missing Value Estimation for MixedAttribute Data Sets IS  1 SN  10414347 SP110 EP121 EPD  110121 A1  Xiaofeng Zhu, A1  Shichao Zhang, A1  Zhi Jin, A1  Zili Zhang, A1  Zhuoming Xu, PY  2011 KW  Classification KW  data mining KW  methodologies KW  machine learning. VL  23 JA  IEEE Transactions on Knowledge and Data Engineering ER   
[1] I.A. Ahamad and P.B. Cerrito, "Nonparametric Estimation of Joint DiscreteContinuous Probability Densities with Applications," J. Statistical Planning and Inference, vol. 41, pp. 349364, 1994.
[2] P. Allison, Missing Data. Sage Publication, Inc., 2001.
[3] J. Aitchison and C.G.G. Aitken, "Multivariate Binary Discrimination by the Kernel Method," Biometrika, vol. 63, pp. 413420, 1976.
[4] J. Barnard and D. Rubin, "SmallSample Degrees of Freedom with Multiple Imputation," Biometrika, vol. 86, pp. 948955, 1999.
[5] G. Batista and M. Monard, "An Analysis of Four Missing Data Treatment Methods for Supervised Learning," Applied Artificial Intelligence, vol. 17, pp. 519533, 2003.
[6] H. Bierens, "Uniform Consistency of Kernel Estimators of a Regression Function under Generalized Conditions," J. Am. Statistical Assoc., vol. 78, pp. 699707, 1983.
[7] C. Blake and C. Merz UCI Repository of Machine Learning Database, http://www.ics.uci.edu/~mlearnMLResoesitory. html , 1998.
[8] M.L. Brown,, "Data Mining and the Impact of Missing Data," Industrial Management and Data Systems, vol. 103, no. 8, pp. 611621, 2003.
[9] R. Caruana, "A NonParametric EMStyle Algorithm for Imputing Missing Value," Artificial Intelligence and Statistics, Jan. 2001.
[10] K. Cios and L. Kurgan, "Knowledge Discovery in Advanced Information Systems," Trends in Data Mining and Knowledge Discovery, N. Pal, L. Jain, and N. Teoderesku, eds., Springer, 2002.
[11] M.A. Delgado and J. Mora, "Nonparametric and SemiParametric Estimation with Discrete Regressors," Econometrica, vol. 63, pp. 14771484, 1995.
[12] A. Dempster and D. Rubin, Incomplete Data in Sample Surveys: Theory and Bibliography, W.G. Madow, I. Olkin, and D. Rubin, eds., vol. 2, pp. 310, Academic Press, 1983.
[13] A. Dempster, N.M. Laird, and D. Rubin, "Maximum Likelihood from Incomplete Data via the EM Algorithm," J. Royal Statistical Soc., vol. 39, pp. 138, 1977.
[14] U. Dick et al., "Learning from Incomplete Data with Infinite Imputation," Proc. Int'l Conf. Machine Learning (ICML '08), pp. 232239, 2008.
[15] Z. Ghahramani and M. Jordan, "Mixture Models for Learning from Incomplete Data," Computational Learning Theory and Natural Learning Systems, R. Greiner, T. Petsche, and S.J. Hanson, eds., vol. IV: Making Learning Systems Practical, pp. 6785, The MIT Press, 1997.
[16] J. Han and M. Kamber, Data Mining Concepts and Techniques, second ed. Morgan Kaufmann Publishers, 2006.
[17] M. Huisman, "Missing Data in Social Network," Proc. Int'l Sunbelt Social Network Conf. (Sunbelt XXVII), 2007.
[18] G. John et al., "IrRelevant Features and the Subset Selection Problem," Proc. 11th Int'l Conf. Machine Learning, W. Cohen and H. Hirsch, eds., pp. 121129, 1994.
[19] M.C. Jones, J.S. Marron, and S.J. Sheather, "A Brief Survey of Bandwidth Selection for Density Estimation," J. Am. Statistical Assoc., vol. 91, no. 433, pp. 401407, 1996.
[20] E.M. Jordaan, "Development of Robust Inferential Sensors: Industrial Application of Support Vector Machines for Regression," PhD thesis, Technical University Eindhoven, 2002.
[21] K. Lakshminarayan et al., "Imputation of Missing Data in Industrial Databases," Applied Intelligence, vol. 11, pp. 259275, 1999.
[22] R. Little and D. Rubin, Statistical Analysis with Missing Data, second ed. John Wiley and Sons, 2002.
[23] R. Marco, "Learning Bayesian Networks from Incomplete Databases," Technical Report kmi976, Knowledge Media Inst., The Open Univ., 1997.
[24] C. Peng and J. Zhu, "Comparison of Two Approaches for Handling Missing Covariates in Logistic Regression," Educational and Psychological Measurement, vol. 68, no. 1, pp. 5877, 2008.
[25] Y.S. Qin et al., "SemiParametric Optimization for Missing Data Imputation," Applied Intelligence, vol. 21, no. 1, pp. 7988, 2007.
[26] Y.S. Qin et al., "POP Algorithm: KernelBased Imputation to Treat Missing Values in Knowledge Discovery from Databases," Expert Systems with Applications, vol. 36, pp. 27942804, 2009.
[27] J.R. Quinlan, "Unknown Attribute values in Induction," Proc. Sixth Int'l Workshop Machine Learning, pp. 164168, 1989.
[28] J.R. Quinlan, C4.5: Programs for Machine Learning. Morgan Kaufmann, 1993.
[29] J. Racine and Q. Li, "Nonparametric Estimation of Regression Functions with Both Categorical and Continuous Data," J. Econometrics, vol. 119, no. 1, pp. 99130, 2004.
[30] V.C. Raykar and R. DuraiswamiFast, "Fast Optimal Bandwidth Selection for Kernel Density Estimation," Proc. SIAM Int'l Conf. Data Mining (SDM '06), pp. 524528, 2006.
[31] D. Rubin, Multiple Imputation for Nonresponse in Surveys. Wiley, 1987.
[32] J.L. Schafer, Analysis of Incomplete Multivariate Data. Chapman and Hall/CRC, 1997.
[33] B. Silverman, Density Estimation for Statistics and Data Analysis. Chapman and Hall, 1986.
[34] G.F. Smits and E.M. Jordaan, "Improved SVM Regression Using Mixtures of Kernels," Proc. 2002 Int'l Joint Conf. Neural Networks, pp. 27852790, 2002.
[35] Q.H. Wang and R. Rao, "Empirical LikelihoodBased Inference under Imputation for Missing Response Data," Annals of Statistics, vol. 30, pp. 896924, 2002.
[36] S.C. Zhang et al., "Missing Is Useful: Missing Values in CostSensitive Decision Trees," IEEE Trans. Knowledge and Data Eng., vol. 17, no. 12, pp. 16891693, Dec. 2005.
[37] S.C. Zhang et al., "Information Enhancement for Data Mining," IEEE Intelligent Systems, vol. 19, no. 2, pp. 1213, Mar./Apr. 2004.
[38] S.C. Zhang, "Parimputation: From Imputation and NullImputation to Partially Imputation," IEEE Intelligent Informatics Bull., vol. 9, no. 1, pp. 3238, Nov. 2008.
[39] S. Zhang, "ShellNeighbor Method and Its Application in Missing Data Imputation," Applied Intelligence, doi: 10.1007/s1048900902076.
[40] W. Zhang, "Association Based Multiple Imputation in Multivariate Data Sets: A Summary," Proc. Int'l Conf. Data Eng. (ICDE), p. 310, 2000.
[41] S. Zheng, J. Liu, and J. Tian, "An Efficient Star Acquisition Method Based on SVM with Mixtures of Kernels," Pattern Recognition Letters, vol. 26, pp. 147165, 2005.
[42] C. Zhang, X. Zhu, J. Zhang, Y. Qin, and S. Zhang, "GBKII: An Imputation Method for Missing Values," Proc. 11th PacificAsia Knowledge Discovery and Data Mining Conf. (PAKDD '07), pp. 10801087, 2007.
[43] J. Hausman, B.H. Hall, and Z. Griliches, "Econometric Models for Count Data with an Application to the PatentsR&D Relationship," Econometrica, vol. 52, no. 4, pp. 909938, 1984.