The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.08 - Aug. (2013 vol.35)
pp: 1930-1943
Hoo-Chang Shin , Inst. of Cancer Res. & R. Marsden, NHS Found. Trust, Sutton, UK
M. R. Orton , Inst. of Cancer Res. & R. Marsden, NHS Found. Trust, Sutton, UK
D. J. Collins , Inst. of Cancer Res. & R. Marsden, NHS Found. Trust, Sutton, UK
S. J. Doran , Inst. of Cancer Res. & R. Marsden, NHS Found. Trust, Sutton, UK
M. O. Leach , Inst. of Cancer Res. & R. Marsden, NHS Found. Trust, Sutton, UK
ABSTRACT
Medical image analysis remains a challenging application area for artificial intelligence. When applying machine learning, obtaining ground-truth labels for supervised learning is more difficult than in many more common applications of machine learning. This is especially so for datasets with abnormalities, as tissue types and the shapes of the organs in these datasets differ widely. However, organ detection in such an abnormal dataset may have many promising potential real-world applications, such as automatic diagnosis, automated radiotherapy planning, and medical image retrieval, where new multimodal medical images provide more information about the imaged tissues for diagnosis. Here, we test the application of deep learning methods to organ identification in magnetic resonance medical images, with visual and temporal hierarchical features learned to categorize object classes from an unlabeled multimodal DCE-MRI dataset so that only a weakly supervised training is required for a classifier. A probabilistic patch-based method was employed for multiple organ detection, with the features learned from the deep learning model. This shows the potential of the deep learning model for application to medical images, despite the difficulty of obtaining libraries of correctly labeled training datasets and despite the intrinsic abnormalities present in patient datasets.
INDEX TERMS
Liver, Training, Machine learning, Medical diagnostic imaging, Visualization, Feature extraction,biomedical image processing, Edge and feature detection, object recognition, pixel classification, machine learning
CITATION
Hoo-Chang Shin, M. R. Orton, D. J. Collins, S. J. Doran, M. O. Leach, "Stacked Autoencoders for Unsupervised Feature Learning and Multiple Organ Detection in a Pilot Study Using 4D Patient Data", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.35, no. 8, pp. 1930-1943, Aug. 2013, doi:10.1109/TPAMI.2012.277
REFERENCES
[1] D.G. Lowe, "Object Recognition from Local Scale-Invariant Features," Proc. IEEE Int'l Conf. Computer Vision, vol. 2, pp. 1150-1157, 1999.
[2] H. Bay, T. Tuytelaars, and L.V. Gool, "Surf: Speeded Up Robust Features," Proc. European Conf. Computer Vision, pp. 404-417, 2006.
[3] N. Dalal and B. Triggs, "Histograms of Oriented Gradients for Human Detection," Proc. IEEE Conf. Computer Vision and Pattern Recognition pp. 886-893, 2005.
[4] L. Fei-Fei, R. Fergus, and P. Perona, "One-Shot Learning of Object Categories," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 29, no. 4, pp. 594-611, Apr. 2006.
[5] G. Griffin, A. Holub, and P. Perona, "The Caltech-256 Object Category Dataset," technical report, California Inst. of Tech nology, 2007.
[6] D.J. Collins and A.R. Padhani, "Dynamic Magnetic Resonance Imaging of Tumor Perfusion," IEEE Eng. in Medicine and Biology Magazine, vol. 23, no. 5, pp. 65-83, Sept./Oct. 2004.
[7] A.R. Padhani, G. Liu, D. Mu-Koh, T.L. Chenevert, H.C. Thoney, T. Takahara, A. Dzik-Jurasz, B.D. Ross, M.V. Cauteren, D. Collins, D.A. Hammoud, R.JS. Gordon, T. Bachir, and C.L. Peter, "Diffusion-Weighted Magnetic Resonance Imaging as a Cancer Biomarker: Consensus and Recommendations," Neoplasia, vol. 11, pp. 102-125, 2009.
[8] R. Raina, A. Battle, H. Lee, B. Packer, and A.Y. Ng, "Self-Taught Learning: Transfer Learning from Unlabled Data," Proc. Int'l Conf. Machine Learning, pp. 759-766, 2007.
[9] I. Goodfellow, Q. Le, A. Saxe, H. Lee, and A. Ng, "Measuring Invariances in Deep Networks," Advances in Neural Information Processing Systems, vol. 22, pp. 646-654, 2009.
[10] H. Shin, M. Orton, D.J. Collins, S. Doran, and M.O. Leach, "Autoencoder in Time-Series Analysis for Unsupervised Tissues Characterisation in a Large Unlabelled Medical Image Data Set," Proc. IEEE Int'l Conf. Machine Learning and Application, pp. 259-264, 2011.
[11] M. Weber, M. Welling, and P. Perona, "Towards Automatic Discovery of Object Categories," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 101-108, 2000.
[12] R. Fergus, P. Perona, and A. Zisserman, "Object Class Recognition by Unsupervised Scale-Invariant Learning," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 264-271, 2003.
[13] E.J. Bernstein and Y. Amit, "Part-Based Statistical Models for Object Classification and Detection," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 734-740, 2005.
[14] A. Torralba, K.P. Murphy, and W.T. Freeman, "Sharing Visual Features for Multiclass and Multiview Object Detection," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 29, no. 5, pp. 854-869, May 2007.
[15] P.F. Felzenszwalb, R.B. Girshick, D. McAllester, and D. Ramanan, "Object Detection with Discriminatively Trained Part-Based Models," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 32, no. 9, pp. 1627-1645, Sept. 2010.
[16] S. Ji, W. Xu, M. Yang, and K. Yu, "3D Convolutional Neural Networks for Human Action Recognition," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 35, no. 1, pp. 221-321, Jan. 2013.
[17] J.C. Niebles, J. Wang, and L. Fei-Fei, "Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words," Int'l J. Computer Vision, vol. 79, pp. 299-318, 2008.
[18] H. Lee, R. Grosse, R. Ranganath, and A. Ng, "Unsupervised Learning of Hierarchical Representations with Convolutional Deep Belief Networks," Comm. ACM, vol. 54, no. 10, pp. 95-103, 2011.
[19] M.A. Ranzato, F.J. Huang, Y.L. Boureau, and Y. LeCun, "Unsupervised Learning of Invariant Feature Hierarchies with Applications to Object Recognition," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1-8, June 2007.
[20] M.D. Zeiler, G.W. Tayler, and R. Fergus, "Adaptive Deconvolutional Networks for Mid and High Level Feature Learning," Proc. IEEE Int'l Conf. Computer Vision, pp. 2018-2025, Nov. 2011.
[21] K. Sohn, D.Y. Jung, H. Lee, and A.O. Hero, "Efficient Learning of Sparse, Distributed, Convolutional Feature Representations for Object Recognition," Proc. IEEE Int'l Conf. Computer Vision, pp. 2643-2650, 2011.
[22] X. Glorot, A. Bordes, and Y. Bengio, "Domain Adaptation for Large-Scale Sentiment Classification: A Deep Learning Approach," Proc. 28th Int'l Conf. Machine Learning, pp. 513-520, 2011.
[23] K. Yu, W. Xu, and Y. Gong, "Deep Learning with Kernel Regularization for Visual Recognition," Advances in Neural Information Processing Systems, vol. 21, pp. 1889-1896, 2008.
[24] L. Bazzani, N. Freitas, H. Larochelle, V. Murino, and T. Jo-Anne, "Learning Attentional Policies for Tracking and Recognition in Video with Deep Networks," Proc. 28th Int'l Conf. Machine Learning, pp. 937-944, 2011.
[25] D.E. Rumelhart and J.L. McClelland, "Parallel Distributed Processing: Psychological and Biological Models," Information Processing in Dynamical Systems: Foundations of Harmony Theory, vol. 1, pp. 194-281, 1986.
[26] T. Schmah, G.E. Hinton, R. Zemel, S.L. Small, and S. Strother, "Generative versus Discriminative Training of RBMs for Classification of fMRI Images," Advances in Neural Information Processing Systems, vol. 21, pp. 1409-1416, 2009.
[27] J. Ngiam, A. Khosla, M. Kim, J. Nam, H. Lee, and A.Y. Ng, "Multimodal Deep Learning," Proc. Int'l Conf. Machine Learning, 2011.
[28] Q.V. Le, W.Y. Zou, S.Y. Yeung, and A.Y. Ng, "Learning Hierarchical Invariant Spatio-Temporal Features for Action Recognition with Independent Subspace Analysis," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 3361-3368, 2011.
[29] L. Li and B.A. Prakash, "Time Series Clustering: Complex Is Simpler!" Proc. Int'l Conf. Machine Learning, pp. 185-192, 2011.
[30] E. Geremia, B. Menze, O. Clatz, E. Konukoglu, A. Criminisini, and N. Ayache, "Spatial Decision Forests for MS Lesion Segmentation in Multi-Channel MR Images," Proc. Medical Image Computing and Computer-Assisted Intervention, pp. 111-118, 2010.
[31] J. Corso, E. Sharon, S. Dube, S. El-Saden, U. Sinha, and A. Yuille, "Efficient Multilevel Brain Tumor Segmentation with Integrated Bayesian Model Classification," IEEE Trans. Medical Imaging, vol. 27, no. 5, pp. 629-640, May 2008.
[32] M. Clark, L. Hall, D. Goldgof, R. Velthuizen, F. Murtagh, and M. Silbiger, "Automatic Tumor Segmentation Using Knowledge-Based Techniques," IEEE Trans. Medical Imaging, vol. 17, no. 2, pp. 187-201, Apr. 1998.
[33] A. Farhangfar, R. Greiner, and C. Szepesvá'ri, "Learning to Segment from a Few Well-Selected Training Images," Proc. Int'l Conf. Machine Learning, pp. 305-312, 2009.
[34] H. Shin, "Hybrid Clustering and Logistic Regression for Multi-Modal Brain Tumor Segmentation," Proc. Workshops and Challenges in Medical Image Computing and Computer-Assisted Intervention, 2012.
[35] T. Okada, K. Yokota, M. Hori, M. Nakamoto, H. Nakamura, and Y. Sato, "Construction of Hierarchical Multi-Organ Statistical Atlases and Their Application to Multi-Organ Segmentation from CT Images," Proc. Medical Image Computing and Computer-Assisted Intervention, pp. 502-509, 2008.
[36] M.G. Linguraru and R.M. Summers, "Multi-Organ Automatic Segmentation in 4D Contrast-Enhanced Abdominal CT," Proc. IEEE Int'l Symp. Biomedical Imaging, pp. 45-48, 2008.
[37] O. Pauly, B. Glocker, A. Criminisi, D. Mateus, A. Möller, S. Nekolla, and N. Navab, "Fast Multiple Organ Detection and Localization in Whole-Body MR Dixon Sequences," Proc. Medical Image Computing and Computer-Assisted Intervention, pp. 239-247, 2011.
[38] J. Iglesias, E. Konukoglu, A. Montillo, Z. Tu, and A. Crimisini, "Combining Generative and Discriminative Models for Semantic Segmentation of CT Scans via Active Learning," Information Processing in Medical Imaging, vol. 6801, pp. 25-36, 2011.
[39] M.R. Orton, K. Miyazaki, D.M. Koh, D.J. Collins, D.J. Hawkes, D. Atkinson, and M.O. Leach, "Optimizing Functional Parameter Accuracy for Breath-Hold DCE-MRI of Liver Tumours," Physics in Medicine and Biology, vol. 54, pp. 2197-2215, 2009.
[40] A.P. Dempster, N.M. Laird, and D.B. Rubin, "Maximum Likelihood from Incomplete Data via the EM Algorithm," J. Royal Statistical Soc., vol. 39, pp. 1-38, 1977.
[41] R. Marc'Aurelio, L. Boureau, and Y. LeCun, "Sparse Feature Learning for Deep Belief Networks," Advances in Neural Information Processing Systems, vol. 20, 2007.
[42] Y. Bengio, P. Lamblin, D. Popovici, H. Larochelle, and U. Montreal, "Greedy Layer-Wise Training of Deep Networks," Advances in Neural Information Processing Systems, vol. 19, p. 153, 2007.
[43] P. Vincent, H. Larochelle, Y. Bengio, and P.A. Manzagol, "Extracting and Composing Robust Features with Denoising Autoencoders," Proc. 25th Int'l Conf. Machine Learning, pp. 1096-1103, 2008.
[44] H. Lee, C. Ekanadham, and A. Ng, "Sparse Deep Belief Net Model for Visual Area V2," Advances in Neural Information Processing Systems, vol. 20, pp. 873-880, 2008.
[45] H. Larochelle, Y. Bengio, J. Louradour, and P. Lamblin, "Exploring Strategies for Training Deep Neural Networks," The J. Machine Learning Research, vol. 10, pp. 1-40, 2009.
[46] S. Kullback and R.A. Leibler, "On Information and Sufficiency," The Annals of Math. Statistics, vol. 22, pp. 79-86, 1951.
[47] B.A. Olshausen and D.J. Field, "Sparse Coding with an Overcomplete Basis Set: A Strategy Employed by VI?" Vision Research, vol. 37, pp. 3311-3325, 1997.
[48] D.E. Rumelhart, G.E. Hinton, and R.J. Williams, "Learning Representations by Back-Propagating Errors," Nature, vol. 323, pp. 533-536, 1986.
[49] D.C. Liu and J. Nocedal, "On the Limited Memory BFGS Method for Large Scale Optimization," Math. Programming, vol. 45, pp. 503-528, 1989.
[50] A. Coates, H. Lee, and A.Y. Ng, "An Analysis of Single-Layer Networks in Unsupervised Feature Learning," Proc. Int'l Conf. Artificial Intelligence and Statistics, vol. 15, pp. 215-223, 2011.
[51] G.E. Hinton, S. Osindero, and Y.W. Teh, "A Fast Learning Algorithm for Deep Belief Nets," Neural Computation, vol. 18, no. 7, pp. 1527-1554, 2006.
[52] D. Hubel and T. Wiesel, "Receptive Fields and Functional Architecture in Two Nonstriate Visual Areas (18 and 19) of the Cat," J. Neurophysiology, vol. 28, pp. 229-89, 1965.
[53] J. Sivic, B.C. Russel, A.A. Efros, A. Zisserman, and W.T. Freeman, "Discovering Objects and Their Location in Images," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 370-377, 2005.
[54] E. Nowak, F. Jurie, and B. Triggs, "Sampling Strategies for Bag-of-Features Image Classification," Proc. European Conf. Computer Vision pp. 490-503, 2006.
[55] S. Lazebnik, C. Schmid, and J. Ponce, "Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 2169-2178, 2006.
[56] J. Bergstra and Y. Bengio, "Random Search for Hyper-Parameter Optimization," J. Machine Learning Research, vol. 13, pp. 281-305, 2012.
[57] J. Snoek, H. Larochelle, and R.P. Adams, "Practical Bayesian Optimization of Machine Learning Algorithms," Proc. Advances in Neural Information Processing Systems Conf., 2012.
[58] Q.V. Le, A. Karpenko, J. Ngiam, and A.Y. Ng, "ICA with Reconstruction Cost for Efficient Overcomplete Feature Learning," Advances in Neural Information Processing Systems, vol. 24, pp. 1017-1025, 2011.
[59] Y. LeCun and Y. Bengio, "Convolutional Networks for Images, Speech, and Time Series," The Handbook of Brain Theory and Neural Networks, pp. 255-257, MIT Press, 1995.
35 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool