This Article 
 Bibliographic References 
 Add to: 
Accelerating Video-Mining Applications Using Many Small, General-Purpose Cores
September/October 2008 (vol. 28 no. 5)
pp. 8-21
Eric Li, Intel
Wenlong Li, Intel
Jianguo Li, Intel
Yurong Chen, Intel
Tao Wang, Intel
Wei Hu, Intel
Yangzhou Du, Intel
Yimin Zhang, Intel
Emerging video-mining applications such as image and video retrieval and indexing will require real-time processing capabilities. A many-core architecture with 64 small, in-order, general-purpose cores as the accelerator can help meet the necessary performance goals and requirements. The key video-mining modules can achieve parallel speedups of 19 to 62 from 64 cores and get an extra 2.3 speedup from 128-bit SIMD vectorization on the proposed architecture.

1. J. Yuan et al., "THU and ICRC at TRECVID 2007," Proc. TREC Video Retrieval Workshop (TRECVID 07), 2007, tv7.papersthu-icrc.pdf.
2. Y. Chen et al., "Media Mining—Emerging Tera-Scale Computing Applications," Intel Technology J., vol. 11, no. 3, 2007, 2007/v11i3/7-media_miningvol11-i3-art07.pdf .
3. Intel Open Source Computer Vision Library (OpenCV); /.
4. D. Lowe, "Distinctive Image Features from Scale-Invariant Keypoints," Int'l J. Computer Vision, vol. 60, no. 2, 2004, pp. 91-110.
5. M.A. Fischler and R.C. Bolles, "Random Sample Consensus: A Paradigm for Model Fitting with Application to Image Analysis and Automated Cartography," Comm. ACM, vol. 24, no. 6, 1981, pp. 381-395.
6. Y. Gao et al., "Semi Cast Indexing for Videos by NCuts and Page Ranking," Proc. ACM Conf. Image and Video Retrieval (CIVR 07), ACM Press, 2007, pp. 441-447.
7. C. Huang et al., "Vector Boosting for Rotation Invariant Multi-View Face Detection," Proc. IEEE Int'l Conf. Computer Vision (ICCV 05), vol. 1, IEEE CS Press, 2005, pp. 446-453.
8. L. Zhang, H.Z. Ai, and S.H. Lao, "Robust Face Alignment Based On Hierarchical Classifier Network," Proc. ECCV Workshop Human-Computer Interaction, 2006.
9. X-S. Hua, L. Lu, and H.-J Zhang, "Optimization-Based Automated Home Video Editing System," IEEE Trans. Circuits System and Video Technology, vol. 14, no. 5, 2004, pp. 572-583.
10. P.F. Felzenszwalb and D.P. Huttenlocher, "Efficient Graph-Based Image Segmentation," Int'l J. Computer Vision, vol. 59, no. 2, 2004, pp. 167-181.
11. W.L. Li and Y-K. Chen, "Performance Analysis, and Algorithm Consideration of Hough Transform on Chip Multiprocessors," Proc. Workshop Design, Architecture, and Simulation of Chip Multi-Processors, ACM Press, 2007, /.
12. Y. Chen et al., "Data Sharing Performance of Emerging Media Mining Workloads," to be published in Proc. Int'l Conf. High-Performance Computing (HiPC), 2008.
13. C.J. Hughes et al., "Physical Simulation for Animation and Visual Effects: Parallelization and Characterization for Chip Multiprocessors," Proc. Int'l Symp. Computer Architecture (ISCA 07), ACM Press, 2007, pp. 220-231.
14. G. Gerosa, "A Sub-1W to 2W Low-Power IA Processor for Mobile Internet Devices in 45 nm High-κMetal-Gate CMOS," Proc. Int'l Solid-State Circuits Conf. (ISSCC 08), IEEE Press, 2008, pp. 256-257.
15. L. Seiler et al., "Larrabee: A Many-Core ×86 Architecture for Visual Computing," ACM Trans. Graphics (Siggraph), vol. 27, no. 3, Aug. 2008, article no. 18.
16. S. Kumar, C.J. Hughes, and A. Nguyen, "Carbon: Architectural Support for Fine-Grained Parallelism on Chip Multiprocessors," Proc. Int'l Symp. Computer Architecture (ISCA 07), ACM Press, 2007, pp. 162-173.
17. T.Y. Yeh et al., "ParallAX: An Architecture for Real-Time Physics," Proc. Int'l Symp. Computer Architecture (ISCA 07), ACM Press, 2007, pp. 232-243.

Index Terms:
video mining, accelerator, multicore, SIMD, thread-level parallelism, data-level parallelism
Eric Li, Wenlong Li, Xiaofeng Tong, Jianguo Li, Yurong Chen, Tao Wang, Patricia P. Wang, Wei Hu, Yangzhou Du, Yimin Zhang, Yen-Kuang Chen, "Accelerating Video-Mining Applications Using Many Small, General-Purpose Cores," IEEE Micro, vol. 28, no. 5, pp. 8-21, Sept.-Oct. 2008, doi:10.1109/MM.2008.64
Usage of this product signifies your acceptance of the Terms of Use.