The Community for Technology Leaders
20th Annual International Conference on High Performance Computing (2012)
Pune, India India
Dec. 18, 2012 to Dec. 22, 2012
ISBN: 978-1-4673-2372-7
TABLE OF CONTENTS
Papers

I/O efficient QR and QZ algorithms (PDF)

Sajith Gopalan , Dept. of Computer Science & Engineering Indian Institute of Technology Guwahati Guwahati, Assam, India, PIN-781039
pp. 1-9

Author index (PDF)

pp. 1-22

[Front cover] (PDF)

pp. 1

Task-based parallel breadth-first search in heterogeneous environments (PDF)

Lluis-Miquel Munguia , Barcelona School of Informatics, Universitat Politècnica de Catalunya, Barcelona, Spain
David A. Bader , College of Computing Georgia Institute of Technology, Atlanta GA 30332
Eduard Ayguade , Barcelona Supercomputing Center (BSC), Spain
pp. 1-10

GMProf: A low-overhead, fine-grained profiling approach for GPU programs (PDF)

Mai Zheng , Dept. of Computer Science and Engineering, The Ohio State University 2015 Neil Avenue, Columubs, OH, USA
Vignesh T. Ravi , Dept. of Computer Science and Engineering, The Ohio State University 2015 Neil Avenue, Columubs, OH, USA
Wenjing Ma , Pacific Northwest National Lab P.O. Box 999, Richland, WA, USA
Feng Qin , Dept. of Computer Science and Engineering, The Ohio State University 2015 Neil Avenue, Columubs, OH, USA
Gagan Agrawal , Dept. of Computer Science and Engineering, The Ohio State University 2015 Neil Avenue, Columubs, OH, USA
pp. 1-10

A fault tolerant self-scheduling scheme for parallel loops on shared memory systems (PDF)

Yizhuo Wang , School of Computer Science and Technology Beijing Institute of Technology Beijing, China
Alexandru Nicolau , Department of Computer Science University of California Irvine, USA
Rosario Cammarota , Department of Computer Science University of California Irvine, USA
Alexander V. Veidenbaum , Department of Computer Science University of California Irvine, USA
pp. 1-10

Parallelization of Faugere's improved F4 algorithm (PDF)

Yashodhan Karandikar , Computational Research Laboratories Pune, India
Prakrati Agrawal , Computational Research Laboratories Pune, India
Habeeb Syed , Computational Research Laboratories Pune, India
Jojumon Kavalan , Computational Research Laboratories Pune, India
pp. 1-8

VMAP: Proactive thermal-aware virtual machine allocation in HPC cloud datacenters (PDF)

Eun Kyung Lee , NSF Cloud and Autonomic Computing Center Department of Electrical and Computer Engineering, Rutgers University, New Brunswick
Hariharasudhan Viswanathan, , NSF Cloud and Autonomic Computing Center Department of Electrical and Computer Engineering, Rutgers University, New Brunswick
Dario Pompili , NSF Cloud and Autonomic Computing Center Department of Electrical and Computer Engineering, Rutgers University, New Brunswick
pp. 1-10

Shared memory parallelization of fully-adaptive simulations using a dynamic tree-split and -join approach (PDF)

Martin Schreiber , Technische Universität München Munich, Germany
Hans-Joachim Bungartz , Technische Universität München Munich, Germany
Michael Bader , Technische Universität München Munich, Germany
pp. 1-10

Elasticat: A load rebalancing framework for cloud-based key-value stores (PDF)

Xiulei Qin , Institute of Software, Chinese Academy of Sciences
Wei Wang , Institute of Software, Chinese Academy of Sciences
Wenbo Zhang , Institute of Software, Chinese Academy of Sciences
Jun Wei , Institute of Software, Chinese Academy of Sciences
Xin Zhao , Institute of Software, Chinese Academy of Sciences
Tao Huang , Institute of Software, Chinese Academy of Sciences
pp. 1-10

Energy-aware scheduling under reliability and makespan constraints (PDF)

Guillaume Aupy , LIP, Ecole Normale Supérieure de Lyon
Anne Benoit , LIP, Ecole Normale Supérieure de Lyon
Yves Robert , LIP, Ecole Normale Supérieure de Lyon
pp. 1-10

Efficient update of ghost regions using active messages (PDF)

Josh Milthorpe , Research School of Computer Science Australian National University
Alistair P. Rendell , Research School of Computer Science Australian National University
pp. 1-9

CU2rCU: Towards the complete rCUDA remote GPU virtualization and sharing solution (PDF)

C. Reano , Universitat Politècnica de València 46.022-València Spain
A. J. Peña , Universitat Politècnica de València 46.022-València Spain
F. Silla , Universitat Politècnica de València 46.022-València Spain
J. Duato , Universitat Politècnica de València 46.022-València Spain
R. Mayo , Universitat Jaume I 12.071-Castellón Spain
E. S. Quintana-Orti , Universitat Jaume I 12.071-Castellón Spain
pp. 1-10

OSQR: A framework for ontology-based semantic query routing in unstructured P2P networks (PDF)

D M Rasanjalee Himali , Department of Computer Science Georgia State University Atlanta, GA, USA
Shamkant B. Navathe , College of Computing, Georgia Institute of Technology Atlanta, GA, USA
Sushil K Prasad , Department of Computer Science Georgia State University Atlanta, GA, USA
pp. 1-10

A fault-tolerant environment for large-scale query processing (PDF)

Mehmet Can Kurt , Department of Computer Science and Engineering Ohio State University Columbus, OH, 43210
Gagan Agrawal , Department of Computer Science and Engineering Ohio State University Columbus, OH, 43210
pp. 1-10

Massively parallel landscape-evolution modelling using general purpose graphical processing units (PDF)

A. S. McGough , School of Computing Science Newcastle University Newcastle upon Tyne, UK
S. Liang , School of Computing Science Newcastle University Newcastle upon Tyne, UK
M. Rapoportas , School of Computing Science Newcastle University Newcastle upon Tyne, UK
R. Grey , School of Computing Science Newcastle University Newcastle upon Tyne, UK
G. Kumar Vinod , School of Computing Science Newcastle University Newcastle upon Tyne, UK
D. Maddy , School of Geography, Politics and Sociology Newcastle University Newcastle upon Tyne, UK
A. Trueman , School of Geography, Politics and Sociology Newcastle University Newcastle upon Tyne, UK
J. Wainwright , Department of Geography Durham University Durham, UK
pp. 1-10

Campaign scheduling (PDF)

Vinicius Pinheiro , Lab. for Parallel and Distributed Computing University of São Paulo, Brasil
Krzysztof Rzadca , Institute of Informatics University of Warsaw, Poland
Denis Trystram , Grenoble Institute of Technology Institut Universitaire de France
pp. 1-10

Design and implementation of a parallel priority queue on many-core architectures (PDF)

Xi He , Department of Computer Science Georgia State University
Dinesh Agarwal , Department of Computer Science Georgia State University
Sushil K. Prasad , Department of Computer Science Georgia State University
pp. 1-10

Intrusion detection techniques for virtual domains (PDF)

Udaya Tupakula , INSS Research, Faculty of Science Macquarie University, Sydney, Australia
Vijay Varadharajan , INSS Research, Faculty of Science Macquarie University, Sydney, Australia
Dipankar Dutta , Amazon Development Center Bangalore, India
pp. 1-9

A theory and methodology for combining data centre networks (PDF)

Frank Olaf Sem-Jacobsen , Simula Research Laboratory Norway
Ralph Lorentzen , Simula Research Laboratory Norway
Olav Lysne , Simula Research Laboratory and Univeristy of Oslo Norway
pp. 1-8

Mapping strategies for the PERCS architecture (PDF)

Venkatesan T. Chakaravarthy , IBM Research - India
Monu Kedia , IBM Research - India
Yogish Sabharwal , IBM Research - India
Naga Praveen Kumar Katta , Princeton University
Aruna Ramanan , IBM USA
pp. 1-10

Modern HPC cluster virtualization using KVM and palacios (PDF)

Alexander Kudryavtsev , Institute for System Programming, Russian Academy of Sciences 109004, Moscow, Alexander Solzhenitsyn st., 25, Russian Federation
Vladimir Koshelev , Institute for System Programming, Russian Academy of Sciences 109004, Moscow, Alexander Solzhenitsyn st., 25, Russian Federation
Arutvun Avetisyan , Institute for System Programming, Russian Academy of Sciences 109004, Moscow, Alexander Solzhenitsyn st., 25, Russian Federation
pp. 1-9

A hybrid parallel algorithm for computing and tracking level set topology (PDF)

Senthilnathan Maadasamy , Department of Computer Science and Automation, Indian Institute of Science, Bangalore 560012, India
Harish Doraiswamy , Department of Computer Science and Automation, Indian Institute of Science, Bangalore 560012, India
Vijay Natarajan , Department of Computer Science and Automation, Indian Institute of Science, Bangalore 560012, India
pp. 1-10

Distributed hierarchical co-clustering and collaborative filtering algorithm (PDF)

Ankur Narang , IBM India Research Laboratory New Delhi, India
Abhinav Srivastava , IBM India Research Laboratory New Delhi, India
Naga Praveen Kumar Katta , Princeton University New Jersey, USA
pp. 1-10

Scalable performance of ScaleGraph for large scale graph analysis (PDF)

Miyuru Dayarathna , Department of Computer Science, Tokyo Institute of Technology
Charuwat Houngkaew , Department of Computer Science, Tokyo Institute of Technology
Hidefumi Ogata , Department of Computer Science, Tokyo Institute of Technology
Toyotaro Suzumura , Department of Computer Science, Tokyo Institute of Technology
pp. 1-9

Optimizing resource utilization with software-based temporal multi-threading (stmt) (PDF)

Vicenc Beltran , Barcelona Supercomputing Center (BSC) Barcelona, Spain
Eduard Ayguade , Barcelona Supercomputing Center (BSC) Technical University of Catalonia (UPC) Barcelona, Spain
pp. 1-10

Executing a biological sequence comparison application on a federated cloud environment (PDF)

Alessandro Ferreira Leite , Department of Computer Science University of Brasilia, Brasilia, Brazil
Alba Cristina Magalhaes Alves de Melo , Department of Computer Science University of Brasilia, Brasilia, Brazil
pp. 1-9

Dynamic load-balancing with variable number of processors based on graph repartitioning (PDF)

Clement Vuchener , Univ. Bordeaux, LaBRI, UMR 5800, F-33400 Talence, France
Aurelien Esnard , Univ. Bordeaux, LaBRI, UMR 5800, F-33400 Talence, France
pp. 1-9

Grabfast: A CUDA based GPU accelerated fast short sequence alignment algorithm (PDF)

Ankur Narang , IBM India Research Labs Vasant Kunj, Delhi, India-110070
Jyothish Soman , IBM India Research Labs Vasant Kunj, Delhi, India-110070
Sheetal Lahabar , IBM India Research Labs Vasant Kunj, Delhi, India-110070
pp. 1-10

Fault tolerant parallel data-intensive algorithms (PDF)

Mucahid Kutlu , Department of Computer Science and Engineering Ohio State University Columbus, OH, 43210
Gagan Agrawal , Department of Computer Science and Engineering Ohio State University Columbus, OH, 43210
Oguz Kurt , Department of Mathematics Ohio State University Columbus, OH, 43210
pp. 1-10

Automatic efficient data layout for multithreaded stencil codes on CPU sand GPUs (PDF)

Julien Jaeger , University of Versailles St Quentin, FR
Denis Barthou , Univcrsiry of Bordeaux / INRIA Bordeaux Sud-Ouest, FR
pp. 1-10

Password recovery using MPI and CUDA (PDF)

David Apostal , Department of Computer Science, University of North Dakota
Kyle Foerster , Department of Electrical Engineering, University of North Dakota
Amrita Chatterjee , Department of Computer Science, University of North Dakota
Travis Desell , Department of Computer Science, University of North Dakota
pp. 1-9

Designing scalable PGAS communication subsystems on cray gemini interconnect (PDF)

Abhinav Vishnu , Pacific Northwest National Laboratory 902 Battelle Blvd, Richland, WA 99352
Jeff Daily , Pacific Northwest National Laboratory 902 Battelle Blvd, Richland, WA 99352
Bruce Palmer , Pacific Northwest National Laboratory 902 Battelle Blvd, Richland, WA 99352
pp. 1-10

I/O performance characterization of Lustre and NASA applications on Pleiades (PDF)

Subhash Saini , NASA Advanced Supercomputing (NAS) Division NASA Ames Research Center Moffett Field, California 94035-1000, USA
Jason Rappleye , NASA Advanced Supercomputing (NAS) Division NASA Ames Research Center Moffett Field, California 94035-1000, USA
Johnny Chang , NASA Advanced Supercomputing (NAS) Division NASA Ames Research Center Moffett Field, California 94035-1000, USA
David Barker , NASA Advanced Supercomputing (NAS) Division NASA Ames Research Center Moffett Field, California 94035-1000, USA
Piyush Mehrotra , NASA Advanced Supercomputing (NAS) Division NASA Ames Research Center Moffett Field, California 94035-1000, USA
Rupak Biswas , NASA Advanced Supercomputing (NAS) Division NASA Ames Research Center Moffett Field, California 94035-1000, USA
pp. 1-10

GPU task parallelism for scalable anomaly detection (PDF)

Koji Ueno , Tokyo Institute of Technology / JST CREST
Toyotaro Suzumura , Tokyo Institute of Technology IBM Research - Tokyo / JST CREST
pp. 1-10

A global address space approach to automated data management for parallel Quantum Monte Carlo applications (PDF)

Qingpeng Niu , Dept. of Comp. Sci. and Eng., The Ohio State Univ.
James Dinan , Math. and Comp. Sci. Div., Argonne National Lab.
Sravya Tirukkovalur , Dept. of Comp. Sci. and Eng., The Ohio State Univ.
Lubos Mitas , Dept. of Physics, North Carolina State Univ.
Lucas Wagner , Dept. of Physics, Univ. of Illinois at Urbana-Champaign
P. Sadayappan , Dept. of Comp. Sci. and Eng., The Ohio State Univ.
pp. 1-10

A load-sharing architecture for high performance optimistic simulations on multi-core machines (PDF)

Roberto Vitali , DIIAG, Sapienza, Università di Roma
Alessandro Pellegrini , DIIAG, Sapienza, Università di Roma
Francesco Quaglia , DIIAG, Sapienza, Università di Roma
pp. 1-10

Parallel hierarchical clustering on shared memory platforms (PDF)

William Hendrix , Department of Electrical Engineering and Computer Science Northwestern University Evanston, IL 60208
Md. Mostofa Ali Patwary , Department of Electrical Engineering and Computer Science Northwestern University Evanston, IL 60208
Ankit Agrawal , Department of Electrical Engineering and Computer Science Northwestern University Evanston, IL 60208
Wei-keng Liao , Department of Electrical Engineering and Computer Science Northwestern University Evanston, IL 60208
Alok Choudhary , Department of Electrical Engineering and Computer Science Northwestern University Evanston, IL 60208
pp. 1-9

A scalable messaging system for accelerating discovery from large scale scientific simulations (PDF)

Tong Jin , The NSF Center for Cloud and Autonomic Computing Rutgers University, Piscataway NJ, USA
Fan Zhang , The NSF Center for Cloud and Autonomic Computing Rutgers University, Piscataway NJ, USA
Manish Parashar , The NSF Center for Cloud and Autonomic Computing Rutgers University, Piscataway NJ, USA
Scott Klasky , Oak Ridge National Laboratory P.O. Box 2008, Oak Ridge, TN, 37831, USA
Norbert Podhorszki , Oak Ridge National Laboratory P.O. Box 2008, Oak Ridge, TN, 37831, USA
Hasan Abbasi , Oak Ridge National Laboratory P.O. Box 2008, Oak Ridge, TN, 37831, USA
pp. 1-10

Optimization of the hop-byte metric for effective topology aware mapping (PDF)

C. D. Sudheer , Department of Mathematics and Computer Science Sri Sathya Sai Institute of Higher Learning, India
A. Srinivasan , Department of Computer Science Florida State University, Tallahassee, FL 32306, USA
pp. 1-9

The design and implementation of a multi-level content-addressable checkpoint file system (PDF)

Abhishek Kulkarni , Indiana University
Adam Manzanares , Los Alamos National Laboratory
Latchesar Ionkov , Los Alamos National Laboratory
Michael Lang , Los Alamos National Laboratory
Andrew Lumsdaine , Indiana University
pp. 1-10

A new method for face recognition with fewer features under illumination and expression variations (PDF)

Chandan Tripathi , Dept. of Computer Science Engg. Sharda University Greater Noida, India-201306
K. P. Singh , Dept. of Information Technology IIIT Allahabad Allahabad, India - 211012
pp. 1-9

A distributed abnormal packet generation engine based on MapReduce (PDF)

Zhang Qi-fei , College of Computer Science and Technology Zhejiang University Hanzzhou, Zhejsiang 31002 7, China
Lv Hong-bin , College of Computer Science and Technology Zhejiang University Hanzzhou, Zhejsiang 31002 7, China
Pan Xue-zeng , College of Computer Science and Technology Zhejiang University Hanzzhou, Zhejsiang 31002 7, China
Wang Chao , College of Computer Science and Technology Zhejiang University Hanzzhou, Zhejsiang 31002 7, China
Li Wen-juan , College of Qianjiang Hangzhou Normal University Hangzhou, Zhejiang 310036 China
pp. 1-5

Visualization of network data provenance (PDF)

Peng Chen , School of Informatics and Computing Indiana University, Bloomington, IN, USA
Beth Plale , School of Informatics and Computing Indiana University, Bloomington, IN, USA
You-Wei Cheah , School of Informatics and Computing Indiana University, Bloomington, IN, USA
Devarshi Ghoshal , School of Informatics and Computing Indiana University, Bloomington, IN, USA
Scott Jensen , School of Informatics and Computing Indiana University, Bloomington, IN, USA
Yuan Luo , School of Informatics and Computing Indiana University, Bloomington, IN, USA
pp. 1-9

An automatic machine scaling solution for cloud systems (PDF)

Marta Beltran , Computer Architecture Department, ETSII Rey Juan Carlos University Madrid, Spain
Antonio Guzman , Computer Architecture Department, ETSII Rey Juan Carlos University Madrid, Spain
pp. 1-10

Shared disk big data analytics with Apache Hadoop (PDF)

Anirban Mukherjee , Symantec Corporation ICON, Baner Road, Pune - 411021, India
Joydip Datta , Symantec Corporation ICON, Baner Road, Pune - 411021, India
Raghavendra Jorapur , Symantec Corporation ICON, Baner Road, Pune - 411021, India
Ravi Singhvi , Symantec Corporation ICON, Baner Road, Pune - 411021, India
Saurav Haloi , Symantec Corporation ICON, Baner Road, Pune - 411021, India
Wasim Akram , Symantec Corporation ICON, Baner Road, Pune - 411021, India
pp. 1-6

Framework and user migration strategy of cloud-based video conference multi-gateway system (PDF)

Hongen Feng , State Key Lab of Software Development Environment School of Computer Science and Engineering, Beihang University Beijing, China
Wenjun Wu , State Key Lab of Software Development Environment School of Computer Science and Engineering, Beihang University Beijing, China
pp. 1-8

Towards highly scalable X10 based spectral clustering (PDF)

Hidefumi Ogata , Department of Computer Science, Tokyo Institute of Technology
Miyuru Dayarathna , Department of Computer Science, Tokyo Institute of Technology
Toyotaro Suzumura , Department of Computer Science, Tokyo Institute of Technology
pp. 1-5

Towards scalable optimal sequence homology detection (PDF)

Jeff Daily , Pacific Northwest National Laboratory
Sriram Krishnamoorthy , Pacific Northwest National Laboratory
Ananth Kalyanaraman , Washington State University
pp. 1-8

Efficient cache exploration method for a tiled chip multiprocessor (PDF)

Aparna Mandke Dani , Indian Institute of Science
Y. N. Srikant , Indian Institute of Science
Bharadwaj Amrutur , Indian Institute of Science
pp. 1-6

Eiger: A framework for the automated synthesis of statistical performance models (PDF)

Andrew Kerr , School of Electrical and Computer Engineering Georgia Institute of Technology
Eric Anger , School of Electrical and Computer Engineering Georgia Institute of Technology
Gilbert Hendry , Sandia National Laboratories
Sudhakar Yalamanchili , School of Electrical and Computer Engineering Georgia Institute of Technology
pp. 1-6

Profiling and scalability of the high resolution NCEP model for weather and climate simulations (PDF)

R Phani , Indian Institute of Tropical Meteorology Dr. Homi Bhabha Road, Pashan, Pune
A. K. Sahai , Indian Institute of Tropical Meteorology Dr. Homi Bhabha Road, Pashan, Pune
A. Suryachandra Rao , Indian Institute of Tropical Meteorology Dr. Homi Bhabha Road, Pashan, Pune
Jeelani Smd , Indian Institute of Tropical Meteorology Dr. Homi Bhabha Road, Pashan, Pune
pp. 1-5

Energy aware colocation of workload in data centers (PDF)

Madhurima Pore , Impact Lab, School of Computing, Informatics and Decision Systems Engineering, ASU, Tempe, Arizona
Zahra Abbasi , Impact Lab, School of Computing, Informatics and Decision Systems Engineering, ASU, Tempe, Arizona
Sandeep K. S. Gupta , Impact Lab, School of Computing, Informatics and Decision Systems Engineering, ASU, Tempe, Arizona
Georgios Varsamopoulos , Impact Lab, School of Computing, Informatics and Decision Systems Engineering, ASU, Tempe, Arizona
pp. 1-6

Tool for performance tuning and regression analyses of HPC systems and applications (PDF)

Saumil Merchant , High Performance Computing, IBM India
Giri Prabhakar , High Performance Computing, IBM India
pp. 1-6
89 ms
(Ver )