18th Annual International Conference on High Performance Computing (2011)
Bengaluru, India
Dec. 18, 2011 to Dec. 21, 2011
ISBN: 978-1-4577-1951-6
TABLE OF CONTENTS
Papers

Program (PDF)

pp. 1-3

Author index (PDF)

pp. 1-8

A multi-GPU algorithm for communication in neuronal network simulations (Abstract)

Raphael Y. de Camargo , Center for Mathematics, Computation and Cognition, Universidade Federal do ABC (UFABC), Brazil
pp. 1-10

Comparing archival policies for Blue Waters (Abstract)

Loris Marchal , LIP laboratory, CNRS, INRIA, ENS-Lyon & University of Lyon, France
Franck Cappello , INRIA-UIUC joint laboratory for Petascale Computing, France
Mathias Jacquelin , LIP laboratory, CNRS, INRIA, ENS-Lyon & University of Lyon, France
Yves Robert , LIP laboratory, CNRS, INRIA, ENS-Lyon & University of Lyon, France
Marc Snir , INRIA-UIUC joint laboratory for Petascale Computing, France
pp. 1-10

Hybrid algorithms for list ranking and graph connected components (Abstract)

Dip Sankar Banerjee , International Institute of Information Technology, Hyderabad, Gachibowli, Hyderabad, India - 500 032
Kishore Kothapalli , International Institute of Information Technology, Hyderabad, Gachibowli, Hyderabad, India - 500 032
pp. 1-10

Parallel multiple precision division by a single precision divisor (Abstract)

Charles Weems , Computer Science Department, University of Massachusetts Amherst, MA 01003-4610, USA
Niall Emmart , Computer Science Department, University of Massachusetts Amherst, MA 01003-4610, USA
pp. 1-9

Scalable clustering using multiple GPUs (Abstract)

P. J. Narayanan , Center for Visual Information Technology, International Institute of Information Technology, Hyderabad, India
Mohiuddin K. Wasif , Center for Visual Information Technology, International Institute of Information Technology, Hyderabad, India
pp. 1-10

Hybrid implementation of error diffusion dithering (Abstract)

Aditya Deshpande , Centre for Visual Information Technology, International Institute of Information Technology, Hyderabad, India
P. J. Narayanan , Centre for Visual Information Technology, International Institute of Information Technology, Hyderabad, India
Ishan Misra , Centre for Visual Information Technology, International Institute of Information Technology, Hyderabad, India
pp. 1-10

Porting irregular reductions on heterogeneous CPU-GPU configurations (Abstract)

Gagan Agrawal , Department of Computer Science and Engineering, The Ohio State University, Columbus, OH 43210
Vignesh T. Ravi , Department of Computer Science and Engineering, The Ohio State University, Columbus, OH 43210
Xin Huo , Department of Computer Science and Engineering, The Ohio State University, Columbus, OH 43210
pp. 1-10

Building algorithmically nonstop fault tolerant MPI programs (Abstract)

Rui Wang , State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences
Guangming Tan , State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences
Pavan Balaji , Mathematics and Computer Science, Argonne National Laboratory
Darius Buntinas , Mathematics and Computer Science, Argonne National Laboratory
Mingyu Chen , State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences
Erlin Yao , State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences
pp. 1-9

High-level template for the task-based parallel wavefront pattern (Abstract)

Francisco Corbera , Dept. of Computer Architecture, University of Malaga, Spain
Antonio J. Dios , Dept. of Computer Architecture, University of Malaga, Spain
Rafael Asenjo , Dept. of Computer Architecture, University of Malaga, Spain
Emilio L. Zapata , Dept. of Computer Architecture, University of Malaga, Spain
Angeles Navarro , Dept. of Computer Architecture, University of Malaga, Spain
pp. 1-10

Enabling CUDA acceleration within virtual machines using rCUDA (Abstract)

Antonio J. Pena , D. Informática de Sistemas y Computadores, Universitat Politècnica de València, Camino de Vera s/n, 46022 Valencia, Spain
Enrique S. Quintana-Orti , D. Ingeniería y Ciencia de los Computadores, Universitat Jaume I, Av. Vicente Sos Baynat s/n, 12071 Castellón, Spain
Juan C. Fernandez , D. Ingeniería y Ciencia de los Computadores, Universitat Jaume I, Av. Vicente Sos Baynat s/n, 12071 Castellón, Spain
Rafael Mayo , D. Ingeniería y Ciencia de los Computadores, Universitat Jaume I, Av. Vicente Sos Baynat s/n, 12071 Castellón, Spain
Federico Silla , D. Informática de Sistemas y Computadores, Universitat Politècnica de València, Camino de Vera s/n, 46022 Valencia, Spain
Jose Duato , D. Informática de Sistemas y Computadores, Universitat Politècnica de València, Camino de Vera s/n, 46022 Valencia, Spain
pp. 1-10

Parallel implementation of MOPSO on GPU using OpenCL and CUDA (Abstract)

Jambhlekar Pushkar Arun , E&CE Department, IIT Roorkee, India
Manoj Mishra , E&CE Department, IIT Roorkee, India
Sheshasayee V. Subramaniam , High Performance Computing Lab, IBM STG India
pp. 1-10

Coordination mechanisms for selfish multi-organization scheduling (Abstract)

Johanne Cohen , PRiSM, Université de Versailles St-Quentin-en-Yvelines, Versailles, France
Frederic Wagner , Grenoble Technical University, Montbonnot Saint-Martin, France
Daniel Cordeiro , LIG, Grenoble University, Montbonnot Saint-Martin, France
Denis Trystram , Grenoble Technical University, Montbonnot Saint-Martin, France
pp. 1-9

Maximizing throughput of jobs with multiple resource requirements (Abstract)

Sambuddha Roy , IBM Research, New Delhi, India
Venkatesan T. Chakaravarthy , IBM Research, New Delhi, India
Yogish Sabharwal , IBM Research, New Delhi, India
Neha Sengupta , IBM Research, New Delhi, India
pp. 1-9

Weighted locality-sensitive scheduling for mitigating noise on multi-core clusters (Abstract)

William D. Gropp , Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
Abhinav Bhatele , Center for Applied Scientific Computing, Lawrence Livermore National Laboratory, Livermore, CA 94551, USA
Vivek Kale , Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
pp. 1-10

Scheduling diverse high performance computing systems with the goal of maximizing utilization (Abstract)

R. Glenn Brook , National Institute for Computational Sciences, University of Tennessee, Oak Ridge, Tennessee, USA
Patricia Kovatch , National Institute for Computational Sciences, University of Tennessee, Oak Ridge, Tennessee, USA
Troy Baer , National Institute for Computational Sciences, University of Tennessee, Oak Ridge, Tennessee, USA
Tabitha K. Samuel , National Institute for Computational Sciences, University of Tennessee, Oak Ridge, Tennessee, USA
Matt Ezell , National Institute for Computational Sciences, University of Tennessee, Oak Ridge, Tennessee, USA
pp. 1-6

A dynamic scheduling framework for emerging heterogeneous systems (Abstract)

Gagan Agrawal , Department of Computer Science and Engineering, The Ohio State University, Columbus, OH 43210
Vignesh T. Ravi , Department of Computer Science and Engineering, The Ohio State University, Columbus, OH 43210
pp. 1-10

GVT algorithms and discrete event dynamics on 129K+ processor cores (Abstract)

Vinod Tipparaju , Oak Ridge National Laboratory, Oak Ridge, Tennessee, USA
Alfred J. Park , Oak Ridge National Laboratory, Oak Ridge, Tennessee, USA
Kalyan S. Perumalla , Oak Ridge National Laboratory, Oak Ridge, Tennessee, USA
pp. 1-11

Improving graph coloring on distributed-memory parallel computers (Abstract)

Ahmet Erdem Sariyuce , Department of Biomedical Informatics, The Ohio State University
Umit V. Catalyurek , Department of Biomedical Informatics, The Ohio State University
Erik Saule , Department of Biomedical Informatics, The Ohio State University
pp. 1-10

Modelling and analyzing the authorization and execution of video workflows (Abstract)

Chenlin Huang , Institute of Software, School of Computer Science, National University of Defense Technology, Changsha, China
Ligang He , Department of Computer Science, University of Warwick, Coventry, UK
Jianhua Sun , School of Computer and Communication, Hunan University, Changsha, China
Stephen A. Jarvis , Department of Computer Science, University of Warwick, Coventry, UK
Hao Chen , School of Computer and Communication, Hunan University, Changsha, China
Kewei Duan , Department of Computer Science, University of Bath, Bath, UK
Bo Gao , Department of Computer Science, University of Warwick, Coventry, UK
Kenli Li , School of Computer and Communication, Hunan University, Changsha, China
pp. 1-10

Multi-model prediction for enhancing content locality in elastic server infrastructures (Abstract)

Juan M. Tirado , Computer Architecture and Technology Area, Universidad Carlos III, Madrid, Spain
Florin Isaila , Computer Architecture and Technology Area, Universidad Carlos III, Madrid, Spain
Daniel Higuero , Computer Architecture and Technology Area, Universidad Carlos III, Madrid, Spain
Jesus Carretero , Computer Architecture and Technology Area, Universidad Carlos III, Madrid, Spain
pp. 1-9

Highly scalable barriers for future high-performance computing clusters (Abstract)

Alexander Giese , University of Heidelberg, Germany
Holger Froning , University of Heidelberg, Germany
Jose Duato , Universitat Politècnica de València, Spain
Federico Silla , Universitat Politècnica de València, Spain
Hector Montaner , Universitat Politècnica de València, Spain
pp. 1-10

Spectral evolution simulation on leading multi-socket, multicore platforms (Abstract)

Petar Mimica , Department of Astronomy and Astrophysics, University of Valencia, Valencia, Spain
Luis F. Romero , Department of Computer Architecture, University of Malaga, 29071 Malaga, Spain
Siham Tabik , Department of Computer Architecture, University of Malaga, 29071 Malaga, Spain
Emilio Zapata , Department of Computer Architecture, University of Malaga, 29071 Malaga, Spain
Oscar Plata , Department of Computer Architecture, University of Malaga, 29071 Malaga, Spain
pp. 1-10

Dynamic hosting management of web based applications over clouds (Abstract)

Georgios Varsamopoulos , Impact Lab, School of Computing, Informatics and Decision Systems Engineering, ASU, Tempe, AZ
Zahra Abbasi , Impact Lab, School of Computing, Informatics and Decision Systems Engineering, ASU, Tempe, AZ
Sandeep K. S. Gupta , Impact Lab, School of Computing, Informatics and Decision Systems Engineering, ASU, Tempe, AZ
Tridib Mukherjee , Xerox Research Center, Webster, NY
pp. 1-10

A fast centralized computation routing algorithm for self-configuring NoC systems (Abstract)

Francisco Trivino , Universidad de Castilla-La Mancha, Campus Universitario, s/n 02071, Albacete, Spain
Francisco J. Alfaro , Universidad de Castilla-La Mancha, Campus Universitario, s/n 02071, Albacete, Spain
Jose Flich , Universitat Politècnica de València, Department of Computer Engineering
Jose L. Sanchez , Universidad de Castilla-La Mancha, Campus Universitario, s/n 02071, Albacete, Spain
pp. 1-10

Partial globalization of partitioned address spaces for zero-copy communication with shared memory (Abstract)

Nilesh Mahajan , School of Informatics and Computing, Indiana University, Bloomington, Indiana 47405
Andrew Lumsdaine , School of Informatics and Computing, Indiana University, Bloomington, Indiana 47405
Fangzhou Jiao , School of Informatics and Computing, Indiana University, Bloomington, Indiana 47405
Arun Chauhan , School of Informatics and Computing, Indiana University, Bloomington, Indiana 47405
Jeremiah Willcock , School of Informatics and Computing, Indiana University, Bloomington, Indiana 47405
pp. 1-10

Multi-threaded UPC runtime with network endpoints: Design alternatives and evaluation on multi-core architectures (Abstract)

Miao Luo , Department of Computer Science and Engineering, The Ohio State University
Sayantan Sur , Department of Computer Science and Engineering, The Ohio State University
Dhabaleswar K. Panda , Department of Computer Science and Engineering, The Ohio State University
Jithin Jose , Department of Computer Science and Engineering, The Ohio State University
pp. 1-10

Increasing the energy efficiency of TLS systems using intermediate checkpointing (Abstract)

Marcelo Cintra , School of Informatics, University of Edinburgh
Polychronis Xekalakis , Intel Labs Barcelona - UPC
Nikolas Ioannou , School of Informatics, University of Edinburgh
Salman Khan , School of Computer Science, University of Manchester
pp. 1-10

A machine learning-based approach for thread mapping on transactional memory applications (Abstract)

Murray Cole , School of Informatics - ICSA - CARD Group - University of Edinburgh, UK
Marcio Castro , INRIA - LIG Laboratory - Grenoble University, France
Luis Fabricio Wanderley Goes , School of Informatics - ICSA - CARD Group - University of Edinburgh, UK
Jean-Francois Mehaut , INRIA - LIG Laboratory - Grenoble University, France
Christiane Pousa Ribeiro , INRIA - LIG Laboratory - Grenoble University, France
Marcelo Cintra , School of Informatics - ICSA - CARD Group - University of Edinburgh, UK
pp. 1-10

Robust thread-level speculation (Abstract)

Diego R. Llanos , Dpto. de Informática, Univ. de Valladolid, Spain
Arturo Gonzalez-Escribano , Dpto. de Informática, Univ. de Valladolid, Spain
Alvaro Garcia-Yaguez , Dpto. de Informática, Univ. de Valladolid, Spain
pp. 1-11

Implementing a hybrid SRAM / eDRAM NUCA architecture (Abstract)

David Brooks , School of Engineering and Applied Sciences, Harvard University, 02138 Cambridge, MA (USA)
Carlos Molina , Dept. of Computer Engineering, Universitat Rovira i Virgili, 43007 Tarragona, Spain
Javier Lira , Dept. of Computer Architecture, Universitat Politècnica de Catalunya, 08034 Barcelona, Spain
Antonio Gonzalez , Intel Barcelona Research Center, Intel Labs - UPC, 08034 Barcelona, Spain
pp. 1-10

High performance cache block replication using re-reference probability in CMPs (Abstract)

Jinglei Wang , Tsinghua National Laboratory for Information Science and Technology, Tsinghua University, Beijing 100084, China
Haixia Wang , Tsinghua National Laboratory for Information Science and Technology, Tsinghua University, Beijing 100084, China
Yibo Xue , Tsinghua National Laboratory for Information Science and Technology, Tsinghua University, Beijing 100084, China
Dongsheng Wang , Tsinghua National Laboratory for Information Science and Technology, Tsinghua University, Beijing 100084, China
pp. 1-10

Adaptive memory power management techniques for HPC workloads (Abstract)

Francesc Guim , Intel Barcelona, Barcelona, Spain
Manish Parashar , Center for Autonomic Computing, Rutgers University, Piscataway NJ, USA
Ivan Rodero , Center for Autonomic Computing, Rutgers University, Piscataway NJ, USA
Karthik Elangovan , Center for Autonomic Computing, Rutgers University, Piscataway NJ, USA
Isaac Hernandez , Intel Barcelona, Barcelona, Spain
pp. 1-11

Compute & memory optimizations for high-quality speech recognition on low-end GPU processors (Abstract)

Kshitij Gupta , Department of Electrical & Computer Engineering, University of California, Davis, One Shields Avenue, Davis, California, USA
John D. Owens , Department of Electrical & Computer Engineering, University of California, Davis, One Shields Avenue, Davis, California, USA
pp. 1-10

Dynamic selection of tile sizes (Abstract)

Sanket Tavarageri , Dept. of Computer Science and Engineering, The Ohio State University, 2015 Neil Ave, Columbus, OH, USA
Atanas Rountev , Dept. of Computer Science and Engineering, The Ohio State University, 2015 Neil Ave, Columbus, OH, USA
P. Sadayappan , Dept. of Computer Science and Engineering, The Ohio State University, 2015 Neil Ave, Columbus, OH, USA
J. Ramanujam , Dept. of Electrical & Computer Engineering, Louisiana State University, Baton Rouge, LA, USA
Louis-Noel Pouchet , Dept. of Computer Science and Engineering, The Ohio State University, 2015 Neil Ave, Columbus, OH, USA
pp. 1-10

The impact of hyper-threading on processor resource utilization in production applications (Abstract)

Piyush Mehrotra , NASA Advanced Supercomputing Division, NASA Ames Research Center, Moffett Field, CA 94035-1000, USA
Haoqiang Jin , NASA Advanced Supercomputing Division, NASA Ames Research Center, Moffett Field, CA 94035-1000, USA
David Barker , NASA Advanced Supercomputing Division, NASA Ames Research Center, Moffett Field, CA 94035-1000, USA
Subhash Saini , NASA Advanced Supercomputing Division, NASA Ames Research Center, Moffett Field, CA 94035-1000, USA
Rupak Biswas , NASA Advanced Supercomputing Division, NASA Ames Research Center, Moffett Field, CA 94035-1000, USA
Robert Hood , NASA Advanced Supercomputing Division, NASA Ames Research Center, Moffett Field, CA 94035-1000, USA
pp. 1-10

Optimizing multicore performance with message driven execution: A case study (Abstract)

Laxmikant V. Kale , Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
Pritish Jetley , Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
pp. 1-10

Reliable and randomized data distribution strategies for large scale storage systems (Abstract)

Toni Cortes , Barcelona Supercomputing Center (BSC), Barcelona, Spain
Alberto Miranda , Barcelona Supercomputing Center (BSC), Barcelona, Spain
Ethan L. Miller , University of California, Santa Cruz, CA, USA
Yangwook Kang , University of California, Santa Cruz, CA, USA
Sascha Effert , University of Paderborn, Paderborn, Germany
Andre Brinkmann , University of Paderborn, Paderborn, Germany
pp. 1-10

Supporting computational data model representation with high-performance I/O in parallel netCDF (Abstract)

Alok Choudhary , Electrical Engineering and Computer Science Department, Northwestern University
Chen Jin , Electrical Engineering and Computer Science Department, Northwestern University
Wei-keng Liao , Electrical Engineering and Computer Science Department, Northwestern University
Kui Gao , Electrical Engineering and Computer Science Department, Northwestern University
pp. 1-10

A multiresolution data model for improving simulation I/O performance (Abstract)

R. Daniel Bergeron , Computer Science Department, University of New Hampshire, Durham, NH 03824
Andrew Foulks , Computer Science Department, University of New Hampshire, Durham, NH 03824
pp. 1-10