Search For:

Displaying 1-17 out of 17 total
Trends in High-Performance Computing
Found in: Computing in Science and Engineering
By Volodymyr Kindratenko, Pedro Trancoso
Issue Date:May 2011
pp. 92-95
<p>HPC system architectures are shifting from the traditional clusters of homogeneous nodes to clusters of heterogeneous nodes and accelerators.</p>
Application Acceleration with the Cell Broadband Engine
Found in: Computing in Science and Engineering
By Guochun Shi, Volodymyr Kindratenko, Frederico Pratas, Pedro Trancoso, Michael Gschwind
Issue Date:January 2010
pp. 76-81
<p>The Cell Broadband Engine is a heterogeneous chip multiprocessor that combines a PowerPC processor core with eight single-instruction multiple-data accelerator cores and delivers high performance on many computationally intensive codes.</p>
The Fat-Link Computation on Large GPU Clusters for Lattice QCD
Found in: 2012 Symposium on Application Accelerators in High Performance Computing (SAAHPC)
By Guochun Shi,Ronald Babich,Michael A. Clark,B'lint Joo,Steven Gottlieb,Volodymyr Kindratenko
Issue Date:July 2012
pp. 1-10
Graphics Processing Units (GPU) are becoming increasingly popular in high performance computing due to their high performance, high power efficiency and low cost. In this paper, we present results of an effort to implement the fatlink computation -- an imp...
Scientific Computing with GPUs
Found in: Computing in Science & Engineering
By Volodymyr Kindratenko
Issue Date:May 2012
pp. 8-9
This special issue attests to the widespread use of GPUs in the scientific computing community. Here the guest editor discusses the articles selected for this issue, and considers how they represent the range of possibilities (and risks) for using GPUs in ...
GPU acceleration of an image characterization algorithm for document similarity analysis
Found in: Computer Systems and Applications, ACS/IEEE International Conference on
By Guochun Shi,Volodymyr Kindratenko,Rob Kooper,Peter Bajcsy
Issue Date:December 2011
pp. 209-216
This paper aims to provide decision support for selecting software and hardware architecture for content-based document comparison. We evaluate Java, C, CUDA C and OpenCL implementations of an image characterization algorithm used for content-based documen...
Porting Optimized GPU Kernels to a Multi-core CPU: Computational Quantum Chemistry Application Example
Found in: Application Accelerators in High-Performance Computing, Symposium on
By Dong Ye,Alexey Titov,Volodymyr Kindratenko,Ivan Ufimtsev,Todd Martinez
Issue Date:July 2011
pp. 72-75
We investigate techniques for optimizing a multi-core CPU code back ported from a highly optimized GPU kernel. We show that common sub-expression elimination and loop unrolling optimization techniques improve code performance on the GPU, but not on the CPU...
Design of MILC Lattice QCD Application for GPU Clusters
Found in: Parallel and Distributed Processing Symposium, International
By Guochun Shi,Steven Gottlieb,Aaron Torok,Volodymyr Kindratenko
Issue Date:May 2011
pp. 363-371
We present an implementation of the improved staggered quark action lattice QCD computation designed for execution on a GPU cluster. The parallelization strategy is based on dividing the space-time lattice along the time dimension and distributing the sub-...
Guest Editor's Introduction: Special Issue on High-Performance Computing with Accelerators
Found in: IEEE Transactions on Parallel and Distributed Systems
By David A. Bader, David Kaeli, Volodymyr Kindratenko
Issue Date:January 2011
pp. 3-6
No summary available.
Quantifying the impact of GPUs on performance and energy efficiency in HPC clusters
Found in: International Conference on Green Computing
By Jeremy Enos, Craig Steffen, Joshi Fullop, Michael Showerman, Guochun Shi, Kenneth Esler, Volodymyr Kindratenko, John E. Stone, James C. Phillips
Issue Date:August 2010
pp. 317-324
We present an inexpensive hardware system for monitoring power usage of individual CPU hosts and externally attached GPUs in HPC clusters and the software stack for integrating the power usage data streamed in real-time by the power monitoring hardware wit...
High-Performance Computing with Accelerators
Found in: Computing in Science and Engineering
By Volodymyr Kindratenko, Robert Wilhelmson, Robert Brunner, Todd J. Martïnez, Wen-mei Hwu
Issue Date:July 2010
pp. 12-16
This issue of CiSE is based on work presented at the US National Science Foundation workshop, Path to Petascale: Adapting Geo/Chem/Astro Applications for Accelerators and Accelerator Clusters, held at the US National Center for Supercomputing Applications ...
Novel Computing Architectures
Found in: Computing in Science and Engineering
By Volodymyr Kindratenko
Issue Date:May 2009
pp. 54-57
We can make smaller transistors, but we have a hard time making them run faster. Yet, the demand for computing power is growing. This challenge has led to a renewed interest in novel computing architectures that can take advantage of shrinking transistor s...
Phoenix: A Runtime Environment for High Performance Computing on Chip Multiprocessors
Found in: Parallel, Distributed, and Network-Based Processing, Euromicro Conference on
By Avneesh Pant, Hassan Jafri, Volodymyr Kindratenko
Issue Date:February 2009
pp. 119-126
Execution of applications on upcoming high-performance computing (HPC) systems introduces a variety of new challenges and amplifies many existing ones. These systems will be composed of a large number of “fat” nodes, where each node consists of multiple pr...
Implementation of NAMD molecular dynamics non-bonded force-field on the cell broadband engine processor
Found in: Parallel and Distributed Processing Symposium, International
By Guochun Shi, Volodymyr Kindratenko
Issue Date:April 2008
pp. 1-8
We present results of porting an important kernel of a production molecular dynamics simulation program, NAMD, to the Cell/B.E. processor. The non-bonded force-field kernel, as implemented in the NAMD SPEC 2006 CPU benchmark, has been implemented. Both sin...
The Promise of High-Performance Reconfigurable Computing
Found in: Computer
By Tarek El-Ghazawi, Esam El-Araby, Miaoqing Huang, Kris Gaj, Volodymyr Kindratenko, Duncan Buell
Issue Date:February 2008
pp. 69-76
Several high-performance computers now use field-programmable gate arrays as reconfigurable coprocessors. The authors describe the two major contemporary HPRC architectures and explore the pros and cons of each using representative applications from remote...
Guest Editors' Introduction: High-Performance Reconfigurable Computing
Found in: Computer
By Duncan Buell, Tarek El-Ghazawi, Kris Gaj, Volodymyr Kindratenko
Issue Date:March 2007
pp. 23-27
High-performance reconfigurable computers have the potential to exploit coarse-grained functional parallelism as well as fine-grained instruction-level parallelism through direct hardware execution on FPGAs.
A case study in porting a production scientific supercomputing application to a reconfigurable computer
Found in: Field-Programmable Custom Computing Machines, Annual IEEE Symposium on
By Volodymyr Kindratenko, David Pointer
Issue Date:April 2006
pp. 13-22
This case study presents the results of porting a production scientific code, called NAMD, to the SRC-6 high-performance reconfigurable computing platform based on Field Programmable Gate Array (FPGA) technology. NAMD is a molecular dynamics code designed ...
The Visible Radio: Process Visualization of a Software-Defined Radio
Found in: Visualization Conference, IEEE
By Matthew Hall, Alex Betts, Donna Cox, David Pointer, Volodymyr Kindratenko
Issue Date:October 2005
pp. 21
In this case study, a data-oriented approach is used to visualize a complex digital signal processing pipeline. The pipeline implements a Frequency Modulated (FM) Software-Defined Radio (SDR). SDR is an emerging technology where portions of the radio hardw...