Search For:

Displaying 1-7 out of 7 total
“Cool” Load Balancing for High Performance Computing Data Centers
Found in: IEEE Transactions on Computers
By Osman Sarood,Phil Miller,Ehsan Totoni,Laxmikant V. Kalé
Issue Date:December 2012
pp. 1752-1764
As we move to exascale machines, both peak power demand and total energy consumption have become prominent challenges. A significant portion of that power and energy consumption is devoted to cooling, which we strive to minimize in this work. We propose a ...
Scalable Algorithms for Distributed-Memory Adaptive Mesh Refinement
Found in: 2012 24th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)
By Akhil Langer,Jonathan Lifflander,Phil Miller,Kuo-Chuan Pan,Laxmikant V. Kale,Paul Ricker
Issue Date:October 2012
pp. 100-107
This paper presents scalable algorithms and data structures for adaptive mesh refinement computations. We describe a novel mesh restructuring algorithm for adaptive mesh refinement computations that uses a constant number of collectives regardless of the r...
Mapping Dense LU Factorization on Multicore Supercomputer Nodes
Found in: 2012 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)
By Jonathan Lifflander,Phil Miller,Ramprasad Venkataraman,Anshu Arya,Laxmikant Kale,Terry Jones
Issue Date:May 2012
pp. 596-606
Dense LU factorization is a prominent benchmark used to rank the performance of supercomputers. Many implementations use block-cyclic distributions of matrix blocks onto a two-dimensional process grid. The process grid dimensions drive a trade-off between ...
Asynchronous Collective Output with Non-dedicated Cores
Found in: Cluster Computing, IEEE International Conference on
By Phil Miller,Shen Li,Chao Mei
Issue Date:September 2011
pp. 494-502
Parallel applications are evolving to place larger demands not just on computation and network capabilities, but on storage systems as well. Storage hardware has scaled to keep up, but the software to drive it must evolve alongside to service this increase...
Using Shared Arrays in Message-Driven Parallel Programs
Found in: Parallel and Distributed Processing Workshops and PhD Forum, 2011 IEEE International Symposium on
By Phil Miller,Aaron Becker,Laxmikant Kalé
Issue Date:May 2011
pp. 1142-1150
This paper describes a safe and efficient combination of the object-based message-driven execution and shared array parallel programming models. In particular, we demonstrate how this combination engenders the composition of loosely coupled parallel module...
Adoption protocols for fanout-optimal fault-tolerant termination detection
Found in: Proceedings of the 18th ACM SIGPLAN symposium on Principles and practice of parallel programming (PPoPP '13)
By Jonathan Lifflander, Laxmikant Kale, Phil Miller
Issue Date:February 2013
pp. 13-22
Termination detection is relevant for signaling completion (all processors are idle and no messages are in flight) of many operations in distributed systems, including work stealing algorithms, dynamic data exchange, and dynamically structured computations...
The educational issues confronting Ada (Panel Discussion)
Found in: Proceedings of the thirteenth SIGCSE technical symposium on Computer science education (SIGCSE '82)
By Grady Booch, Hal Hart, Peter Wegner, Phil Miller, Vance Mall
Issue Date:February 1982
pp. 261
Sponsored by the Department of Defense, Ada is a programming language that embodies and enforces many modern software methodologies. Thus, the introduction of the language gives the opportunity for improvements in software reliability, maintainability, and...