Issue No.06 - Nov.-Dec. (2013 vol.15)
Scott Simmerman , Univ. of Tennessee, Knoxville, TN, USA
James Osborne , Univ. of Tennessee, Knoxville, TN, USA
Jian Huang , Univ. of Tennessee, Knoxville, TN, USA
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/MCSE.2012.92
As multiprocessor and multicore technology becomes prevalent, shared-memory architectures with 1,024 or more processing cores are becoming available for general-purpose applications. As an early operator of such a system, the Remote Data Analysis and Visualization (RDAV) center at the University of Tennessee observed a host of user applications needing to scale up their computation by running many concurrent instances of generic codes. This isn't a typical way of using high-performance computing systems, and naive solutions supporting such needs would cause significant issues that hamper system scalability and stability. The RDAV center's Eden software package helps manage large numbers of concurrent serial jobs with high throughput for any such application. Here, the authors describe the motivation and technical nature of Eden and report representative use cases they've participated in during the past two years.
Computer architecture, Career development, Program processors, High performance computing, Data analysis, Parallel processing, Operating systems,programming languages, Computer architecture, Career development, Program processors, High performance computing, Data analysis, Parallel processing, Operating systems, scientific computing, high-performance computing, scheduling, process management, operating systems, software, software engineering, multiprogramming, multiprocessing, multicore, concurrency, scripting languages
Scott Simmerman, James Osborne, Jian Huang, "Eden: Simplified Management of Atypical High-Performance Computing Jobs", Computing in Science & Engineering, vol.15, no. 6, pp. 46-54, Nov.-Dec. 2013, doi:10.1109/MCSE.2012.92