Search For:

Displaying 1-10 out of 10 total
PM/InfiniBand-FJ: A High Performance Communication Facility Using InfiniBand for Large Scale PC Clusters
Found in: High Performance Computing and Grid in Asia Pacific Region, International Conference on
By Shinji Sumimoto, Akira Naruse, Kouichi Kumon, Kouji Hosoe, Toshiyuki Shimizu
Issue Date:July 2004
pp. 104-113
This paper describes a design of high performance communication facility called the PM/InfiniBand-FJ using InfiniBand interconnect for large scale PC clusters. The PM/InfiniBand-FJ has developed to realize higher application performance than commercial sup...
Tofu: A 6D Mesh/Torus Interconnect for Exascale Computers
Found in: Computer
By Yuichiro Ajima, Shinji Sumimoto, Toshiyuki Shimizu
Issue Date:November 2009
pp. 36-40
A new architecture with a six-dimensional mesh/torus topology achieves highly scalable and fault-tolerant interconnection networks for large-scale supercomputers that can exceed 10 petaflops.
PACS-CS: A Large-Scale Bandwidth-Aware PC Cluster for Scientific Computations
Found in: Cluster Computing and the Grid, IEEE International Symposium on
By Taisuke Boku, Mitsuhisa Sato, Akira Ukawa, Daisuke Takahashi, Shinji Sumimoto, Kouichi Kumon, Takashi Moriyama, Masaaki Shimizu
Issue Date:May 2006
pp. 233-240
We have been developing a large scale PC cluster named PACS-CS (Parallel Array Computer System for Computational Sciences) at Center for Computational Sciences, University of Tsukuba, for wide variety of computational science applications such as computati...
PM2: High Performance Communication Middleware for Heterogeneous Network Environments
Found in: SC Conference
By Toshiyuki Takahashi, Shinji Sumimoto, Atsushi Hori, Hiroshi Harada, Yutaka Ishikawa
Issue Date:November 2000
pp. 16
This paper introduces a high performance communication middle layer, called PM2, for hetero-geneous network environments. PM2 currently supports Myrinet, Ethernet, and SMP. Binary code written in PM2 or written in a communication library, such as MPICH-SCo...
Dynamic Home Node Reallocation on Software Distributed Shared Memory
Found in: High-Performance Computing in the Asia-Pacific Region, International Conference on
By Hiroshi Harada, Yutaka Ishikawa, Atsushi Hori, Hiroshi Tezuka, Shinji Sumimoto, Toshiyuki Takahashi
Issue Date:May 2000
pp. 158
This paper proposes a dynamic home node reallocation mechanism in a software distributed system memory system to reduce the communication overhead at the memory barrier synchronization point. The mechanism has been implemented in a software distributed mem...
Parallel C++ Programming System on Cluster of Heterogeneous Computers
Found in: Heterogeneous Computing Workshop
By Yutaka Ishikawa, Atsushi Hori, Hiroshi Tezuka, Shinji Sumimoto, Toshiyuki Takahashi, Hiroshi Harada
Issue Date:April 1999
pp. 73
A parallel programming system, called MPC++, provides parallel primitives such as a remote function invocation, a global pointer, and a synchronization structure using the C++ template feature. The system has run on a cluster of homogeneous computers. In t...
PM/Ehernet-kRMA: A High Performance Remote Memory Access Facility Using Multiple Gigabit Ethernet Cards
Found in: Cluster Computing and the Grid, IEEE International Symposium on
By Shinji Sumimoto, Kouichi Kumon
Issue Date:May 2003
pp. 326
This paper proposes a high performance communication facility using multiple commodity network interface cards (NICs). Called PM/Ethernet-kRMA, it is NIC-hardware-independent and provides (k)ernel-level Remote Memory Access (kRMA) on multiple NICs. The PM/...
High Performance Communication using a Commodity Network for Cluster Systems
Found in: High-Performance Distributed Computing, International Symposium on
By Shinji Sumimoto, Hiroshi Tezuka, Atsushi Hori, Hiroshi Harada, Toshiyuki Takahashi, Yutaka Ishikawa
Issue Date:August 2000
pp. 139
This paper proposes a scheme to realize a high performance communication facility using a commodity network. This scheme does not require any special hardware or hardware specific device drivers in order to adapt to many kinds of network interface cards. I...
Dynamic memory usage analysis of MPI libraries using DMATP-MPI
Found in: Proceedings of the 20th European MPI Users' Group Meeting (EuroMPI '13)
By Hideyuki Akimoto, Kenichi Miura, Shinji Sumimoto, Takayuki Okamoto, Tomoya Adachi, Yuichiro Ajima
Issue Date:September 2013
pp. 149-150
This paper presents dynamic memory usage of Open MPI by DMATP-MPI dynamic memory usage analysis tool. The DMATP-MPI is developed to reduce memory usage of MPI communication library. The evaluation results show that the memory usage of MPI Init function inc...
The design and evaluation of high performance communication using a Gigabit Ethernet
Found in: Proceedings of the 13th international conference on Supercomputing (ICS '99)
By Atsushi Hori, Hiroshi Harada, Hiroshi Tezuka, Shinji Sumimoto, Toshiyuki Takahashi, Yutaka Ishikawa
Issue Date:June 1999
pp. 260-267
To minimize the amount of computation and storage for parallel sparse factorization, sparse matrices have to be reordered prior to factorization. We show that none of the popular ordering heuristics proposed before, namely, mulitple minimum degree and nest...