Task Scheduling Greedy Heuristics for GPU Heterogeneous Cluster Involving the Weights of the Processor
Found in: 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)
By Keliang Zhang, Baifeng Wu
Issue Date:May 2013
pp. 1817-1827
Modern GPUs are gradually used by more and more cluster computing systems as the high performance computing units due to their outstanding computational power, whereas bringing system-level (among different nodes) architectural heterogeneity to cluster. In...
 
Task Scheduling for GPU Heterogeneous Cluster
Found in: 2012 IEEE International Conference on Cluster Computing Workshops and Posters (CLUSTER WORKSHOPS)
By Keliang Zhang, Baifeng Wu
Issue Date:September 2012
pp. 161-169
Modern GPUs are gradually used by more and more cluster computing systems as the high performance computing units due to their outstanding computational power, whereas bringing node-level architectural heterogeneity to cluster. In this paper, based on MPI ...
 
GPGPU Memory Estimation and Optimization Targeting OpenCL Architecture
Found in: 2012 IEEE International Conference on Cluster Computing (CLUSTER)
By Junfeng Zhu, Gang Chen, Baifeng Wu
Issue Date:September 2012
pp. 449-458
The enormous computational power available in modern graphics processing units (GPUs) has enabled the widely use of them for general-purpose applications. However, manual development of high-performance parallel codes for GPUs is still very challenging. In...
 
A Novel Parallel Approach of Radix Sort with Bucket Partition Preprocess
Found in: 2012 IEEE 14th Int'l Conf. on High Performance Computing and Communication (HPCC) & 2012 IEEE 9th Int'l Conf. on Embedded Software and Systems (ICESS)
By Keliang Zhang, Baifeng Wu
Issue Date:June 2012
pp. 989-994
Radix sort is an important sorting algorithm which is widely used in applications such as binary search and database. The most important advantage of radix sort is its time complexity is O(n), lower than other sorting algorithms based on comparison opera...
 
Parallel Sparse Matrix Multiplication for Preconditioning and SSTA on a Many-Core Architecture
Found in: 2012 IEEE 7th International Conference on Networking, Architecture, and Storage (NAS)
By Keliang Zhang, Baifeng Wu
Issue Date:June 2012
pp. 59-68
Operations related to Sparse matrix multiplication are frequently used in scientific computing area, and these operations usually become a performance bottleneck because of their high operational complexity. For example, sparse matrix multiplying diagonal ...
 
A self-embedded watermarking scheme based on relationship function of corresponding inter-blocks DCT coefficient
Found in: International Conference on Computer Supported Cooperative Work in Design
By Guobo Li, Songwen Pei, Gang Chen, Wenjun Cao, Baifeng Wu
Issue Date:April 2009
pp. 107-112
In the realm of Computer Supported Cooperative Work in Design (CSCWD), how to ensure the authenticity and integrity of an image plays important roles. This paper presents a novel semi-fragile image watermarking scheme for authenticating and recovering imag...
 
GPGPU supported cooperative acceleration in molecular dynamics
Found in: International Conference on Computer Supported Cooperative Work in Design
By Gang Chen, Guobo Li, Songwen Pei, Baifeng Wu
Issue Date:April 2009
pp. 113-118
Molecular dynamics simulations have become a significant computational approach to study complicated physical phenomena at the atomic level. Nevertheless, accurate simulations are limited in size and timescale by the available computing resources, which ma...
 
SpMT WaveCache: Exploiting Thread-Level Parallelism in WaveScalar
Found in: Computer Science and Information Engineering, World Congress on
By Songwen Pei, Baifeng Wu, Min Du, Gang Chen, Leandro A.J. Marzulo, Felipe M.G. Franca
Issue Date:April 2009
pp. 530-535
Speculative Multithreading (SpMT) increases the performance by means of executing multiple threads speculatively to exploit thread-level parallelism. By combining software and hardware approaches, we have improved the capabilities of previous WaveScalar IS...
 
Codec System Design for Continuous Color Barcode Symbols
Found in: Computer and Information Technology, IEEE 8th International Conference on
By Songwen Pei, Guobo Li, Baifeng Wu
Issue Date:July 2008
pp. 539-544
To address the limits of capacity and the disadvantages of motional scanning capability of traditional two-dimensional barcodes, a novel 3-D Continuous Barcode (3-D CoBe) is proposed. It adds color as a third dimension based on our previous research on 2-D...
 
Extended-DDF Modeling Embedded System Design: Adapting to IP Technology
Found in: Computer and Information Technology, International Conference on
By Yi Jiao, Chenglin Guo, Baifeng Wu, Hui Luo
Issue Date:September 2005
pp. 829-835
This paper extends DDF (Dynamic Data Flow graph) to a large-grain model which can guide system design in function level. The adoption of a novel algorithm, as well as a formal definition for cDDF, strengthens the capacity of DDF as a formal specif...
 
Test by Distributed Monitoring
Found in: Asian Test Symposium
By Chenglian Peng, Baifeng Wu, Xiaoguang Sun
Issue Date:November 1999
pp. 218
With the evolvement of computer technology, both software systems and hardware systems have been developed into large-scale and sophisticated infrastructure. To test these systems, traditional static analysis methods have proved inefficient. This paper int...
 
HW/SW Co-Design for Embedded System Based on DDF
Found in: Information Science and Engineering, International Conference on
By Songwen Pei, Gang Chen, Guobo Li, Baifeng Wu
Issue Date:December 2009
pp. 575-578
Embedded systems are targeted for specific applications under constraints on relative timing of their actions. In this paper, we proposed a co-synthesis approach to hardware/software (HW/SW) co-design for embedded system based on Dynamic Data Flow (DDF). F...
 
High Performance Computing via a GPU
Found in: Information Science and Engineering, International Conference on
By Gang Chen, Guobo Li, Songwen Pei, Baifeng Wu
Issue Date:December 2009
pp. 238-241
Graphics processor units (GPUs), such as the AMD FireStream series, offer a tremendous computing power that is frequently an order of magnitude larger than even the most modern multi-core CPUs, making them an attractive platform for high performance comput...
 