Displaying 1-31 out of 31 total
A layered design methodology of cluster system stack
Found in: Cluster Computing, IEEE International Conference on
By Jianfeng Zhan, Lei Wang, Bibo Tu, Zhihong Zhang, Yu Wen, Yuansheng Chen, Wei Zhou, Dan Meng, Ninghui Sun
Issue Date:September 2007
pp. 404-409
The application range of cluster has expanded beyond scientific computing, but the present cluster system software fails to provide a flexible architecture to promote code reuse and facilitate building cluster system software for different computing contex...
 
Precise, Scalable, and Online Request Tracing for Multitier Services of Black Boxes
Found in: IEEE Transactions on Parallel and Distributed Systems
By Bo Sang,Jianfeng Zhan,Gang Lu,Haining Wang,Dongyan Xu,Lei Wang,Zhihong Zhang,Zhen Jia
Issue Date:June 2012
pp. 1159-1167
As more and more multitier services are developed from commercial off-the-shelf components or heterogeneous middleware without source code available, both developers and administrators need a request tracing tool to 1) exactly know how a user request of in...
 
A Dynamic Provisioning Framework for Multi-tier Internet Applications in Virtualized Data Center
Found in: Parallel and Distributed Computing Applications and Technologies, International Conference on
By Yi Jin, Xu Liu, Jianfeng Zhan, Shuang Gao
Issue Date:December 2008
pp. 329-332
With the resurgence of virtualization technology, today’s Internet data centers are shifting towards virtualized data centers. Internet applications tend to see dynamically varying workloads. To address the problem of performance management for multitier a...
 
Grid Unit: A Self-Managing Building Block for Grid System
Found in: Parallel and Distributed Computing Applications and Technologies, International Conference on
By Jianfeng Zhan, Lei Wang, Ming Zou, Hui Wang, Shuang Gao, Yulei Ding
Issue Date:December 2007
pp. 303-310
Grid system software is inherently complex, hard to build and maintain. In this paper, we propose a self-managing building block: Grid Unit, which facilitates constructing Grid system with higher availability and lower management overhead. We present an a...
 
PWP: a Cluster Web Portal based on MVC
Found in: High Performance Computing and Grid in Asia Pacific Region, International Conference on
By Yan Hao, Bibo Tu, Jianfeng Zhan, Dan Meng
Issue Date:December 2005
pp. 235-239
How to provide better services for cluster users along with the cluster technology gains ground? An integrated cluster OS with friendly user environment is a trend. On the basis of Fire-Phoenix, a fully integrated, highly reliable and scalable cluster OS, ...
 
Parallel Streamline Placement for 2D Flow Fields
Found in: IEEE Transactions on Visualization and Computer Graphics
By Wenyao Zhang, Yi Wang, Jianfeng Zhan, Beichen Liu, Jianguo Ning
Issue Date:July 2013
pp. 1185-1198
Parallel streamline placement is still an open problem in flow visualization. In this paper, we propose an innovative method to place streamlines in parallel for 2D flow fields. This method is based on our proposed concept of local tracing areas (LTAs). An...
 
Multi-scale Entropy: One Metric of Software Aging
Found in: 2013 IEEE 7th International Symposium on Service Oriented System Engineering (SOSE)
By Pengfei Chen,Yong Qi,Pengfei Zheng,Jianfeng Zhan,Yihan Wu
Issue Date:March 2013
pp. 162-169
The phenomena of service performance or availability degradation have been widely observed in the long running software systems, which is called 'software aging'. It's hard to measure software aging due to the inherent complexity and dynamic of software s...
 
High Volume Throughput Computing: Identifying and Characterizing Throughput Oriented Workloads in Data Centers
Found in: 2012 26th IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)
By Jianfeng Zhan,Lixin Zhang,Ninghui Sun,Lei Wang,Zhen Jia,Chunjie Luo
Issue Date:May 2012
pp. 1712-1721
For the first time, this paper systematically identifies three categories of throughput oriented workloads in data centers: services, data processing applications, and interactive real-time applications, whose targets are to increase the volume of throughp...
 
Characterization of real workloads of web search engines
Found in: IEEE Workload Characterization Symposium
By Huafeng Xi,Jianfeng Zhan,Zhen Jia,Xuehai Hong,Lei Wang,Lixin Zhang,Ninghui Sun,Gang Lu
Issue Date:November 2011
pp. 15-25
Search is the most heavily used web application in the world and is still growing at an extraordinary rate. Understanding the behaviors of web search engines, therefore, is becoming increasingly important to the design and deployment of data center systems...
 
In Cloud, Can Scientific Communities Benefit from the Economies of Scale?
Found in: IEEE Transactions on Parallel and Distributed Systems
By Lei Wang,Jianfeng Zhan,Weisong Shi,Yi Liang
Issue Date:February 2012
pp. 296-303
The basic idea behind cloud computing is that resource providers offer elastic resources to end users. In this paper, we intend to answer one key question to the success of cloud computing: in cloud, can small-to-medium scale scientific communities benefit...
 
Transformer: A New Paradigm for Building Data-Parallel Programming Models
Found in: IEEE Micro
By Peng Wang, Dan Meng, Jizhong Han, Jianfeng Zhan, Bibo Tu, Xiaofeng Shi, Le Wan
Issue Date:July 2010
pp. 55-64
Cloud computing drives the design and development of diverse programming models for massive data processing. The Transformer programming framework aims to facilitate the building of diverse data-parallel programming models. Transformer has two lay...
 
Accurate Analytical Models for Message Passing on Multi-core Clusters
Found in: Parallel, Distributed, and Network-Based Processing, Euromicro Conference on
By Bibo Tu, Jianping Fan, Jianfeng Zhan, Xiaofang Zhao
Issue Date:February 2009
pp. 133-139
Memory hierarchy on multi-core clusters has two-fold characteristics: vertical memory hierarchy and horizontal memory hierarchy. Vertical memory hierarchy has been modeled by previous work (e.g. memory logP, lognP, log3P etc.) to analyze middleware’s effec...
 
A Performance Model for Domino Mail Server
Found in: Computer Science and Software Engineering, International Conference on
By Yi Liang, Lei Wang, Jianfeng Zhan, Ruihua Di
Issue Date:December 2008
pp. 473-476
Using performance model to analyze the internet service is prevailing. However, the research of the performance model for Mail server remains in the immature status. In this paper, based on the Queue Theory, we propose a performance model for Domino mail ...
 
Design Techniques for the Scalability of Cluster Management Software on Dawning Supercomputers
Found in: Parallel and Distributed Processing with Applications, International Symposium on
By Bibo Tu, Ming Zou, Jianfeng Zhan, Jianping Fan
Issue Date:December 2008
pp. 559-565
Cluster management software has faced more increased scalability challenge with ever enlarged cluster scale. Its good scalability rests with feasible design techniques focusing on hybrid software topologies with partitioning policy, non-blocking I/O multip...
 
A Fast-Start, Fault-Tolerant MPI Launcher on Dawning Supercomputers
Found in: Parallel and Distributed Computing Applications and Technologies, International Conference on
By Xu Liu, Bibo Tu, Jianfeng Zhan, Dan Meng
Issue Date:December 2008
pp. 263-266
Daemon-based MPI launchers are the mainstream in nowadays, because they can startup processes rapidly. However, effective task management and fault tolerance become more important as the scale of supercomputers enlarges. A new fast-start and fault tolerant...
 
Gingko: correlating causal paths in distributed systems
Found in: Network and Parallel Computing Workshops, IFIP International Conference on
By Zhihong Zhang, Dan Meng, Jianfeng Zhan, Lei Wang, Yi Jin, Yu Wen, Hui Wang
Issue Date:September 2007
pp. 762-767
Many large-scale systems are distributed systems of multiple communicating components. Finding causal paths of message traces between components throughout these systems is important to uncover runtime behaviors and identify the root cause of failures, but...
 
Design Patterns of Scalable Cluster System Software
Found in: Parallel and Distributed Computing Applications and Technologies, International Conference on
By Bibo Tu, Ming Zou, Jianfeng Zhan, Lei Wang, Jianping Fan
Issue Date:December 2006
pp. 415-420
The design pattern of cluster system software has an important influence on scalability of massive cluster system. The paper presents design patterns of scalable cluster system software, including scalable software topologies and optimized communication mo...
 
An Integrated Adaptive Management System for Cluster-based Web Services
Found in: Cluster Computing, IEEE International Conference on
By Ying Jiang, Dan Meng, Chao Ren, Jianfeng Zhan
Issue Date:September 2006
pp. 1-10
The complexity of the cluster-based Web service challenges the traditional approaches, which fail to guarantee the reliability and real-time performance required. In this paper, we present an integrated adaptive management system (JAMS) for such service. T...
 
A Failure-Aware Scheduling Strategy in Large-Scale Cluster System
Found in: Cluster Computing and the Grid, IEEE International Symposium on
By Wu Linping, Meng Dan, Jianfeng Zhan, Wang Lei, Tu Bibo
Issue Date:May 2006
pp. 645-648
As the scale is expanding, node failure becomes a commonplace feature of large-scale cluster systems. As an important part of cluster operating system software, job scheduling takes charge with high efficient resource management and reasonable job scheduli...
 
PhoenixG: A Unified Management Framework for Industrial Information Grid
Found in: Cluster Computing and the Grid, IEEE International Symposium on
By Jianfeng Zhan, Gengpu Liu, Lei Wang, Bibo Tu, Yi Jin, Yang Li, Yan Hao, Xuehai Hong, Dan Meng, Ninghui Sun
Issue Date:May 2006
pp. 489-496
The Industrial Information Grid is a special kind of system, the users of which exclusively own geographically distributed computing resources for business service, and try to maintain the lowest total cost of ownership while guaranteeing quality of servic...
 
Adaptive Mechanisms for Managing the High Performance Web-based Applications
Found in: High Performance Computing and Grid in Asia Pacific Region, International Conference on
By Ying Jiang, Danjun Liu, Dan Meng, Jianfeng Zhan
Issue Date:December 2005
pp. 392-397
The complexity of the high performance web-based application challenges the traditional approaches, which fail to guarantee the reliability and real-time performance required. In this paper, we present a prototype of a cluster-based adaptive appli...
 
Adaptive Management of a Utility Computing
Found in: Cluster Computing, IEEE International Conference on
By Ying Jiang, Dan Meng, Yi Liang, Danjun Liu, Jianfeng Zhan
Issue Date:September 2005
pp. 1-2
The complexity of the high performance Web-based application challenges the traditional approaches, which fail to guarantee the reliability and real-time performance required. In this paper, we have studied the adaptive mechanisms for managing such applica...
 
Fire Phoenix Cluster Operating System Kernel and its Evaluation
Found in: Cluster Computing, IEEE International Conference on
By Jianfeng Zhan, Ninghui Sun
Issue Date:September 2005
pp. 1-9
Fire Phoenix cluster operating system kernel (Phoenix kernel) is a minimum set of cluster core functions with scalability and fault-tolerance support. In this paper, we define components of cluster operating system kernel, and introduce its internal mechan...
 
BigDataBench: A big data benchmark suite from internet services
Found in: 2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA)
By Lei Wang,Jianfeng Zhan,Chunjie Luo,Yuqing Zhu,Qiang Yang,Yongqiang He,Wanling Gao,Zhen Jia,Yingjie Shi,Shujie Zhang,Chen Zheng,Gang Lu,Kent Zhan,Xiaona Li,Bizhu Qiu
Issue Date:February 2014
pp. 488-499
As architecture, systems, and data management communities pay greater attention to innovative big data systems and architecture, the pressure of benchmarking and evaluating these systems rises. However, the complexity, diversity, frequently changed workloa...
   
PowerTracer: Tracing Requests in Multi-tier Services to Reduce Energy Inefficiency
Found in: IEEE Transactions on Computers
By Gang Lu,Jianfeng Zhan,Haining Wang,Lin Yuan,Yunwei Gao,Chuliang Weng,Yong Qi
Issue Date:April 2014
pp. 1
As energy has become one of the key operating costs in running a data center and power waste commonly exists, it is essential to reduce energy inefficiency inside data centers. In this paper, we develop an innovative framework, called PowerTracer, for diag...
 
Characterizing data analysis workloads in data centers
Found in: 2013 IEEE International Symposium on Workload Characterization (IISWC)
By Zhen Jia,Lei Wang,Jianfeng Zhan,Lixin Zhang,Chunjie Luo
Issue Date:September 2013
pp. 66-76
As the amount of data explodes rapidly, more and more corporations are using data centers to make effective decisions and gain a competitive edge. Data analysis applications play a significant role in data centers, and hence it has become increasingly impo...
   
A Relationship-Based VM Placement Framework of Cloud Environment
Found in: 2013 IEEE 37th Annual Computer Software and Applications Conference (COMPSAC)
By Xiaodong Zhang,Ying Zhang,Xing Chen,Kai Liu,Gang Huang,Jianfeng Zhan
Issue Date:July 2013
pp. 124-133
Managing computation resources in a cost-effective way has become the core competence for a Cloud provider to win over the market because of the "pay-as-you-go" business model. Therefore, VM placement has become more and more important in the res...
 
LogMaster: Mining Event Correlations in Logs of Large-Scale Cluster Systems
Found in: 2012 IEEE 31st International Symposium on Reliable Distributed Systems (SRDS)
By Xiaoyu Fu,Rui Ren,Jianfeng Zhan,Wei Zhou,Zhen Jia,Gang Lu
Issue Date:October 2012
pp. 71-80
This paper presents a set of innovative algorithms and a system, named Log Master, for mining correlations of events that have multiple attributions, i.e., node ID, application ID, event type, and event severity, in logs of large-scale cloud and HPC system...
   
In cloud, do MTC or HTC service providers benefit from the economies of scale?
Found in: Proceedings of the 2nd Workshop on Many-Task Computing on Grids and Supercomputers (MTAGS '09)
By Jianfeng Zhan, Lei Wang, Lin Yuan, Weisong Shi, Yi Liang
Issue Date:November 2009
pp. 1-10
Cloud computing, which is advocated as an economic platform for daily computing, has become a hot topic for both industrial and academic communities in the last couple of years. The basic idea behind cloud computing is that resource providers, which own th...
     
The design methodology of Phoenix cluster system software stack
Found in: Proceedings of the 2007 Asian technology information program's (ATIP's) 3rd workshop on High performance computing in China: solution approaches to impediments for high performance computing (CHINA HPC '07)
By Bibo Tu, Bizhu Qiu, Dan Meng, Hui Wang, Jianfeng Zhan, Lei Wang, Ninghui Sun, Peng Wang, Yi Jin, Yu Wen, Yuansheng Chen, Zhihong Zhang
Issue Date:November 2007
pp. 305-305
Though many research groups have explored the design methodology of cluster system software stack, few works discuss what constitutes a good one. In this paper, we choose four criteria throughout the lifecycle of cluster system software stack to evaluate i...
     
TSAC: Enforcing Isolation of Virtual Machines in Clouds
Found in: IEEE Transactions on Computers
By Chuliang Weng,Jianfeng Zhan,Yuan Luo
Issue Date:May 2014
pp. 1
Virtualization plays a vital role in building the infrastructure of Clouds, and isolation is considered as one of its important features. However, we demonstrate with practical measurements that there exist two kinds of isolation problems in current virtua...
 