Ensuring the Performance of Apache HTTP Server Affected by Aging
Found in: IEEE Transactions on Dependable and Secure Computing
By Jing Zhao,Kishor S. Trivedi,Michael Grottke,Javier Alonso,Yanbin Wang
Issue Date:March 2014
pp. 130-141
Failures due to software aging are typically caused by resource exhaustion, which is often preceded by progressive software performance degradation. Response time as a customer-affecting metric can thus be used to detect the onset of software aging. In thi...
 
An empirical investigation of fault repairs and mitigations in space mission system software
Found in: 2013 43rd Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN)
By Javier Alonso,Michael Grottke,Allen P. Nikora,Kishor S. Trivedi
Issue Date:June 2013
pp. 1-8
Faults in software systems can have different characteristics. In an earlier paper, the anomaly reports for a number of JPL/NASA missions were analyzed and the underlying faults were classified as Bohrbugs, non-aging-related Mandelbugs, and aging-related b...
 
Optimal Resource Allocation in a Virtualized Software Aging Platform with Software Rejuvenation
Found in: Software Reliability Engineering, International Symposium on
By Javier Alonso,Íñigo Goiri,Jordi Guitart,Ricard Gavaldà,Jordi Torres
Issue Date:December 2011
pp. 250-259
Nowadays, virtualized platforms have become the most popular option to deploy complex enough services. The reason is that virtualization allows resource providers to increase resource utilization. Deployed services are expected to be always available, but ...
 
A Comparative Evaluation of Software Rejuvenation Strategies
Found in: Workshop on Software Aging and Rejuvenation
By Javier Alonso,Rivalino Matias,Elder Vicente,Ana M. Carvalho,Kishor Trivedi
Issue Date:December 2011
pp. 26-31
In this paper we present an experimental comparative study of most of the rejuvenation techniques developed so far, divided into two groups: i) simple approaches: physical node reboot (switch off/on), VM reboot, OS reboot and standalone application restart...
 
Predicting Software Anomalies Using Machine Learning Techniques
Found in: Network Computing and Applications, IEEE International Symposium on
By Javier Alonso,Lluís Belanche,Dimiter R. Avresky
Issue Date:August 2011
pp. 163-170
In this paper, we present a detailed evaluation of a set of well-known Machine Learning classifiers in front of dynamic and non-deterministic software anomalies. The system state prediction is based on monitoring system metrics. This allows software proact...
 
Prediction of Job Resource Requirements for Deadline Schedulers to Manage High-Level SLAs on the Cloud
Found in: Network Computing and Applications, IEEE International Symposium on
By Gemma Reig, Javier Alonso, Jordi Guitart
Issue Date:July 2010
pp. 162-167
For a non IT expert to use services in the Cloud is more natural to negotiate the QoS with the provider in terms of service-level metrics --e.g. job deadlines-- instead of resource-level metrics --e.g. CPU MHz. However, current infrastructures only support...
 
Adaptive on-line software aging prediction based on machine learning
Found in: Dependable Systems and Networks, International Conference on
By Javier Alonso, Jordi Torres, Josep Ll. Berral, Ricard Gavaldà
Issue Date:July 2010
pp. 507-516
The growing complexity of software systems is resulting in an increasing number of software faults. According to the literature, software faults are becoming one of the main sources of unplanned system outages, and have an important impact on company benef...
 
J2EE instrumentation for software aging root cause application component determination with AspectJ
Found in: Parallel and Distributed Processing Workshops and PhD Forum, 2011 IEEE International Symposium on
By Javier Alonso,Jordi Torres,Josep Ll. Berral,Ricard Gavaldà
Issue Date:April 2010
pp. 1-8
Unplanned system outages have a negative impact on company revenues and image. While the last decades have seen a lot of efforts from industry and academia to avoid them, they still happen and their impact is increasing. According to many studies, one of t...
 
Using Virtualization to Improve Software Rejuvenation
Found in: IEEE Transactions on Computers
By Luis Moura Silva, Javier Alonso, Jordi Torres
Issue Date:November 2009
pp. 1525-1538
In this paper, we present an approach for software rejuvenation based on automated self-healing techniques that can be easily applied to off-the-shelf Application Servers. Software aging and transient failures are detected through continuous monitoring of ...
 
Predicting Web Server Crashes: A Case Study in Comparing Prediction Algorithms
Found in: Autonomic and Autonomous Systems, International Conference on
By Javier Alonso, Jordi Torres, Ricard Gavaldà
Issue Date:April 2009
pp. 264-269
Traditionally, performance has been the most important metrics when evaluating a system. However, in the last decades industry and academia have been paying increasing attention to another metric to evaluate servers: availability. A web server may serve ma...
 
Work in Progress: Building a Distributed Generic Stress Tool for Server Performance and Behavior Analysis
Found in: Autonomic and Autonomous Systems, International Conference on
By Ada Casanovas, Javier Alonso, Jordi Torres, Artur Andrzejak
Issue Date:April 2009
pp. 342-345
One of the primary tools for performance analysis of multi-tier systems are standardized
 
High-available grid services through the use of virtualized clustering
Found in: Grid Computing, IEEE/ACM International Workshop on
By Javier Alonso, Luis Silva, Artur Andrzejak, Paulo Silva, Jordi Torres
Issue Date:September 2007
pp. 34-41
Grid applications comprise several components and web-services that make them highly prone to the occurrence of transient software failures and aging problems. This type of failures often incur in undesired performance levels and unexpected partial crashes...
 
Using Virtualization to Improve Software Rejuvenation
Found in: Network Computing and Applications, IEEE International Symposium on
By Luis Moura Silva, Javier Alonso, Paulo Silva, Jordi Torres, Artur Andrzejak
Issue Date:July 2007
pp. 33-44
In this paper, we present an approach for software rejuvenation based on automated self-healing techniques that can be easily applied to off-the-shelf Application Servers and Internet sites. Software aging and transient failures are detected through contin...
 
Performance and Availability Modeling of IT Systems with Data Backup and Restore
Found in: IEEE Transactions on Dependable and Secure Computing
By Ruofan Xia,Xiaoyan Yin,Javier Alonso Lopez,Fumio Machida,Kishor S. Trivedi
Issue Date:July 2014
pp. 375-389
In modern IT systems, data backup and restore operations are essential for providing protection against data loss from both natural and man-made incidents. On the other hand, data backup and restore operations can be resource-intensive and lead to performa...
 
Towards fast OS rejuvenation: An experimental evaluation of fast OS reboot techniques
Found in: 2013 IEEE 24th International Symposium on Software Reliability Engineering (ISSRE)
By Antonio Bovenzi,Javier Alonso,Hiroshi Yamada,Stefano Russo,Kishor S. Trivedi
Issue Date:November 2013
pp. 61-70
Continuous or high availability is a key requirement for many modern IT systems. Computer operating systems play an important role in IT systems availability. Due to the complexity of their architecture, they are prone to suffer failures due to several typ...
   
Defects Per Million (DPM) Computation in Service-Oriented Environments
Found in: IEEE Transactions on Services Computing
By Subrota K. Mondal,Xiaoyan Yin,Jogesh K. Muppala,Javier Alonso,Kishor S. Trivedi
Issue Date:November 2013
pp. 1
Traditional system-oriented dependability metrics like reliability and availability do not fully reflect the impact of system failure-repair behavior in service-oriented environments. The telecommunication systems community prefers to use Defects Per Milli...
 
Availability Modeling and Analysis for Data Backup and Restore Operations
Found in: 2012 IEEE 31st International Symposium on Reliable Distributed Systems (SRDS)
By Xiaoyan Yin,Javier Alonso,Fumio Machida,Ermeson C. Andrade,Kishor S. Trivedi
Issue Date:October 2012
pp. 141-150
Data backup operation is an essential part of common IT system administration to protect against data loss caused by any storage failures, human errors, or disasters. Lost data can be recovered from the backed up data if it exists. Since the backup and res...
   
Human - robot swarm interaction for entertainment: from animation display to gesture based control
Found in: Proceedings of the 2014 ACM/IEEE international conference on Human-robot interaction (HRI '14)
By Javier Alonso-Mora, Paul Beardsley, Roland Siegwart
Issue Date:March 2014
pp. 98-98
This work shows experimental results with three systems that take real-time user input to direct a robot swarm formed by tens of small robots. These are: real-time drawing, gesture based interaction with an RGB-D sensor and control via a hand-held tablet c...
     