Sciweavers

» Autonomic power and performance management for computing sys...
CSE
2009
IEEE
15 years 10 months ago
Self-Adaptation of Fault Tolerance Requirements Using Contracts
Fault tolerance is a constant concern in data centers where servers have to run with a minimal level of failures. Changes in the operating conditions or in server demands, and var...
André Luiz B. Rodrigues, Leila N. Bezerra, ...
ICAC
2009
IEEE
15 years 10 months ago
A decentralized, architecture-based framework for self-growing applications
In large-scale, distributed software systems, an important management undertaking concerns the creation and runtime modification of application instances. This short paper propose...
Ada Diaconescu, Philippe Lalanda
RTSS
2003
IEEE
16 years 1 day ago
Power-aware QoS Management in Web Servers
Power management in data centers has become an increasingly important concern. Large server installations are designed to handle peak load, which may be significantly larger than...
Vivek Sharma, Arun Thomas, Tarek F. Abdelzaher, Ke...
SC
2004
ACM
16 years 6 days ago
Fastpath Optimizations for Cluster Recovery in Shared-Disk Systems
We describe the design and implementation of a clustering service for a high-performance, shared-disk file system. The service provides failure detection and recovery, reliable e...
Randal C. Burns
IAW
2003
IEEE
16 years 2 days ago
Assuring Consistency and Increasing Reliability in Group Communication Mechanisms in Computational Resiliency
The Computational Resiliency library (CRLib) provides distributed systems with the ability to sustain operation and dynamically restore the level of assurance in system functio...
Norka B. Lucena, Steve J. Chapin, Joohan Lee