Sciweavers

2498 search results - page 229 / 500
» Software Fault Tolerance
Sort
View
IPPS
2007
IEEE
16 years 25 days ago
Implementing and Evaluating Automatic Checkpointing
As the size and popularity of computer clusters go on growing, fault tolerance is becoming a crucial factor to ensure high performance and reliability for applications. To provide...
Antonio S. Martins, Ronaldo Augusto Lara Gon&ccedi...
CCGRID
2006
IEEE
16 years 18 days ago
MPI-Mitten: Enabling Migration Technology in MPI
Group communications are commonly used in parallel and distributed environment. However, existing migration mechanisms do not support group communications. This weakness prevents ...
Cong Du, Xian-He Sun
PODC
2005
ACM
16 years 3 days ago
On reliable broadcast in a radio network
— We consider the problem of reliable broadcast in an infinite grid (or finite toroidal) radio network under Byzantine and crash-stop failures. We present bounds on the maximum...
Vartika Bhandari, Nitin H. Vaidya
ICPPW
1999
IEEE
15 years 10 months ago
A Group Communication Protocol for CORBA
Group communication protocols are used in fault-tolerant systems to maintain strong replica consistency. The FaultTolerant Multicast Protocol (FTMP) described here is a group comm...
Louise E. Moser, P. M. Melliar-Smith, Ruppert R. K...
DSN
2004
IEEE
15 years 10 months ago
Implementing Simple Replication Protocols using CORBA Portable Interceptors and Java Serialization
The goal of this paper is to assess the value of simple features that are widely available in off-the-shelf CORBA and Java platforms for the implementation of faulttolerance mecha...
Taha Bennani, Laurent Blain, Ludovic Courtè...