Search Sciweavers | Sciweavers

288

LADC
2011
Springer

289views Applied Computing» more LADC 2011»

Byzantine Fault-Tolerant Deferred Update Replication

14 years 9 months ago

Abstract—Replication is a well-established approach to increasing database availability. Many database replication protocols have been proposed for the crash-stop failure model, ...

Fernando Pedone, Nicolas Schiper, José Enri...

claim paper

Read More »

162

click to vote

ICAC
2005
IEEE

163views Applied Computing» more ICAC 2005»

Distributed Troubleshooting Agents

15 years 11 months ago

Download www.stottlerhenke.com

Key issues to address in autonomic job recovery for cluster computing are recognizing job failure; understanding the failure sufficiently to know if and how to restart the job; an...

Charles Earl, Emilio Remolina, Jim Ong, John Brown

claim paper

Read More »

144

click to vote

DSOM
2004
Springer

126views Computer Networks» more DSOM 2004»

ABHA: A Framework for Autonomic Job Recovery

15 years 11 months ago

Download www.stottlerhenke.com

Key issues to address in autonomic job recovery for cluster computing are recognizing job failure; understanding the failure sufficiently to know if and how to restart the job; an...

Charles Earl, Emilio Remolina, Jim Ong, John Brown...

claim paper

Read More »

258

click to vote

PRDC
2007
IEEE

84views Applied Computing» more PRDC 2007»

Implementation of a Flexible Membership Protocol on a Real-Time Ethernet Prototype

16 years 14 days ago

Download www.ce.chalmers.se

This paper describes the implementation of a processorgroup membership protocol in an experimental real-time network. The protocol is appropriate for fault-tolerant distributed sy...

Raul Barbosa, António Ferreira, Johan Karls...

claim paper

Read More »

136

click to vote

OSDI
2004
ACM

87views Operating System» more OSDI 2004»

Microreboot - A Technique for Cheap Recovery

16 years 6 months ago

Download www.usenix.org

A significant fraction of software failures in large-scale Internet systems are cured by rebooting, even when the exact failure causes are unknown. However, rebooting can be expen...

George Candea, Shinichi Kawamoto, Yuichi Fujiki, G...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers