Sciweavers

» Operating System
SRDS
1996
IEEE
15 years 10 months ago
Exploiting Data-Flow for Fault-Tolerance in a Wide-Area Parallel System
Wide-area parallel processing systems will soon be available to researchers to solve a range of problems. In these systems, it is certain that host failures and other faults will ...
Anh Nguyen-Tuong, Andrew S. Grimshaw, Mark Hyett
LISA
2008
15 years 9 months ago
Petascale System Management Experiences
Petascale HPC systems are among the largest systems in the world. Intrepid, one such system, is a 40,000 node, 556 teraflop Blue Gene/P system that has been deployed at Argonne Na...
Narayan Desai, Rick Bradshaw, Cory Lueninghoener, ...
SRDS
2008
IEEE
16 years 1 months ago
An Incremental File System Consistency Checker for Block-Level CDP Systems
A block-level continuous data protection (CDP) system logs every disk block update from an application server (e.g., a file or DBMS server) to a storage system so that any disk u...
Maohua Lu, Tzi-cker Chiueh, Shibiao Lin
OSDI
2006
ACM
16 years 7 months ago
The Chubby Lock Service for Loosely-Coupled Distributed Systems
We describe our experiences with the Chubby lock service, which is intended to provide coarse-grained locking as well as reliable (though low-volume) storage for a loosely-coupled...
Michael Burrows
ECRTS
1999
IEEE
15 years 11 months ago
Distributed real-time task monitoring in the safety-critical system Melody
Safety-critical systems typically operate in unpredictable environments. Requirements for safety and reliability are in conflict with those for real-time responsiveness. Due to un...
Horst F. Wedde, Jon A. Lind, Guido Seghert