Sciweavers

PVM
2007
Springer
16 years 1 days ago
Using CMT in SCTP-Based MPI to Exploit Multiple Interfaces in Cluster Nodes
Many existing clusters use inexpensive Gigabit Ethernet and often have multiple interfaces cards to improve bandwidth and enhance fault tolerance. We investigate the use of Concurr...
Brad Penoff, Mike Tsai, Janardhan R. Iyengar, Alan...
147
Voted
LADS
2007
Springer
16 years 2 days ago
A Step Towards Fault Tolerance for Multi-Agent Systems
Robustness, through fault tolerance, is a property often put forward in order to advocate MAS. The question is: What is the first step to be fault tolerant? Obviously the answer i...
Katia Potiron, Patrick Taillibert, Amal El Fallah-...
EMSOFT
2007
Springer
16 years 3 days ago
A dynamic scheduling approach to designing flexible safety-critical systems
The design of safety-critical systems has typically adopted static techniques to simplify error detection and fault tolerance. However, economic pressure to reduce costs is exposi...
Luís Almeida, Sebastian Fischmeister, Madhu...
SRDS
2007
IEEE
16 years 6 days ago
Customizable Fault Tolerance for Wide-Area Replication
Constructing logical machines out of collections of physical machines is a well-known technique for improving the robustness and fault tolerance of distributed systems. We present...
Yair Amir, Brian A. Coan, Jonathan Kirsch, John La...
155
Voted
NCA
2007
IEEE
16 years 6 days ago
GRIDTS: A New Approach for Fault-Tolerant Scheduling in Grid Computing
This paper proposes GRIDTS, a grid infrastructure in which the resources select the tasks they execute, on the contrary to traditional infrastructures where schedulers find resou...
Fábio Favarim, Joni da Silva Fraga, Lau Che...
165
Voted
MSS
2007
IEEE
82views Hardware» more  MSS 2007»
16 years 6 days ago
Tornado Codes for MAID Archival Storage
This paper examines the application of Tornado Codes, a class of low density parity check (LDPC) erasure codes, to archival storage systems based on massive arrays of idle disks (...
Matthew Woitaszek, Henry M. Tufo
153
Voted
LAWEB
2007
IEEE
16 years 6 days ago
A Fault Tolerant Web Service Architecture
Web services have been pointed as a suitable technology for the development and execution of distributed applications. However, the Web service architecture still lacks facilities...
Diego Zuquim Guimarães Garcia, Maria Beatri...
ISQED
2007
IEEE
166views Hardware» more  ISQED 2007»
16 years 6 days ago
Reducing the Energy Consumption in Fault-Tolerant Distributed Embedded Systems with Time-Constraint
In this paper we address the problem of reducing the energy consumption in distributed embedded systems associated with time-constraints and equipped with fault-tolerant technique...
Yuan Cai, Sudhakar M. Reddy, Bashir M. Al-Hashimi
151
Voted
ISQED
2007
IEEE
206views Hardware» more  ISQED 2007»
16 years 6 days ago
Provisioning On-Chip Networks under Buffered RC Interconnect Delay Variations
Abstract—A Network-on-Chip (NoC) replaces on-chip communication implemented by point-to-point interconnects in a multi-core environment by a set of shared interconnects connected...
Mosin Mondal, Tamer Ragheb, Xiang Wu, Adnan Aziz, ...
172
Voted
ISORC
2007
IEEE
16 years 6 days ago
Exploiting Tuple Spaces to Provide Fault-Tolerant Scheduling on Computational Grids
Scheduling tasks on large-scale computational grids is difficult due to the heterogeneous computational capabilities of the resources, node unavailability and unreliable network ...
Fábio Favarim, Joni da Silva Fraga, Lau Che...