Sciweavers

3373 search results - page 17 / 675
» Malleable applications for scalable high performance computi...
Sort
View
HIPC
2007
Springer
16 years 4 days ago
A Scalable Asynchronous Replication-Based Strategy for Fault Tolerant MPI Applications
As computational clusters increase in size, their mean-time-to-failure reduces. Typically checkpointing is used to minimize the loss of computation. Most checkpointing techniques, ...
John Paul Walters, Vipin Chaudhary
IPPS
2003
IEEE
15 years 11 months ago
Recovery Schemes for High Availability and High Performance Distributed Real-Time Computing
Clusters and distributed systems offer fault tolerance and high performance through load sharing, and are thus attractive in real-time applications. When all computers are up and ...
Lars Lundberg, Daniel Häggander, Kamilla Klon...
HPCN
1997
Springer
15 years 10 months ago
Evaluation of High Performance Fortran Through Application Kernels
Since the de nition of the High Performance Fortran HPF standard, we have been maintaining a suite of application kernel codes with the aim of using them to evaluate the availabl...
H. W. Yau, Geoffrey Fox, Kenneth A. Hawick
ISCA
2009
IEEE
239views Hardware» more  ISCA 2009»
16 years 19 days ago
Scalable high performance main memory system using phase-change memory technology
The memory subsystem accounts for a significant cost and power budget of a computer system. Current DRAM-based main memory systems are starting to hit the power and cost limit. A...
Moinuddin K. Qureshi, Vijayalakshmi Srinivasan, Ju...
CORR
2008
Springer
134views Education» more  CORR 2008»
15 years 6 months ago
Algorithmic Based Fault Tolerance Applied to High Performance Computing
: We present a new approach to fault tolerance for High Performance Computing system. Our approach is based on a careful adaptation of the Algorithmic Based Fault Tolerance techniq...
George Bosilca, Remi Delmas, Jack Dongarra, Julien...