banner

header-bar

 

Welcome

Getting Started

Conference Information

Technical Sessions

Workshops

Book of Abstracts

Author Index

Search Proceedings

 

 

sponsors02

ieee

 

comp-soc
Technical Committee on
Parallel Processing

 

acm

 

 

Session 24: Fault Tolerance and Checkpointing

 

A Job Pause Service under LAM/MPI+BLCR for Transparent Fault Tolerance
Chao Wang, Frank Mueller, Christian Engelmann and Stephen L. Scott

An optimistic checkpointing and selective message logging approach for consistent global checkpoint collection in distributed systems
Qiangfeng Jiang and D. Manivannan

DejaVu: Transparent User-Level Checkpointing, Migration, and Recovery for Distributed Systems
Joseph F. Ruscio, Michael A. Heffner and Srinidhi Varadarajan

A Fault Tolerance Protocol with Fast Fault Recovery
Sayantan Chakravorty and Laxmikant V. Kale

 

 

 

CD-ROM produced by X-CD Technologies Inc.