×

Performance evaluation of fault tolerance techniques in grid computing system. (English) Zbl 1213.68135

Summary: Fault tolerance is the ability of a system to perform its function correctly even in the presence of faults. Therefore, fault tolerance techniques (FTTs) are important for improving the efficient utilization of expensive resources in high performance grid computing systems.
This paper presents a performance evaluation of most commonly used FTTs in grid computing systems. We consider different system centric parameters, such as throughput, turnaround time, waiting time and network delay for the evaluation of these FTTs. For comprehensive evaluation, we set up various conditions in which we vary the average percentage of faults in a system, along with different workloads in order to find out the behavior of FTTs under these conditions. The empirical evaluation shows that the workflow level alternative task techniques have performance priority on task level checkpointing techniques. This comparative study will help grid computing researchers to understand the behavior and performance of different FTTs in detail.

MSC:

68M15 Reliability, testing and fault tolerance of networks and computer systems
68M20 Performance evaluation, queueing, and scheduling in the context of computer systems

Software:

GridSim
PDFBibTeX XMLCite
Full Text: DOI