GRID COMPUTING AND FAULT TOLERANCE APPROACH

Abstract

Grid computing is a means of allocating the computational power of a large number of computers to complex difficult computation or problem. Grid computing is a distributed computing paradigm that differs from traditional distributed computing in that it is aimed toward large scale systems that even span organizational boundaries. This paper proposes a method to achieve maximum fault tolerance in the Grid environment system by using Reliability consideration by using Replication approach and Check-point approach. Fault tolerance is an important property for large scale computational grid systems, where geographically distributed nodes co-operate to execute a task. In order to achieve high level of reliability and availability, the grid infrastructure should be a foolproof fault tolerant. Since the failure of resources affects job execution fatally, fault tolerance service is essential to satisfy QOS requirement in grid computing. Commonly utilized techniques for providing fault tolerance are job check pointing and replication. Both techniques mitigate the amount of work lost due to changing system availability but can introduce significant runtime overhead. The latter largely depends on the length of check pointing interval and the chosen number of replicas, respectively. In case of complex scientific workflows where tasks can execute in well defined order reliability is another biggest challenge because of the unreliable nature of the grid resources.

Authors and Affiliations

Pankaj Gupta

Keywords

Related Articles

Design of Model for Component Based System

This paper is based on designing such kind of model that will enhance the reusability of software modules, while component based software development approach is used to develop software. It is widely growing engineering...

A comparative study of System Network Architecture Vs Digital Network Architecture

The efficient managing system of sources is mandatory for the successful running of any network. Here this paper describes the most popular network architectures one of developed by IB , System Network Architecture (SNA)...

 An Intelligent Approach to Perform Image Fusion Using Segmentation

This paper deals with the clearance of images by the Fusion technique. The main objective of image fusion is to extract all the useful information from the source images. It does not introduce artifacts or inconsistencie...

Web Crawler: A Review

In a large distributed system like the Web, users find resources by following hypertext links from one document to another. When the system is small and its resources share the same fundamental purpose, users can find r...

Download PDF file
  • EP ID EP98169
  • DOI -
  • Views 106
  • Downloads 0

How To Cite

Pankaj Gupta (2011). GRID COMPUTING AND FAULT TOLERANCE APPROACH. International Journal of Computer Science and Management Studies (IJCSMS) www.ijcsms.com, 11(3), 69-72. https://europub.co.uk/articles/-A-98169