Skip to content Skip to main navigation Report an accessibility issue

EECS Publication

Reliability and Performance Modeling and Analysis for Grid Computing

Yuan-Shun Dai, Jack Dongarra

Grid computing is a newly developed technology for complex systems with large-scale resource sharing, wide-area communication, and multi-institutional collaboration. It is hard to analyze and model the Grid reliability because of its largeness, complexity and stiffness. Therefore, this chapter introduces the Grid computing technology, presents different types of failures in grid system, models the grid reliability with star structure and tree structure, and finally studies optimization problems for grid task partitioning and allocation. The chapter then presents models for star-topology considering data dependence and tree-structure considering failure correlation. Evaluation tools and algorithms are developed, evolved from Universal generating function and Graph Theory. Then, the failure correlation and data dependence are considered in the model. Numerical examples are illustrated to show the modeling and analysis.

Published  2008-06-03 04:00:00  as  ut-cs-08-618 (ID:93)

ut-cs-08-618.pdf

« Back to Listing