NodeWiz: Fault-tolerant Grid Information Service

Sujoy Basua
basus@hpl.hp.com
Lauro Beltrao Costab
lauro@dsc.ufcg.edu.br
Francisco Brasileirob
fubica@dsc.ufcg.edu.br
Sujata Banerjeea
sujata@hpl.hp.com
Puneet Sharmaa
puneet@hpl.hp.com
Sung-Ju Leea
sjlee@hpl.hp.com

aHewlett-Packard Laboratories, Palo Alto, CA
bUniversidade Federal de Campina Grande, Paraiba, Brazil

Abstract

Large scale grid computing systems may provide multitudinous services, from different providers, whose quality of service will vary. Moreover, services are deployed and undeployed in the grid with no central coordination. Thus, to find out the most suitable service to fulfill their needs, or to find the most suitable set of resources on which to deploy their services, grid users must resort to a Grid Information Service (GIS). This service allows users to submit rich queries that are normally composed of multiple attributes and range operations. The ability to efficiently execute complex searches in a scalable and reliable way is a key challenge for current GIS designs. Scalability issues are normally dealt with by using peer-to-peer technologies. However, the more reliable peer-to-peer approaches do not cater for rich queries in a natural way. On the other hand, approaches that can easily support these rich queries are less robust in the presence of failures. In this paper we present the design of NodeWiz, a GIS that allows multi-attribute range queries to be performed efficiently in a distributed manner, while maintaining load balance and resilience to failures.

PDF (464 KB)