|
A large scale fault-tolerant grid information service
Brasileiro, Francisco; Costa, Lauro Beltrao; Andrade, Alisson; Cirne, Walfredo; Basu, Sujoy; Banerjee, Sujata
HPL-2006-146
External - Copyright Consideration
Keyword(s): Grid Information Service; peer-to-peer; failure detection; availability; kd-tree
Abstract: Large scale grid systems may provide multitudinous services, from different providers, whose quality of service will vary. Moreover, services appear (and disappear) in the grid with no central coordination. Thus, to find out the most suitable service to fulfill their needs, grid users must resort to Grid Information Services (GISs). These services allow users to submit rich queries that are normally composed of multiple attributes and range operations. The ability to efficiently execute complex searches in a scalable and reliable way is a key challenge for current GISs. Scalability issues are normally dealt with by using peer-to-peer technologies. However, the more reliable peer-to-peer approaches do not cater for rich queries in a natural way. On the other hand, approaches that can easily support these rich queries are less robust in the presence of faults. In this paper we focus on peer-to-peer GISs that efficiently support rich queries. In particular, we thoroughly analyze the impact of faults in one representant of such GISs, named NodeWiz. We propose extensions that increase NodeWiz's resilience to faults. The fault tolerance mechanism we propose substantially increases NodeWiz's availability. Notes: Copyright ACM 2006 Published in the 4th International Workshop on Middleware for Grid Computing - MGC 2006, 27 November 2006, Melbourne, Australia
6 Pages
Back to Index
|