Publications

Replica Management in Data Intensive Distributed Science Applications

Abstract

Management of the large data sets produced by data-intensive scientific applications is complicated by the fact that participating institutions are often geographically distributed and separated by distinct administrative domains. A key data management problem in these distributed collaborations has been the creation and maintenance of replicated data sets. This chapter provides an overview of replica management schemes used in large, data-intensive, distributed scientific collaborations. Early replica management strategies focused on the development of robust, highly scalable catalogs for maintaining replica locations. In recent years, more sophisticated, application-specific replica management systems have been developed to support the requirements of scientific Virtual Organizations. These systems have motivated interest in application-independent, policy-driven schemes for replica management that can be …

Date: February 3, 2026
Authors: Ann Chervenak, Robert Schuler
Book: Data Intensive Distributed Computing: Challenges and Solutions for Large-scale Information Management
Pages: 188-205
Publisher: IGI Global