High-performance remote access to climate simulation data: a challenge problem for data grid technologies

Abstract

In numerous scientific disciplines, terabyte- and petabyte-scale data collections are emerging as critical community resources. A new class of “data grid” infrastructure is required to support management, transport, distributed access to, and analysis of these datasets by potentially thousands of users. Researchers who face this challenge include the climate modeling community, which performs long-duration computations accompanied by frequent output of very large files that must be further analyzed. We describe the Earth System Grid-I prototype, which brings together advanced analysis, replica management, data transfer, request management, and other technologies to support high-performance, interactive analysis of replicated data. We present performance results that demonstrate our ability to manage the location and movement of large datasets from the user’s desktop. We report on experiments conducted …

Date: October 1, 2003
Authors: Ann Chervenak, Ewa Deelman, Carl Kesselman, Bill Allcock, Ian Foster, Veronika Nefedova, Jason Lee, Alex Sim, Arie Shoshani, Bob Drach, Dean Williams, Don Middleton
Journal: Parallel Computing
Volume: 29
Issue: 10
Pages: 1335-1356
Publisher: North-Holland