Publications

Workflow matching using semantic metadata

Abstract

Workflows are becoming an increasingly more common paradigm to manage scientific analyses. As workflow repositories start to emerge, workflow retrieval and discovery becomes a challenge. Studies have shown that scientists wish to discover workflows given properties of workflow data inputs, intermediate data products, and data results. However, workflows typically lack this information when contributed to a repository. Our work addresses this issue by augmenting workflow descriptions with constraints derived from properties about the workflow components used to process data as well as the data itself. An important feature of our approach is that it assumes that component and data properties are obtained from catalogs that are external to the workflow system, consistent with current architectures for computational science.

Date
September 1, 2009
Authors
Yolanda Gil, Jihie Kim, Gonzalo Florez, Varun Ratnakar, Pedro A González-Calero
Book
Proceedings of the fifth international conference on Knowledge capture
Pages
121-128