Introducing Provenance Capture into a Legacy Data System
Title | Introducing Provenance Capture into a Legacy Data System |
Publication Type | Journal Article |
Year of Publication | 2013 |
Authors | Conover, H, Ramachandran, R, Beaumont, B, Kulkarni, A, McEniry, M, Regner, K, Graves, S |
Journal | IEEE Transactions on Geoscience and Remote Sensing |
Volume | 51 |
Issue | 11 |
Date Published | 11/2013 |
ISSN Number | 0196-2892 |
Keywords | Browsers, Communities, Context, Data management, data processing, Data systems, Geoscience, geospatial data, metadata standards, provenance, science data systems, Software, standards |
Abstract | Accurate provenance information facilitates improved understanding of Earth science data and scientific reproducibility and can serve as an indicator of data quality. Provenance capture is an integral part of many modern workflow systems but may not have been considered in the design of legacy data production systems. Furthermore, in addition to data lineage, it is also important to capture contextual information needed for understanding how a data set was produced. This paper describes our experience in retrofitting a legacy data system to support capture, storage, and dissemination of provenance. Data inputs and transformations are logged automatically, while broader context information describing science algorithms and ancillary files is manually compiled. Provenance and context information are integrated for interactive user access and embedded into data files as XML documents compliant with the “Lineage” specification for geographic metadata defined by the International Organization for Standardization in the ISO 19115-2 standard. Lessons learned from this approach can inform others who need to incorporate provenance into a data system after the fact. |
DOI | 10.1109/TGRS.2013.2282817 |