Provenance in Sensornet Republishing

Unkyu Park and John Heidemann
USC/Information Sciences Institute

Abstract

Sensornets are being deployed and increasingly brought on-line to share data as it is collected. Sensornet republishing is the process of transforming on-line sensor data and sharing the filtered, aggregated, or improved data with others. We explore the need for data provenance in this system to allow users to understand how processed results are derived and to detect and correct anomalies. We describe our sensornet provenance system, exploring design alternatives and quantifying storage trade-offs in the context of a city-sized temperature monitoring application. In that application, our link approach requires less storage than the other alternatives, and our incremental compression scheme reduces storage further, by up to 83%.

Availability

This paper is available in several formats: an abstract web page with pointers and citations, and PDF. Paper copies can be obtained by writing to the authors. Copyright terms for this paper appear below.

Reference

Park08a
Unkyu Park and John Heidemann. Provenance in Sensornet Republishing. In Proceedings of the 2nd International Provenance and Annotation Workshop, p. to appear. Salt Lake City, Utah, USA, Springer Verlag. June, 2008. <http://www.isi.edu/~johnh/PAPERS/Park08a.html>.
@inproceedings{Park08a,
	author = "Unkyu Park and John Heidemann",
	title = "Provenance in Sensornet Republishing",
	booktitle = "Proceedings of the 2nd International Provenance and Annotation Workshop",
	year = "2008",
	publisher = "Springer Verlag",
	address = "Salt Lake City, Utah, USA",
	month = "June",
	pages = "to appear",
	xnote = "(also released as tech report ISI-TR-2008-650)",
	keywords = "sensornet, data provenance",
	url = "http://www.isi.edu/~johnh/PAPERS/Park08a.html",
	pdfurl = "http://www.isi.edu/~johnh/PAPERS/Park08a.pdf",
}

Copyright

This paper is copyright © 2008 by its authors. Permission to make digital or hard copies of part or all of this work for personal use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that new copies bear this notice and the full citation on the first page. Abstracting with credit is permitted.

To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission of the authors.