Publications

Automatic spatio-temporal indexing to integrate and analyze the data of an organization

Abstract

Organizations are awash in data. In many cases, they do not know what data exists within the organization and much information is not available when needed, or worse, information gets recreated from other sources. In this paper, we present an automatic approach to spatio-temporal indexing of the datasets within an organization. The indexing process automatically identifies the spatial and temporal fields, normalizes and cleans those fields, and then loads them into a big data store where the information can be efficiently searched, queried, and analyzed. We evaluated our approach on 600 datasets published by the City of Los Angeles and show that we can automatically process their data and can efficiently access and analyze the indexed data.

Date
November 7, 2017
Authors
Craig A Knoblock, Aparna R Joshi, Abhishek Megotia, Minh Pham, Chelsea Ursaner
Book
Proceedings of the 3rd ACM SIGSPATIAL Workshop on Smart Cities and Urban Analytics
Pages
1-8