Publications
A scalable architecture for extracting, aligning, linking, and visualizing multi-Int data
Abstract
An analyst today has a tremendous amount of data available, but each of the various data sources typically exists in their own silos, so an analyst has limited ability to see an integrated view of the data and has little or no access to contextual information that could help in understanding the data. We have developed the Domain-Insight Graph (DIG) system, an innovative architecture for extracting, aligning, linking, and visualizing massive amounts of domain-specific content from unstructured sources. Under the DARPA Memex program we have already successfully applied this architecture to multiple application domains, including the enormous international problem of human trafficking, where we extracted, aligned and linked data from 50 million online Web pages. DIG builds on our Karma data integration toolkit, which makes it easy to rapidly integrate structured data from a variety of sources, including databases …
- Date
- May 15, 2015
- Authors
- Craig A Knoblock, Pedro Szekely
- Conference
- Next-Generation Analyst III
- Volume
- 9499
- Pages
- 31-40
- Publisher
- SPIE