University of Southern California

Applying Web of Data Technology to Enterprise Linked Data Clouds

When:
Monday, June 11, 2012, 11:00 am - 12:00 pm
Where:
11th fl Large CR (rm 1135)
Speaker:
Giovanni Tummarello, Ph.D, DERI
Description:

AI SEMINAR
Webcast: http://webcasterms1.isi.edu/mediasite/Viewer/?peid=b9391d8a8f374382bee569f3477f24961d 

Traditional data warehousing techniques and modern big data tools work well on large quantities of data, provided the data has a relatively simple structure that is known in advance. Knowledge-intensive and knowledge-centered enterprises that have hundreds or even thousands of data sources to integrate are still seeking effective solutions that put data and data representations at the heart of the problem. Based on the data processing pipelines currently in production at Sindice.com, I'll discuss how these can be applied to enterprise data integration use cases.

The RETIS platform (Real Time Semantic Warehousing infrastructure) leverages cloud computing to give the operator the ability to perform true "pay as you go" integration and data normalization on large amounts of data, while retaining the ability to view it all as one large graph.

I will show:


• An enterprise use case, with RETIS on top of your existing infrastructure, extracting delta changes from running systems
• How one can perform a preliminary integration and then refine it while the systems are running
• Interaction with other components, including SIREn (Semantic Information Retrieval Engine), triplestores, and traditional and NoSQL databases

About Giovanni Tummarello, Ph.D

Giovanni Tummarello, Ph.D, is a senior research fellow at DERI, an institute focused on Semantic Web technologies. He is known for his work on projects such as the Sindice.com search engine, Sig.ma, and Semantic Web pipes. He currently leads a team of approximately 15 people, split between academic and startup staff. With Renaud Delbru, he is a founder of SindiceTech, a spinoff company from DERI.

About SindiceTech 

SindiceTech specializes in "Big Data" infrastructure that deals with semi-structured, semantic data. Our goal is to enable your business to build its private, strategic "Linked Data Clouds", where information sources can be added in a "pay as you go" fashion and with no concern for size. Ultimately, our technology enables a direct relationship between the domain expert and the data integration process, something unknown in traditional, "IT department driven" data warehousing processes. SindiceTech solutions include SIREn, an extension to Lucene/Solr providing unparalleled semi-structured and semantic data search capabilities.
