to ISI Home Page
isd home
About ISD
education at isd
employment
environment
news
people
research
AI Seminars
div3admin

environment
Renee Miller


"Clio: Schema Mapping and Data Exchange"

5/30/2003: 10:30am - 12:00pm
10th Floor Conference Room

Abstract: We present a novel framework for creating mappings between any combination of XML and relational data sources. In our approach, attribute correspondences (the result of "schema matching") are translated into a set of mappings that capture the semantics of the source and target schemas (including their hierarchical organization as well as their nested referential constraints). These mappings are then translated into queries over the source schema(s) that produce data satisfying the referential constraints and structure of the target schema. These queries preserve the semantic relationships of the source. The mapping algorithm is complete in that it produces all mappings that are consistent with the semantics of the schemas. We have implemented the translation algorithm in Clio, a schema mapping tool, and present our experience using Clio on life science data. The mappings produced by Clio can be used both within data integration where source data is queried through a virtual target view and for data exchange, including the exchange of data in P2P data sharing applications. We discuss the often subtle difference between the semantics of data integration and that of data exchange. This is joint work with Ron Fagin, Mauricio Hernandez, Phokion Kolaitis, Lucian Popa, and Yannis Velegrakis.

About Renee Miller: Renee J. Miller is an associate professor of computer science at the University of Toronto. She received the 1997 Presidential Early Career Award for Scientists and Engineers (PECASE), the highest honor bestowed by the United States government on outstanding scientists and engineers beginning their careers. She is a recipient of the NSF CAREER Award, the Premier's Research Excellence Award, and an IBM Faculty Award. Her research interests are in the efficient, effective use of large volumes of complex, heterogeneous data. This interest spans heterogeneous databases, data mining, and data warehousing. She received her PhD in Computer Science from the University of Wisconsin, Madison and bachelor's degrees in Mathematics and Cognitive Science from MIT.


Last updated: Mon Jun 19 17:44:06 2006

 

 

 

 

 
USC Home Page ISI Home Page