to ISI Home Page
isd home
About ISD
education at isd
employment
environment
news
people
research
AI Seminars
div3admin

environment
Lise Getoor
University of Maryland
donotspam.getoor@cs.umd.edu
http://www.cs.umd.edu/users/getoor/


"Learning Statistical Models from Relational Data"

12/21/2001: [time not recorded]
[location not recorded]

Abstract: A large portion of real-world data is stored in commercial relational database systems. In contrast, most statistical learning methods work only with "flat" data representations. Thus, to apply these methods, we are forced to convert the data into a flat form, thereby losing much of the relational structure present in the data and potentially introducing statistical skew. These drawbacks severely limit the ability of current methods to mine relational databases. In this talk I will review recent work on probabilistic models, including Bayesian networks (BNs) and Probabilistic Relational Models (PRMs), and then describe the development of techniques for automatically inducing PRMs directly from structured data stored in a relational or object-oriented database. These algorithms provide the necessary tools to discover patterns in structured data, and provide new techniques for mining relational data. As we go along, I'll present experimental results in several domains, including a biological domain describing tuberculosis epidemiology, a database of scientific paper author and citation information, and Web data. Finally I will present an application of these techniques to the task of selectivity estimation for database query optimization.


Last updated: Mon Jun 19 17:44:06 2006

 

 

 

 

 
USC Home Page ISI Home Page