
Publications
Papers
Data Integration and Access: The Digital Government Research Center's Energy Data Collection (EDC) Project.
J.L. Ambite, Y. Arens, L. Gravano, V. Hatzivassiloglou, E.H. Hovy, J.L. Klavans, A. Philpot, U. Ramachandran, K. Ross, J. Sandhaus, D. Sarioz, A. Singla, and B. Whitman.
2002 forthcoming.
Chapter in W. McIver (ed), ***. Kluwer Academic Publishers. This chapter provides an in-depth description of ontologies and domain modeling to support database access planning, in the framework of the EDC project. It discusses experiments to automate term extraction from glossaries, term clustering, and term-to-ontology matching.
Simplifying Data Access: The Energy Data Collection Project.
J.L. Ambite, Y. Arens, E.H. Hovy, A. Philpot, L. Gravano, V. Hatzivassiloglou, J.L. Klavans.
February 2001.
Article in IEEE Computer 34(2). Get paper in pdf.
This paper provides a brief overview of the goals and operation of the EDC system, tailored to a general Computer Science audience.
Building Ontologies and Integrating Data from Multiple Agencies: A Case Study Using Gasoline.
J.L. Ambite, Y. Arens, L. Gravano, V. Hatzivassiloglou, E.H. Hovy, J.L. Klavans, A. Philpot, U. Ramachandran, J. Sandhaus, A. Singla, and B. Whitman.
August 2000.
Proceedings of the 2000 Joint Statistical Meetings, Paper 1941. Indianapolis. Get paper in pdf.
This paper provides a brief overview of the goals and operation of the EDC system, tailored to researchers and Government officials working with Federal statistics.
The EDC Project.
J.L. Ambite, Y. Arens, L. Gravano, V. Hatzivassiloglou, E.H. Hovy, J.L. Klavans, and A. Philpot.
May 2000.
Presented at the NSF's Workshop on Digital Government dg.o 2000, Los Angeles. This presentation provided a brief overview of the goals and first version of the EDC system.
Data Acquisition and Integration in the DGRC's Energy Data Collection Project.
E.H. Hovy, A. Philpot, J.L. Ambite, Y. Arens, J.L. Klavans, W. Bourne, and D. Sarioz.
May 2001.
Proceedings of the NSF's 1st Digital Government dg.o 2001 conference, Los Angeles. Get paper in pdf.
This paper provides a technical description of experiments with algorithms used to try to build domain models automatically: term clustering and ontology alignment.
Extracting Taxonomic Relationships from On-Line Definitional Sources Using LEXING.
J.L. Klavans and B. Whitman.
June 2001.
Proceedings of the ACM/IEEE-Computer Joint Conference on Digital Libraries, Roanoke, VA. Get paper in pdf.
This paper describes LEXING, a system that extracts words and their interrelationships from glossaries.
Scalable Access and Integration of Statistical Data for Digital Government.
J.L. Ambite, Y.Arens, S. Feiner, E.H. Hovy, J.L. Klavans, and A. Philpot.
April 2001.
Proceedings of the AFCEA Colloquium, San Diego. Get paper in pdf.
This paper describes database access planning in the EDC system.
Fast Approximate Evaluation of OLAP Queries for Integrated Statistical Data. J.L. Ambite, C. Shahabi, R.R. Schmidt, and A. Philpot. May 2001.
Proceedings of the NSF's 1st Digital Government dg.o 2001 conference, Los Angeles. Get paper in pdf.
This paper describes a method to greatly speed up database queries against large data warehouses by using a fast wavelet-based technique from Physics.
Two Approaches toward Solving the Problem of Access to Distributed and Heterogeneous Data.
J. Callan, W.B. Croft, and E.H. Hovy.
December 2001.
Article in DG Online Get article in pdf.
This article contrasts two approaches to metadata creation for database access: statistical language modeling and ontologies.
Reports