RISE: Repository of Online Information Sources Used in Information Extraction Tasks


RISE is a distributed repository of online information sources that are used for the empirical analysis of learning algorithms that generate extraction patterns. The sources included in this repository are provided by people from the information extraction (IE) and wrapper generation (WG) communities. Both communities use machine learning algorithms to generate extraction patterns for online information sources. If you are interested in more details about learning extraction patterns, you can download this survey. The construction of this repository is an on-going process. If you are aware of an entry that RISE should contain, please let us know. We are always interested in donations.

If you publish material based on sources obtained from this repository, then, in your acknowledgments, please note the assistance you received by using RISE. This will help others to obtain the same data sets and replicate your experiments. We suggest the following pseudo-APA reference format for referring to this repository:

RISE (1998). A repository of online information sources used in information extraction tasks
[http://www.isi.edu/integration/RISE/index.html]. University of Southern California,
Information Sciences Institute.

Thank you and please come again!

Acknowledgment:This document was created based on the excellent home page of the U.C. Irvine Machine Learning Repository.

Last modified: Oct 25, 2004 Suggestions, comments, and questions to: Ion Muslea