Page Model
- ONE_HTML_PAGE ::= List( RESTAURANT )
- RESTAURANT ::= < Name, Address, Phone, Review, List(Credit_Card) >
NOTE:
- Each HTML page is treated as a list of restaurants, and each
restaurant description contains an embedded list of credit
cards. In database terms, an HTML page represents a table with five
columns (Name, Address, Phone, Review, List(Credit_Card)), where
the last column is an embedded table that has exactly one column.
The wrapper must be able to identify each individual credit card,
so that the extracted data can be used to answer queries such as
"give me all Thai restaurants that accept VISA".
Items to be extracted: Name, Address, Phone, Review, Credit_Card.
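The page model above can be sketched as a small data structure. The following Python is only an illustration under stated assumptions: the class name, field names, helper function, and sample values are all hypothetical and not part of the original description. It shows why each credit card has to be extracted as a separate item: membership queries then become a simple filter over the embedded list. Cuisine is not one of the extracted items, so the sketch filters on the credit card only.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Restaurant:
    """One tuple of the page model: <Name, Address, Phone, Review, List(Credit_Card)>."""
    name: str
    address: str
    phone: str
    review: str
    # Embedded one-column table: one entry per individually extracted card.
    credit_cards: List[str] = field(default_factory=list)


def accepting(page: List[Restaurant], card: str) -> List[Restaurant]:
    """Return the restaurants on one page whose card list contains `card`."""
    wanted = card.strip().lower()
    return [
        r for r in page
        if any(c.strip().lower() == wanted for c in r.credit_cards)
    ]


# Hypothetical sample data, invented for illustration only.
page = [
    Restaurant("Thai Example", "1 Sample St.", "555-0100",
               "Placeholder review.", ["VISA", "MasterCard"]),
    Restaurant("Cash Only Diner", "2 Sample Ave.", "555-0101",
               "Placeholder review.", []),
]

visa_names = [r.name for r in accepting(page, "visa")]
print(visa_names)  # ['Thai Example']
```

If the wrapper returned the cards as one undivided string instead, the same query would need ad hoc string parsing after extraction, which is what the embedded-list model is meant to avoid.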
SAMPLE EXTRACTION OUTPUT