Page Model


ONE_HTML_PAGE ::= List( PERSON )

PERSON ::= < Name, Email, AltName, LastUpdate, Organization, ServiceProvider >


NOTE:
each HTML page is seen as a list of persons, where each person is described by the following 6 attributes: Name, Email, AltName, LastUpdate, Organization, ServiceProvider. In database terms, an HTML page represents a table with six columns (i.e., Name, Email, AltName, LastUpdate, Organization, ServiceProvider).

Items to be extracted: Name, Email, AltName, LastUpdate, Organization, ServiceProvider.

NOTES: In this source the items may appear in various orders, and most of them may be missing. A special note for the "Organization" item, which may appear twice; in such a case, for sake of consistency with situations when the organization appears only once, we consider that the occurence to be extracted is the one in which the organization name appears capitalized.


SAMPLE EXTRACTION OUTPUT