Homework 4: Introduction to Theseus

In this homework you will get a chance to build your own Theseus data integration plans. For this homework, all of the plans you will write will process local data (stored in files). The next Theseus homework will focus on processing network data, such as that which the wrappers that you wrote earlier can fetch and extract.

What to do

Write a plan that returns finds the set of non "fast food" restaurants that exist in a certain location (specifically, contained in a geocoded bounding box). You must use a predefined input data file, which contains a set of restaurants, a set of lattitude and longitude values for those restaurants, and a predefined bounding box. This input file is an example of the type of data you find on the Web - data from multiple sources that you want to relate or combine.

You will need to do the following:

Hints/Notes:

After writing your plan, you can ensure that it works by making sure it generates the following output (order of the values is not important):
----------------------------------------------
RELATION: hw4_answer
   attrs: info
----------------------------------------------
California Pizza Kitchen is located at lat=33.48, lon=-117.55
Il Fornaio is located at lat=33.45, lon=-118.21
Gaylord is located at lat=33.75, lon=-117.23
Alto Palato is located at lat=33.70, lon=-117.45
Youngsusan is located at lat=32.25, lon=-119.12
Cheesecacke Factory is located at lat=33.02, lon=-116.45
Panda Express is located at lat=33.44, lon=-116.35
Chart House is located at lat=32.45, lon=-118.21
Benihanas is located at lat=33.77, lon=-117.11
----------------------------------------------
What to turn in

You should turn in the following:

Questions and comments

If you have any, contact Greg Barish.