{"id":80,"date":"2021-10-15T17:58:48","date_gmt":"2021-10-15T17:58:48","guid":{"rendered":"https:\/\/www.isi.edu\/centers-ckg\/?page_id=80"},"modified":"2024-08-09T16:21:48","modified_gmt":"2024-08-09T16:21:48","slug":"downloads","status":"publish","type":"page","link":"https:\/\/www.isi.edu\/centers-ckg\/resources\/downloads\/","title":{"rendered":"Downloads"},"content":{"rendered":"\n\n\t<a onclick=\"topFunction()\" id=\"toTop\" aria-label=\"Go to top\">\n<\/a>\n<header><h2>Available for Download<\/h2><ul><li><a href=\"#sk\">Software: Karma<\/a><\/li><li><a href=\"#mfh\">MapFinder: Harvesting maps on the Web<\/a><\/li><li><a href=\"#arxp\">ARX and Phoebus: Information Extraction from Unstructured and Ungrammatical Text on Web<\/a><\/li><li><a href=\"#bsl\">BSL: A system for learning blocking schemes<\/a><\/li><li><a href=\"#eidos\">EIDOS: Efficiently Inducing Definitions for Online Sources<\/a><\/li><\/ul><\/header>\n\t<h2>Software: Karma<\/h2>\n<p>Karma is an information integration tool that enables users to quickly and easily integrate data from a variety of data sources including databases, spreadsheets, delimited text files, XML, JSON, KML and Web APIs.<\/p>\n<p><a href=\"https:\/\/github.com\/usc-isi-i2\/Web-Karma\" target=\"_blank\" rel=\"noopener\">Github<\/a><a href=\"https:\/\/usc-isi-i2.github.io\/karma\/\" target=\"_blank\" rel=\"noopener\">Project Page<\/a><\/p>\n\t<h2>MapFinder: Harvesting maps on the Web<\/h2>\n<p>Maps are one of the most valuable documents for gathering geospatial information about a region. We use a Content Based Image Retrieval (CBIR) technique to built an accurate and scalable system, MapFinder, that can discover standalone images as well as images embedded within documents on the Web that are maps. The implementation provided here has the capabilities of extracting WaterFilling features from images, and classifying a given image as a map or nonmap. We also provide the data collected by us for our experiments.<\/p>\n<p><a href=\"https:\/\/publications.isi.edu\/downloads\/integration\/mapfindercode.zip\" target=\"_blank\" rel=\"noopener\">Download Code<\/a><a href=\"https:\/\/publications.isi.edu\/downloads\/integration\/mapfinderdata.zip\" target=\"_blank\" rel=\"noopener\">Download Data (1.5 GB)<\/a> <a href=\"https:\/\/publications.isi.edu\/downloads\/integration\/goel10-ijdar.pdf\" target=\"_blank\" rel=\"noopener\">MapFinder Project Paper<\/a><\/p>\n\t<h2>ARX and Phoebus: Information Extraction from Unstructured and Ungrammatical Text on Web<\/h2>\n<p>The project presents two implementations for performing information extraction from unstructured, ungrammatical text on the Web such as classified ads, auction listings, and forum posting titles. The ARX system is an automatic approach to exploiting reference sets for this extraction. The Phoebus system presents a machine learning approach exploiting reference sets.<\/p>\n<p><a href=\"https:\/\/publications.isi.edu\/downloads\/integration\/ARXPhoebus.zip\" target=\"_blank\" rel=\"noopener\">Download<\/a><a href=\"https:\/\/publications.isi.edu\/downloads\/integration\/michelson07-ijdar.pdf\" target=\"_blank\" rel=\"noopener\">ARX Project Paper<\/a><a href=\"https:\/\/publications.isi.edu\/downloads\/integration\/michelson08-jair.pdf\" target=\"_blank\" rel=\"noopener\">Phoebus Project Paper<\/a><\/p>\n\t<h2>BSL: A system for learning blocking schemes<\/h2>\n<p>Record linkage is the problem of determining the matches between two data sources. However, as data sources become larger and larger, this task becomes difficult and expensive. To aid in this process, blocking is the efficient generation of candidate matches which can then be examined in detail later to determine whether or not they are true matches. So, blocking is a preprocessing step to make record linkage a more scalable process.The BSL system presented here does this in the supervised setting of record linkage. This means that given some training matches, it can discover rules (a blocking scheme) to efficiently generate candidate matches between the sets.<\/p>\n<p><a href=\"https:\/\/github.com\/usc-isi-i2\/bsl\" target=\"_blank\" rel=\"noopener\">Github<\/a><a href=\"https:\/\/publications.isi.edu\/downloads\/integration\/michelson06-aaai.pdf\" target=\"_blank\" rel=\"noopener\">Project Paper<\/a><\/p>\n\t<h2>EIDOS: Efficiently Inducing Definitions for Online Sources<\/h2>\n<p>Record linkage is the problem of determining the matches between two data sources. However, as data sources become larger and larger, this task becomes difficult and expensive. To aid in this process, blocking is the efficient generation of candidate matches which can then be examined in detail later to determine whether or not they are true matches. So, blocking is a preprocessing step to make record linkage a more scalable process.The BSL system presented here does this in the supervised setting of record linkage. This means that given some training matches, it can discover rules (a blocking scheme) to efficiently generate candidate matches between the sets.<\/p>\n<p><a href=\"https:\/\/github.com\/usc-isi-i2\/eidos\" target=\"_blank\" rel=\"noopener\">Github<\/a><a href=\"https:\/\/publications.isi.edu\/downloads\/integration\/carman07-jair.pdf\" target=\"_blank\" rel=\"noopener\">Project Paper\u00a0<\/a><\/p>\n\n","protected":false},"excerpt":{"rendered":"<p>Available for Download Software: Karma MapFinder: Harvesting maps on the Web ARX and Phoebus: Information Extraction from Unstructured and Ungrammatical Text on Web BSL: A system for learning blocking schemes EIDOS: Efficiently Inducing Definitions for Online Sources Software: Karma Karma is an information integration tool that enables users to quickly and easily integrate data from&hellip;<\/p>\n","protected":false},"author":421,"featured_media":0,"parent":388,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"news_source":"","news_author":"","external_news_link":"","footnotes":""},"class_list":["post-80","page","type-page","status-publish","hentry"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.2 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Downloads - Center on Knowledge Graphs<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.isi.edu\/centers-ckg\/resources\/downloads\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Downloads - Center on Knowledge Graphs\" \/>\n<meta property=\"og:description\" content=\"Available for Download Software: Karma MapFinder: Harvesting maps on the Web ARX and Phoebus: Information Extraction from Unstructured and Ungrammatical Text on Web BSL: A system for learning blocking schemes EIDOS: Efficiently Inducing Definitions for Online Sources Software: Karma Karma is an information integration tool that enables users to quickly and easily integrate data from&hellip;\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.isi.edu\/centers-ckg\/resources\/downloads\/\" \/>\n<meta property=\"og:site_name\" content=\"Center on Knowledge Graphs\" \/>\n<meta property=\"article:modified_time\" content=\"2024-08-09T16:21:48+00:00\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.isi.edu\/centers-ckg\/resources\/downloads\/\",\"url\":\"https:\/\/www.isi.edu\/centers-ckg\/resources\/downloads\/\",\"name\":\"Downloads - Center on Knowledge Graphs\",\"isPartOf\":{\"@id\":\"https:\/\/www.isi.edu\/centers-ckg\/#website\"},\"datePublished\":\"2021-10-15T17:58:48+00:00\",\"dateModified\":\"2024-08-09T16:21:48+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/www.isi.edu\/centers-ckg\/resources\/downloads\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.isi.edu\/centers-ckg\/resources\/downloads\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.isi.edu\/centers-ckg\/resources\/downloads\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.isi.edu\/centers-ckg\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Resources\",\"item\":\"https:\/\/www.isi.edu\/centers-ckg\/resources\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Downloads\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.isi.edu\/centers-ckg\/#website\",\"url\":\"https:\/\/www.isi.edu\/centers-ckg\/\",\"name\":\"Center on Knowledge Graphs\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.isi.edu\/centers-ckg\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Downloads - Center on Knowledge Graphs","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.isi.edu\/centers-ckg\/resources\/downloads\/","og_locale":"en_US","og_type":"article","og_title":"Downloads - Center on Knowledge Graphs","og_description":"Available for Download Software: Karma MapFinder: Harvesting maps on the Web ARX and Phoebus: Information Extraction from Unstructured and Ungrammatical Text on Web BSL: A system for learning blocking schemes EIDOS: Efficiently Inducing Definitions for Online Sources Software: Karma Karma is an information integration tool that enables users to quickly and easily integrate data from&hellip;","og_url":"https:\/\/www.isi.edu\/centers-ckg\/resources\/downloads\/","og_site_name":"Center on Knowledge Graphs","article_modified_time":"2024-08-09T16:21:48+00:00","twitter_card":"summary_large_image","twitter_misc":{"Est. reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.isi.edu\/centers-ckg\/resources\/downloads\/","url":"https:\/\/www.isi.edu\/centers-ckg\/resources\/downloads\/","name":"Downloads - Center on Knowledge Graphs","isPartOf":{"@id":"https:\/\/www.isi.edu\/centers-ckg\/#website"},"datePublished":"2021-10-15T17:58:48+00:00","dateModified":"2024-08-09T16:21:48+00:00","breadcrumb":{"@id":"https:\/\/www.isi.edu\/centers-ckg\/resources\/downloads\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.isi.edu\/centers-ckg\/resources\/downloads\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.isi.edu\/centers-ckg\/resources\/downloads\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.isi.edu\/centers-ckg\/"},{"@type":"ListItem","position":2,"name":"Resources","item":"https:\/\/www.isi.edu\/centers-ckg\/resources\/"},{"@type":"ListItem","position":3,"name":"Downloads"}]},{"@type":"WebSite","@id":"https:\/\/www.isi.edu\/centers-ckg\/#website","url":"https:\/\/www.isi.edu\/centers-ckg\/","name":"Center on Knowledge Graphs","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.isi.edu\/centers-ckg\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"}]}},"_links":{"self":[{"href":"https:\/\/www.isi.edu\/centers-ckg\/wp-json\/wp\/v2\/pages\/80","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.isi.edu\/centers-ckg\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.isi.edu\/centers-ckg\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.isi.edu\/centers-ckg\/wp-json\/wp\/v2\/users\/421"}],"replies":[{"embeddable":true,"href":"https:\/\/www.isi.edu\/centers-ckg\/wp-json\/wp\/v2\/comments?post=80"}],"version-history":[{"count":0,"href":"https:\/\/www.isi.edu\/centers-ckg\/wp-json\/wp\/v2\/pages\/80\/revisions"}],"up":[{"embeddable":true,"href":"https:\/\/www.isi.edu\/centers-ckg\/wp-json\/wp\/v2\/pages\/388"}],"wp:attachment":[{"href":"https:\/\/www.isi.edu\/centers-ckg\/wp-json\/wp\/v2\/media?parent=80"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}