Publications

Sorted neighborhood for the semantic web

Abstract

Entity Resolution (ER) concerns identifying logically equivalent entity pairs across databases. To avoid quadratic pairwise comparisons of entities, blocking methods are used. Sorted Neighborhood is an established blocking method for relational databases. It has not been applied on graph-based data models such as the Resource Description Framework (RDF). This poster presents a modular workflow for applying Sorted Neighborhood to RDF. Real-world evaluations demonstrate the workflow's utility against a popular baseline.

Date
2015
Authors
Mayank Kejriwal
Journal
Proceedings of the AAAI Conference on Artificial Intelligence
Volume
29
Issue
1