Publications
A Common Framework for Developing Table Understanding Models.
Abstract
A wealth of knowledge is contained in tabular data, and there are a vast number of efforts to model and capture this knowledge. Unfortunately, these efforts have disparate inputs, outputs, and goals hindering research progress and making table understanding tools difficult to use in practice. In this paper, we propose a table understanding framework that formalizes the problem of understanding tabular data into three distinct subtasks: cell classification, block detection, and relation prediction. We introduce a common API for table understanding systems that supports a host of existing approaches and allows easy development of new approaches. Our framework supports approaches that range from heuristic rules to probabilistic models, allows outputs that span simple, correlational tuples to sophisticated, semantic knowledge graphs, and provides tools for visualizing model outputs and transforming complex tabular data into flattened relational dataframes.
- Date
- October 10, 2025
- Authors
- Jay Pujara, Arunkumar Rajendran, Majid Ghasemi-Gol, Pedro A Szekely
- Conference
- ISWC Satellites
- Pages
- 133-136