Publications
Retrieving complex tables with multi-granular graph representation learning
Abstract
The task of natural language table retrieval (NLTR) seeks to retrieve semantically relevant tables based on natural language queries. Existing learning systems for this task often treat tables as plain text based on the assumption that tables are structured as dataframes. However, tables can have complex layouts which indicate diverse dependencies between subtable structures, such as nested headers. As a result, queries may refer to different spans of relevant content that is distributed across these structures. Moreover, such systems fail to generalize to novel scenarios beyond those seen in the training set. Prior methods are still distant from a generalizable solution to the NLTR problem, as they fall short in handling complex table layouts or queries over multiple granularities. To address these issues, we propose Graph-based Table Retrieval (GTR), a generalizable NLTR framework with multi-granular graph …
- Date
- July 11, 2021
- Authors
- Fei Wang, Kexuan Sun, Muhao Chen, Jay Pujara, Pedro Szekely
- Book
- Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval
- Pages
- 1472-1482