Abstract
We present a study of a dataset of tables from biomedical research publications. Our aim is to identify characteristics of biomedical tables that pose challenges for the task of extracting information from tables, and to determine which parts of research papers typically contain information that is useful for this task. Our results indicate that biomedical tables are hard to interpret without their source papers due to the brevity of the entries in the tables. In many cases, unstructured text segments, such as table titles, footnotes and non-table prose discussing a table, are required to interpret the table's entries.
Original language | English |
---|---|
Title of host publication | Australasian Language Technology Association Workshop 2014 - Proceedings of the Workshop (ALTA) |
Editors | Gabriela Ferraro, Stephen Wan |
Place of Publication | Stroudsburg PA USA |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 118-122 |
Number of pages | 5 |
Publication status | Published - 2014 |
Event | Australasian Language Technology Association Workshop 2014 - RMIT, Melbourne, Australia Duration: 26 Nov 2014 → 28 Nov 2014 Conference number: 12th https://www.aclweb.org/anthology/events/alta-2014/ (Proceedings) |
Conference
Conference | Australasian Language Technology Association Workshop 2014 |
---|---|
Abbreviated title | ALTAW 2014 |
Country/Territory | Australia |
City | Melbourne |
Period | 26/11/14 → 28/11/14 |
Other | ALTA 2014 will be held in conjuction with the 19th Australasian Document Computing Symposium 2014 (ADCS 2014). |
Internet address |
|