Projects per year
Abstract
Narrative modelling is an area of active research, motivated by the acknowledgement of narratives as drivers of societal decision making. These research efforts conceptualize narratives as connected entity chains, and modeling typically focuses on the identification of entities and their connections within a text. An emerging approach to narrative modelling is the use of semantic role labeling (SRL) to extract Entity-Verb-Entity (E-V-Es) tuples from a text, followed by dimensionality reduction to reduce the space of entities and connections separately. This process penalises the semantic richness of narratives and discards much contextual information along the way. Here, we propose an alternate narrative extraction approach - CANarEx, incorporating a pipeline of common contextual constructs through co-reference resolution, micro-narrative generation and clustering of these narratives through sentence embeddings. We evaluate our approach through testing the recovery of “narrative time-series clusters”, mimicking a desirable text-as-data task. The evaluation framework leverages synthetic data generated using a GPT-3 model. The GPT-3 model is trained to generate similar sentences using a large dataset of news articles. The synthetic data maps to three topics in the news dataset. We then generate narrative time-series document cluster representations by mapping the synthetic data to three distinct signals synthetically injected into the testing corpus. Evaluation results demonstrate the superior ability of CANarEx to recover narrative time-series through reduced MSE and improved precision/recall relative to existing methods. The validity is further reinforced through ablation studies and qualitative analysis.
Original language | English |
---|---|
Title of host publication | Findings of the Association for Computational Linguistics |
Subtitle of host publication | EMNLP 2022 |
Editors | Yoav Goldberg, Zornitsa Kozareva, Yue Zhang |
Place of Publication | Abu Dhabi United Arab Emirates |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 3551-3564 |
Number of pages | 14 |
Publication status | Published - 2022 |
Projects
- 1 Finished
-
Understanding policy, media, and academic narratives around cycles of disadvantage in Australia
Faulkner, N. (Chief Investigator (CI)), Dwyer, T. (Primary Chief Investigator (PCI)), Smith, L. (Primary Chief Investigator (PCI)), Bragge, P. (Chief Investigator (CI)), Angus, S. (Chief Investigator (CI)), Raschky, P. (Chief Investigator (CI)), Buntine, W. (Chief Investigator (CI)), Webb, G. (Chief Investigator (CI)), Batstone, J. (Chief Investigator (CI)) & Goodwin, S. (Chief Investigator (CI))
11/01/21 → 26/02/22
Project: Research