Towards Generating Automatic Anaphora Annotations
By: Dima Taji, Daniel Zeman
Potential Business Impact:
Teaches computers to figure out what words like "she" or "it" refer to.
Training models that perform well on various NLP tasks requires large amounts of data, and this becomes even more apparent with nuanced tasks such as anaphora and coreference resolution. To combat the prohibitive cost of creating manually gold-annotated data, this paper explores two methods to automatically create datasets with coreferential annotations: direct conversion from existing datasets, and parsing using multilingual models capable of handling new and unseen languages. The paper details the current progress on both fronts, the challenges these efforts currently face, and our approach to overcoming them.
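To make the task concrete, here is a minimal, illustrative sketch of what coreference annotations typically look like as data (this is a generic representation, not the paper's actual conversion pipeline or format): mentions are token spans, and a cluster groups mentions that refer to the same entity, which an anaphoric pronoun can then be resolved against.

```python
# Illustrative sketch of coreference annotations (assumed generic
# representation, not the paper's format): mention spans grouped
# into clusters of expressions referring to the same entity.

tokens = ["Mary", "said", "she", "would", "come", "."]

# Each mention is an inclusive (start, end) token span;
# a cluster lists mentions that corefer.
clusters = [
    [(0, 0), (2, 2)],  # "Mary" and "she" refer to the same person
]

def mention_text(tokens, span):
    """Return the surface form of a mention span."""
    start, end = span
    return " ".join(tokens[start:end + 1])

# Resolve every mention in a cluster to its first (antecedent) mention.
resolved = {
    mention_text(tokens, mention): mention_text(tokens, cluster[0])
    for cluster in clusters
    for mention in cluster
}
```

Automatic dataset creation amounts to producing such span-and-cluster annotations at scale, either by converting annotations that already exist in another scheme or by predicting them with a multilingual model.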
Similar Papers
Enhancing Coreference Resolution with Pretrained Language Models: Bridging the Gap Between Syntax and Semantics
Computation and Language
Helps computers understand who or what "they" refers to.
Coreference Resolution for Vietnamese Narrative Texts
Computation and Language
Helps computers understand Vietnamese stories better.
Large Language Models as Span Annotators
Computation and Language
Computers can now find and label text parts.