Anaphoric Phenomena in Situated dialog: A First Round of Annotations

Loáiciga, Sharid and Dobnik, Simon and Schlangen, David

We present a first release of 500 documents from the multimodal corpus Tell-me-more (Ilinykh et al., 2019) annotated with coreference information according to the ARRAU guidelines (Poesio et al., 2021). The corpus consists of images and short texts of five sentences. We describe the annotation process and present the adaptations to the original guidelines in order to account for the challenges of grounding the annotations to the image. 50 documents from the 500 available are annotated by two people and used to estimate inter-annotator agreement (IAA) relying on Krippendorff’s alpha.

In Proceedings of the Fifth Workshop on Computational Models of Reference, Anaphora and Coreference , 2022
[PDF]
@inproceedings{Loaiciga-2022-1,
  title = {Anaphoric Phenomena in Situated dialog: A First Round of Annotations},
  author = {Lo{\'a}iciga, Sharid and Dobnik, Simon and Schlangen, David},
  booktitle = {Proceedings of the Fifth Workshop on Computational Models of Reference, Anaphora and Coreference},
  month = oct,
  year = {2022},
  address = {Gyeongju, Republic of Korea},
  publisher = {Association for Computational Linguistics},
  url = {https://aclanthology.org/2022.crac-1.4},
  pages = {31--37},
  topics = {},
  domains = {},
  approach = {},
  project = {}
}