Whose idea was this? Deciding attribution in scientific literature

Advaith Siddharthan, Simone Teufel

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

For a variety of discourse-level analyses and tasks performed on scientific literature, it is necessary to identify which (if any) cited paper the discourse entities in focus are attributable to. In this paper we introduce a scientific attribution task that aims to associate a range of linguistic expressions such as definite descriptions, pronouns and “work” nouns with specific cited papers. We report human agreement of Krippendorff’s Alpha greater than 0.8 on our scientific attribution task, based on written guidelines with ten rules for common systematic problem cases. The high alpha suggests that our task is well defined and fairly intuitive to annotators. Our machine learning approach achieves Krippendorff’s Alpha of 0.67 and percentage agreement of 85% with a manually constructed gold standard, suggesting that the task is simpler than traditional anaphora resolution tasks.
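The abstract reports both Krippendorff's Alpha and percentage agreement, and the two can diverge because Alpha corrects for agreement expected by chance. The following is a minimal sketch of both measures, assuming two annotators, nominal labels, and no missing annotations. The function names and the toy labels ("P1", "P2", "none" standing for two cited papers and no attribution) are hypothetical illustrations, not the authors' implementation or data.

```python
from collections import Counter
from itertools import permutations


def percentage_agreement(coder_a, coder_b):
    """Fraction of units on which the two coders chose the same label."""
    if len(coder_a) != len(coder_b) or not coder_a:
        raise ValueError("both coders must label the same non-empty set of units")
    return sum(x == y for x, y in zip(coder_a, coder_b)) / len(coder_a)


def krippendorff_alpha_nominal(coder_a, coder_b):
    """Krippendorff's alpha for two coders, nominal labels, no missing data."""
    if len(coder_a) != len(coder_b) or not coder_a:
        raise ValueError("both coders must label the same non-empty set of units")

    # Coincidence matrix: each unit contributes the ordered pairs (a, b) and (b, a).
    coincidences = Counter()
    for x, y in zip(coder_a, coder_b):
        coincidences[(x, y)] += 1
        coincidences[(y, x)] += 1

    # Marginal totals per label and the total number of pairable values.
    totals = Counter()
    for (label, _), count in coincidences.items():
        totals[label] += count
    n = sum(totals.values())

    # Observed and chance-expected disagreement (same scale, so the ratio is D_o / D_e).
    observed = sum(count for (c, k), count in coincidences.items() if c != k)
    expected = sum(totals[c] * totals[k] for c, k in permutations(totals, 2)) / (n - 1)
    if expected == 0:
        raise ValueError("alpha is undefined when only one label occurs")
    return 1.0 - observed / expected


# Hypothetical toy annotations: which cited paper each expression is attributed to.
coder_a = ["P1", "P1", "P2", "P2", "none", "none", "P1", "P2", "none", "P1"]
coder_b = ["P1", "P1", "P2", "none", "none", "none", "P1", "P2", "P2", "P1"]

print(f"percentage agreement: {percentage_agreement(coder_a, coder_b):.2f}")  # 0.80
print(f"Krippendorff's alpha: {krippendorff_alpha_nominal(coder_a, coder_b):.2f}")  # 0.71
```

In this toy case the coders agree on 8 of 10 items, yet Alpha is lower than the raw agreement, which mirrors the gap between the 85% and 0.67 figures in the abstract without reproducing them.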
Original language: English
Title of host publication: Proceedings of the 6th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC'07)
Publication status: Published - 2007
Event: 6th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC'07) - Lagos, Portugal
Duration: 29 Mar 2007 to 30 Mar 2007

Conference

Conference: 6th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC'07)
Country: Portugal
City: Lagos
Period: 29/03/07 to 30/03/07

Fingerprint

Linguistics
Learning systems

Cite this

Siddharthan, A., & Teufel, S. (2007). Whose idea was this? Deciding attribution in scientific literature. In Proceedings of the 6th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC'07)

Whose idea was this? Deciding attribution in scientific literature. / Siddharthan, Advaith; Teufel, Simone.

Proceedings of the 6th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC'07). 2007.

Siddharthan, A & Teufel, S 2007, Whose idea was this? Deciding attribution in scientific literature. in Proceedings of the 6th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC'07). 6th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC'07), Lagos, Portugal, 29/03/07.
Siddharthan A, Teufel S. Whose idea was this? Deciding attribution in scientific literature. In Proceedings of the 6th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC'07). 2007
Siddharthan, Advaith ; Teufel, Simone. / Whose idea was this? Deciding attribution in scientific literature. Proceedings of the 6th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC'07). 2007.
@inproceedings{640dc34f11b3463cb6fca5788f577168,
title = "Whose idea was this? Deciding attribution in scientific literature",
abstract = "For a variety of discourse-level analyses and tasks performed on scientific literature, it is necessary to identify which (if any) cited paper the discourse entities in focus are attributable to. In this paper we introduce a scientific attribution task that aims to associate a range of linguistic expressions such as definite descriptions, pronouns and “work” nouns with specific cited papers. We report human agreement of Krippendorff’s Alpha greater than 0.8 on our scientific attribution task, based on written guidelines with ten rules for common systematic problem cases. The high alpha suggests that our task is well defined and fairly intuitive to annotators. Our machine learning approach achieves Krippendorff’s Alpha of 0.67 and percentage agreement of 85{\%} with a manually constructed gold standard, suggesting that the task is simpler than traditional anaphora resolution tasks.",
author = "Advaith Siddharthan and Simone Teufel",
year = "2007",
language = "English",
booktitle = "Proceedings of the 6th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC'07)",

}

TY - GEN

T1 - Whose idea was this? Deciding attribution in scientific literature

AU - Siddharthan, Advaith

AU - Teufel, Simone

PY - 2007

Y1 - 2007

N2 - For a variety of discourse-level analyses and tasks performed on scientific literature, it is necessary to identify which (if any) cited paper the discourse entities in focus are attributable to. In this paper we introduce a scientific attribution task that aims to associate a range of linguistic expressions such as definite descriptions, pronouns and “work” nouns with specific cited papers. We report human agreement of Krippendorff’s Alpha greater than 0.8 on our scientific attribution task, based on written guidelines with ten rules for common systematic problem cases. The high alpha suggests that our task is well defined and fairly intuitive to annotators. Our machine learning approach achieves Krippendorff’s Alpha of 0.67 and percentage agreement of 85% with a manually constructed gold standard, suggesting that the task is simpler than traditional anaphora resolution tasks.

M3 - Conference contribution

BT - Proceedings of the 6th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC'07)

ER -