TY - JOUR
T1 - Barriers and enabling factors for error analysis in NLG research
AU - Van Miltenburg, Emiel
AU - Clinciu, Miruna
AU - Dušek, Ondřej
AU - Gkatzia, Dimitra
AU - Inglis, Stephanie
AU - Leppänen, Leo
AU - Mahamood, Saad
AU - Schoch, Stephanie
AU - Thomson, Craig
AU - Wen, Luou
PY - 2023/2/21
Y1 - 2023/2/21
N2 - Earlier research has shown that few studies in Natural Language Generation (NLG) evaluate their system outputs using an error analysis, despite known limitations of automatic evaluation metrics and human ratings. This position paper takes the stance that error analyses should be encouraged, and discusses several ways to do so. The paper draws not only on our shared experience as authors but also on a survey that we distributed as a means of public consultation. We provide an overview of existing barriers to carrying out error analyses, and propose changes to improve error reporting in the NLG literature.
AB - Earlier research has shown that few studies in Natural Language Generation (NLG) evaluate their system outputs using an error analysis, despite known limitations of automatic evaluation metrics and human ratings. This position paper takes the stance that error analyses should be encouraged, and discusses several ways to do so. The paper draws not only on our shared experience as authors but also on a survey that we distributed as a means of public consultation. We provide an overview of existing barriers to carrying out error analyses, and propose changes to improve error reporting in the NLG literature.
U2 - 10.3384/nejlt.2000-1533.2023.4529
DO - 10.3384/nejlt.2000-1533.2023.4529
M3 - Article
VL - 9
JO - Northern European Journal of Language Technology
JF - Northern European Journal of Language Technology
SN - 2000-1533
IS - 1
ER -