Investigating the content and form of referring expressions in Mandarin: Introducing the mtuna corpus

Kees Van Deemter, Le Sun, Rint Sybesma, Xiao Li, Bo Chen, Muyun Yang

Research output: Chapter in Book/Report/Conference proceeding › Published conference contribution

6 Citations (Scopus)

Abstract

East Asian languages are thought to handle reference differently from English, particularly in terms of the marking of definiteness and number. We present the first Data-Text corpus for Referring Expressions in Mandarin, and we use this corpus to test some initial hypotheses inspired by the theoretical linguistics literature. Our findings suggest that function words deserve more attention in Referring Expression Generation than they have so far received, and they have a bearing on the debate about whether different languages make different trade-offs between clarity and brevity.

Original language: English
Title of host publication: INLG 2017 - 10th International Natural Language Generation Conference, Proceedings of the Conference
Publisher: Association for Computational Linguistics (ACL)
Pages: 213-217
Number of pages: 5
ISBN (Electronic): 9781945626524
Publication status: Published - Sept 2017
Event: 10th International Natural Language Generation Conference, INLG 2017 - Santiago de Compostela, Spain
Duration: 4 Sept 2017 to 7 Sept 2017

Conference

Conference: 10th International Natural Language Generation Conference, INLG 2017
Country/Territory: Spain
City: Santiago de Compostela
Period: 4/09/17 to 7/09/17

Bibliographical note

Funding Information:
This work is partly supported by the National Natural Science Foundation of China, Grant no. 61433015. We thank Stephen Matthews, University of Hong Kong, for comments, and Albert Gatt, University of Malta, for access to Dutch TUNA.

Publisher Copyright:
© 2017 The Association for Computational Linguistics.
