Abstract
This paper describes a multi-site project to annotate six sizable bilingual parallel corpora for interlingual content. After presenting the background and objectives of the effort, we describe the data set being annotated, the interlingua representation language used, an interface environment that supports the annotation task, and the annotation process itself. We then present a preliminary version of our evaluation methodology and conclude with a summary of the project's current status and a number of issues that have arisen.
Original language | English |
---|---|
Pages (from-to) | 197-243 |
Number of pages | 47 |
Journal | Natural Language Engineering |
Volume | 16 |
Issue number | 3 |
Early online date | 15 Jun 2010 |
DOIs | |
Publication status | Published - Jul 2010 |