Interlingual annotation of multilingual text corpora and FrameNet

D. Farwell, B. Dorr, R. Green, N. Habash, S. Helmreich, E. Hovy, L. Levin, K. Miller, T. Mitamura, O. Rambow, F. Reeder, A. Siddharthan

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

This article raises an issue of common interest to those interested in Interlinguas and interlingual MT as well as to those interested in developing a multilingual FrameNet. Specifically, it addresses the problem of teasing apart the difference between meaning and interpretation, between semantics and pragmatics and between semantic representation and the representation of information conveyed. No translation (nor paraphrase) conveys the exactly same information as the original utterance. Rather, additional information may be conveyed and information may be lost, or information originally expressed explicitly may be conveyed implicitly and vice versa. The semantic representation of an utterance (the result of integrating the semantic representations of its subcomponents) does not capture what people intuitively feel is the meaning of that utterance. Instead, various pragmatic factors must be taken into account, including the time and place of utterance and the speaker's motivation for uttering something. The focus of the discussion here is on describing the IAMTC project2 (Interlingual Annotation of Multilingual Text Corpora), a multi-site NSF-supported project to annotate six sizable bilingual parallel corpora for interlingual content.
Original languageEnglish
Title of host publicationMultilingual FrameNets in Computational Lexicography
Subtitle of host publicationMethods and Applications
EditorsH.C. Boas
Place of PublicationBerlin
PublisherMoutin de Gruyter
Pages287-318
Number of pages32
ISBN (Print)311021296X, 978-3110212969
Publication statusPublished - 1 Jul 2009

Fingerprint

Dive into the research topics of 'Interlingual annotation of multilingual text corpora and FrameNet'. Together they form a unique fingerprint.

Cite this