Interlingual annotation of multilingual text corpora and FrameNet

D. Farwell, B. Dorr, R. Green, N. Habash, S. Helmreich, E. Hovy, L. Levin, K. Miller, T. Mitamura, O. Rambow, F. Reeder, A. Siddharthan

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

This article raises an issue of common interest to those interested in Interlinguas and interlingual MT as well as to those interested in developing a multilingual FrameNet. Specifically, it addresses the problem of teasing apart the difference between meaning and interpretation, between semantics and pragmatics and between semantic representation and the representation of information conveyed. No translation (nor paraphrase) conveys the exactly same information as the original utterance. Rather, additional information may be conveyed and information may be lost, or information originally expressed explicitly may be conveyed implicitly and vice versa. The semantic representation of an utterance (the result of integrating the semantic representations of its subcomponents) does not capture what people intuitively feel is the meaning of that utterance. Instead, various pragmatic factors must be taken into account, including the time and place of utterance and the speaker's motivation for uttering something. The focus of the discussion here is on describing the IAMTC project2 (Interlingual Annotation of Multilingual Text Corpora), a multi-site NSF-supported project to annotate six sizable bilingual parallel corpora for interlingual content.
Original languageEnglish
Title of host publicationMultilingual FrameNets in Computational Lexicography
Subtitle of host publicationMethods and Applications
EditorsH.C. Boas
Place of PublicationBerlin
PublisherMoutin de Gruyter
Pages287-318
Number of pages32
ISBN (Print)311021296X, 978-3110212969
Publication statusPublished - 1 Jul 2009

Fingerprint

Text Corpus
Annotation
Utterance
Semantic Representation
Paraphrase
Teasing
Parallel Corpora

Cite this

Farwell, D., Dorr, B., Green, R., Habash, N., Helmreich, S., Hovy, E., ... Siddharthan, A. (2009). Interlingual annotation of multilingual text corpora and FrameNet. In H. C. Boas (Ed.), Multilingual FrameNets in Computational Lexicography: Methods and Applications (pp. 287-318). Berlin: Moutin de Gruyter.

Interlingual annotation of multilingual text corpora and FrameNet. / Farwell, D.; Dorr, B.; Green, R.; Habash, N.; Helmreich, S.; Hovy, E.; Levin, L.; Miller, K.; Mitamura, T.; Rambow, O.; Reeder, F.; Siddharthan, A.

Multilingual FrameNets in Computational Lexicography: Methods and Applications. ed. / H.C. Boas. Berlin : Moutin de Gruyter, 2009. p. 287-318.

Research output: Chapter in Book/Report/Conference proceedingChapter

Farwell, D, Dorr, B, Green, R, Habash, N, Helmreich, S, Hovy, E, Levin, L, Miller, K, Mitamura, T, Rambow, O, Reeder, F & Siddharthan, A 2009, Interlingual annotation of multilingual text corpora and FrameNet. in HC Boas (ed.), Multilingual FrameNets in Computational Lexicography: Methods and Applications. Moutin de Gruyter, Berlin, pp. 287-318.
Farwell D, Dorr B, Green R, Habash N, Helmreich S, Hovy E et al. Interlingual annotation of multilingual text corpora and FrameNet. In Boas HC, editor, Multilingual FrameNets in Computational Lexicography: Methods and Applications. Berlin: Moutin de Gruyter. 2009. p. 287-318
Farwell, D. ; Dorr, B. ; Green, R. ; Habash, N. ; Helmreich, S. ; Hovy, E. ; Levin, L. ; Miller, K. ; Mitamura, T. ; Rambow, O. ; Reeder, F. ; Siddharthan, A. / Interlingual annotation of multilingual text corpora and FrameNet. Multilingual FrameNets in Computational Lexicography: Methods and Applications. editor / H.C. Boas. Berlin : Moutin de Gruyter, 2009. pp. 287-318
@inbook{1bba073a8d2645bfb5ea3e929c1b5365,
title = "Interlingual annotation of multilingual text corpora and FrameNet",
abstract = "This article raises an issue of common interest to those interested in Interlinguas and interlingual MT as well as to those interested in developing a multilingual FrameNet. Specifically, it addresses the problem of teasing apart the difference between meaning and interpretation, between semantics and pragmatics and between semantic representation and the representation of information conveyed. No translation (nor paraphrase) conveys the exactly same information as the original utterance. Rather, additional information may be conveyed and information may be lost, or information originally expressed explicitly may be conveyed implicitly and vice versa. The semantic representation of an utterance (the result of integrating the semantic representations of its subcomponents) does not capture what people intuitively feel is the meaning of that utterance. Instead, various pragmatic factors must be taken into account, including the time and place of utterance and the speaker's motivation for uttering something. The focus of the discussion here is on describing the IAMTC project2 (Interlingual Annotation of Multilingual Text Corpora), a multi-site NSF-supported project to annotate six sizable bilingual parallel corpora for interlingual content.",
author = "D. Farwell and B. Dorr and R. Green and N. Habash and S. Helmreich and E. Hovy and L. Levin and K. Miller and T. Mitamura and O. Rambow and F. Reeder and A. Siddharthan",
year = "2009",
month = "7",
day = "1",
language = "English",
isbn = "311021296X",
pages = "287--318",
editor = "H.C. Boas",
booktitle = "Multilingual FrameNets in Computational Lexicography",
publisher = "Moutin de Gruyter",

}

TY - CHAP

T1 - Interlingual annotation of multilingual text corpora and FrameNet

AU - Farwell, D.

AU - Dorr, B.

AU - Green, R.

AU - Habash, N.

AU - Helmreich, S.

AU - Hovy, E.

AU - Levin, L.

AU - Miller, K.

AU - Mitamura, T.

AU - Rambow, O.

AU - Reeder, F.

AU - Siddharthan, A.

PY - 2009/7/1

Y1 - 2009/7/1

N2 - This article raises an issue of common interest to those interested in Interlinguas and interlingual MT as well as to those interested in developing a multilingual FrameNet. Specifically, it addresses the problem of teasing apart the difference between meaning and interpretation, between semantics and pragmatics and between semantic representation and the representation of information conveyed. No translation (nor paraphrase) conveys the exactly same information as the original utterance. Rather, additional information may be conveyed and information may be lost, or information originally expressed explicitly may be conveyed implicitly and vice versa. The semantic representation of an utterance (the result of integrating the semantic representations of its subcomponents) does not capture what people intuitively feel is the meaning of that utterance. Instead, various pragmatic factors must be taken into account, including the time and place of utterance and the speaker's motivation for uttering something. The focus of the discussion here is on describing the IAMTC project2 (Interlingual Annotation of Multilingual Text Corpora), a multi-site NSF-supported project to annotate six sizable bilingual parallel corpora for interlingual content.

AB - This article raises an issue of common interest to those interested in Interlinguas and interlingual MT as well as to those interested in developing a multilingual FrameNet. Specifically, it addresses the problem of teasing apart the difference between meaning and interpretation, between semantics and pragmatics and between semantic representation and the representation of information conveyed. No translation (nor paraphrase) conveys the exactly same information as the original utterance. Rather, additional information may be conveyed and information may be lost, or information originally expressed explicitly may be conveyed implicitly and vice versa. The semantic representation of an utterance (the result of integrating the semantic representations of its subcomponents) does not capture what people intuitively feel is the meaning of that utterance. Instead, various pragmatic factors must be taken into account, including the time and place of utterance and the speaker's motivation for uttering something. The focus of the discussion here is on describing the IAMTC project2 (Interlingual Annotation of Multilingual Text Corpora), a multi-site NSF-supported project to annotate six sizable bilingual parallel corpora for interlingual content.

M3 - Chapter

SN - 311021296X

SN - 978-3110212969

SP - 287

EP - 318

BT - Multilingual FrameNets in Computational Lexicography

A2 - Boas, H.C.

PB - Moutin de Gruyter

CY - Berlin

ER -