Instance-Based Natural Language Generation

S. Varges; C. Mellish

doi:10.1017/S1351324910000069

Instance-Based Natural Language Generation

S. Varges, C. Mellish

Computing Science

Research output: Contribution to journal › Article › peer-review

8 Citations (Scopus)

Abstract

We investigate the use of instance-based ranking methods for surface realization in natural language generation. Our approach to instance-based natural language generation (IBNLG) employs two components: a rule system that ‘overgenerates’ a number of realization candidates from a meaning representation and an instance-based ranker that scores the candidates according to their similarity to examples taken from a training corpus. We develop an efficient search technique for identifying the optimal candidate based on a novel extension of the A* algorithm. The rule system is produced automatically from a semantically annotated fragment of the Penn Treebank II containing management succession texts. We detail the annotation scheme and grammar induction algorithm and evaluate the efficiency and output of the generator. We also discuss issues such as input coverage (completeness) and fluency that are relevant to surface generation in general.

Original language	English
Pages (from-to)	309-346
Number of pages	37
Journal	Natural Language Engineering
Volume	16
Issue number	3
Early online date	12 May 2010
DOIs	https://doi.org/10.1017/S1351324910000069
Publication status	Published - Jul 2010

Access to Document

10.1017/S1351324910000069

Cite this

@article{45c4c40a74864405b17c63b61b31c9e3,

title = "Instance-Based Natural Language Generation",

abstract = "We investigate the use of instance-based ranking methods for surface realization in natural language generation. Our approach to instance-based natural language generation (IBNLG) employs two components: a rule system that {\textquoteleft}overgenerates{\textquoteright} a number of realization candidates from a meaning representation and an instance-based ranker that scores the candidates according to their similarity to examples taken from a training corpus. We develop an efficient search technique for identifying the optimal candidate based on a novel extension of the A* algorithm. The rule system is produced automatically from a semantically annotated fragment of the Penn Treebank II containing management succession texts. We detail the annotation scheme and grammar induction algorithm and evaluate the efficiency and output of the generator. We also discuss issues such as input coverage (completeness) and fluency that are relevant to surface generation in general.",

author = "S. Varges and C. Mellish",

year = "2010",

month = jul,

doi = "10.1017/S1351324910000069",

language = "English",

volume = "16",

pages = "309--346",

journal = "Natural Language Engineering",

issn = "1351-3249",

publisher = "Cambridge University Press",

number = "3",

}

TY - JOUR

T1 - Instance-Based Natural Language Generation

AU - Varges, S.

AU - Mellish, C.

PY - 2010/7

Y1 - 2010/7

N2 - We investigate the use of instance-based ranking methods for surface realization in natural language generation. Our approach to instance-based natural language generation (IBNLG) employs two components: a rule system that ‘overgenerates’ a number of realization candidates from a meaning representation and an instance-based ranker that scores the candidates according to their similarity to examples taken from a training corpus. We develop an efficient search technique for identifying the optimal candidate based on a novel extension of the A* algorithm. The rule system is produced automatically from a semantically annotated fragment of the Penn Treebank II containing management succession texts. We detail the annotation scheme and grammar induction algorithm and evaluate the efficiency and output of the generator. We also discuss issues such as input coverage (completeness) and fluency that are relevant to surface generation in general.

AB - We investigate the use of instance-based ranking methods for surface realization in natural language generation. Our approach to instance-based natural language generation (IBNLG) employs two components: a rule system that ‘overgenerates’ a number of realization candidates from a meaning representation and an instance-based ranker that scores the candidates according to their similarity to examples taken from a training corpus. We develop an efficient search technique for identifying the optimal candidate based on a novel extension of the A* algorithm. The rule system is produced automatically from a semantically annotated fragment of the Penn Treebank II containing management succession texts. We detail the annotation scheme and grammar induction algorithm and evaluate the efficiency and output of the generator. We also discuss issues such as input coverage (completeness) and fluency that are relevant to surface generation in general.

U2 - 10.1017/S1351324910000069

DO - 10.1017/S1351324910000069

M3 - Article

SN - 1351-3249

VL - 16

SP - 309

EP - 346

JO - Natural Language Engineering

JF - Natural Language Engineering

IS - 3

ER -

Instance-Based Natural Language Generation

Abstract

Access to Document

Fingerprint

Cite this