Dimensions of incongruity in register humour

Chris Thomas Venour; Graeme D Ritchie; Christopher Stuart Mellish

Dimensions of incongruity in register humour

Chris Thomas Venour, Graeme D Ritchie, Christopher Stuart Mellish

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Abstract

Register-based humour consists of texts in which most of the language is in a particular style or tone, except for one or two words which are radically different in tone (or register) from the rest. It is not initially clear how to define register formally in terms of constructs, such as literariness, archaism, formality, etc. We have adopted a perspective in which words are located in a multi-dimensional space, and incongruity between words should correspond to a relatively large distance between those words, within this space. In order to construct this space in a way which shows up differences relevant to the question of register, we have based each dimension on a word’s frequency of occurrence in a particular corpus of texts. We have put together a number of corpora between which there are likely to be differences of tone/register, and for each word in a text we compute its frequency within every corpus. These numbers are then used to plot the word’s position in our abstract space. The most successful technique, both for building the space and for computing outliers, was tested on the task of distinguishing humorous texts from plain newspaper sentences, where it performed quite well.

Original language	English
Title of host publication	The Pragmatics of Humour across Discourse Domains
Editors	Marta Dynel
Publisher	John Benjamins Pub.
Pages	125-144
Number of pages	22
Volume	vi
Edition	2011
ISBN (Print)	9027256144 , 978-9027256140
Publication status	Published - 2011

Keywords

humour
computational humour
artificial intelligence
computers and literature

Cite this

@inbook{21d2255809c64642a9c32660c08b316b,

title = "Dimensions of incongruity in register humour",

abstract = "Register-based humour consists of texts in which most of the language is in a particular style or tone, except for one or two words which are radically different in tone (or register) from the rest. It is not initially clear how to define register formally in terms of constructs, such as literariness, archaism, formality, etc. We have adopted a perspective in which words are located in a multi-dimensional space, and incongruity between words should correspond to a relatively large distance between those words, within this space. In order to construct this space in a way which shows up differences relevant to the question of register, we have based each dimension on a word{\textquoteright}s frequency of occurrence in a particular corpus of texts. We have put together a number of corpora between which there are likely to be differences of tone/register, and for each word in a text we compute its frequency within every corpus. These numbers are then used to plot the word{\textquoteright}s position in our abstract space. The most successful technique, both for building the space and for computing outliers, was tested on the task of distinguishing humorous texts from plain newspaper sentences, where it performed quite well. ",

keywords = "humour, computational humour, artificial intelligence, computers and literature",

author = "Venour, {Chris Thomas} and Ritchie, {Graeme D} and Mellish, {Christopher Stuart}",

year = "2011",

language = "English",

isbn = "9027256144 ",

volume = "vi",

pages = "125--144",

editor = "Marta Dynel",

booktitle = "The Pragmatics of Humour across Discourse Domains",

publisher = "John Benjamins Pub.",

edition = "2011",

}

TY - CHAP

T1 - Dimensions of incongruity in register humour

AU - Venour, Chris Thomas

AU - Ritchie, Graeme D

AU - Mellish, Christopher Stuart

PY - 2011

Y1 - 2011

N2 - Register-based humour consists of texts in which most of the language is in a particular style or tone, except for one or two words which are radically different in tone (or register) from the rest. It is not initially clear how to define register formally in terms of constructs, such as literariness, archaism, formality, etc. We have adopted a perspective in which words are located in a multi-dimensional space, and incongruity between words should correspond to a relatively large distance between those words, within this space. In order to construct this space in a way which shows up differences relevant to the question of register, we have based each dimension on a word’s frequency of occurrence in a particular corpus of texts. We have put together a number of corpora between which there are likely to be differences of tone/register, and for each word in a text we compute its frequency within every corpus. These numbers are then used to plot the word’s position in our abstract space. The most successful technique, both for building the space and for computing outliers, was tested on the task of distinguishing humorous texts from plain newspaper sentences, where it performed quite well.

AB - Register-based humour consists of texts in which most of the language is in a particular style or tone, except for one or two words which are radically different in tone (or register) from the rest. It is not initially clear how to define register formally in terms of constructs, such as literariness, archaism, formality, etc. We have adopted a perspective in which words are located in a multi-dimensional space, and incongruity between words should correspond to a relatively large distance between those words, within this space. In order to construct this space in a way which shows up differences relevant to the question of register, we have based each dimension on a word’s frequency of occurrence in a particular corpus of texts. We have put together a number of corpora between which there are likely to be differences of tone/register, and for each word in a text we compute its frequency within every corpus. These numbers are then used to plot the word’s position in our abstract space. The most successful technique, both for building the space and for computing outliers, was tested on the task of distinguishing humorous texts from plain newspaper sentences, where it performed quite well.

KW - humour

KW - computational humour

KW - artificial intelligence

KW - computers and literature

M3 - Chapter

SN - 9027256144

SN - 978-9027256140

VL - vi

SP - 125

EP - 144

BT - The Pragmatics of Humour across Discourse Domains

A2 - Dynel, Marta

PB - John Benjamins Pub.

ER -