Presenting the Bangor Autoglosser and the Bangor Automated Clause-Splitter

D. M. Carter* (Corresponding Author), M. Broersma, K. Donnelly, A. Konopka

*Corresponding author for this work

Research output: Contribution to journal (Article)

1 Citation (Scopus)

Abstract

Until recently, corpus studies of natural bilingual speech and, more specifically, codeswitching in bilingual speech have used a manual method of glossing, part-of-speech tagging, and clause-splitting to prepare the data for analysis. In our article, we present innovative tools developed for the first large-scale corpus study of codeswitching triggered by cognates. A study of this size was only possible due to the automation of several steps, such as morpheme-by-morpheme glossing, splitting complex clauses into simple clauses, and the analysis of internal and external codeswitching through the use of database tables, algorithms, and a scripting language.

Original language: English
Pages (from-to): 21-28
Number of pages: 8
Journal: Digital Scholarship in the Humanities
Volume: 33
Issue number: 1
Early online date: 27 Feb 2017
DOI: 10.1093/llc/fqw065
Publication status: Published - Apr 2018

Fingerprint

Automation
Language
Clause
Code-switching
Corpus Study
Morpheme
Tagging
Data Base
Scripting
Cognates

ASJC Scopus subject areas

  • Information Systems
  • Language and Linguistics
  • Linguistics and Language
  • Computer Science Applications

Cite this

Presenting the Bangor Autoglosser and the Bangor Automated Clause-Splitter. / Carter, D. M. (Corresponding Author); Broersma, M.; Donnelly, K.; Konopka, A.

In: Digital Scholarship in the Humanities, Vol. 33, No. 1, 04.2018, p. 21-28.

Carter, D. M. ; Broersma, M. ; Donnelly, K. ; Konopka, A. / Presenting the Bangor Autoglosser and the Bangor Automated Clause-Splitter. In: Digital Scholarship in the Humanities. 2018 ; Vol. 33, No. 1. pp. 21-28.
@article{66323c8d187f4f92bca54a950057b469,
title = "Presenting the Bangor Autoglosser and the Bangor Automated Clause-Splitter",
abstract = "Until recently, corpus studies of natural bilingual speech and, more specifically, codeswitching in bilingual speech have used a manual method of glossing, part-of-speech tagging, and clause-splitting to prepare the data for analysis. In our article, we present innovative tools developed for the first large-scale corpus study of codeswitching triggered by cognates. A study of this size was only possible due to the automation of several steps, such as morpheme-by-morpheme glossing, splitting complex clauses into simple clauses, and the analysis of internal and external codeswitching through the use of database tables, algorithms, and a scripting language.",
author = "Carter, {D. M.} and M. Broersma and K. Donnelly and A. Konopka",
note = "This work was supported by a Small Research Grant from the British Academy awarded to the first and second authors (grant number 101421). We also gratefully acknowledge the support of the Max Planck Institute for Psycholinguistics, the Centre for Research on Bilingualism in Wales, and the University of Calgary.",
year = "2018",
month = "4",
doi = "10.1093/llc/fqw065",
language = "English",
volume = "33",
pages = "21--28",
journal = "Digital Scholarship in the Humanities",
issn = "2055-7671",
publisher = "Oxford University Press",
number = "1",

}

TY - JOUR
T1 - Presenting the Bangor Autoglosser and the Bangor Automated Clause-Splitter
AU - Carter, D. M.
AU - Broersma, M.
AU - Donnelly, K.
AU - Konopka, A.
N1 - This work was supported by a Small Research Grant from the British Academy awarded to the first and second authors (grant number 101421). We also gratefully acknowledge the support of the Max Planck Institute for Psycholinguistics, the Centre for Research on Bilingualism in Wales, and the University of Calgary.
PY - 2018/4
Y1 - 2018/4
AB - Until recently, corpus studies of natural bilingual speech and, more specifically, codeswitching in bilingual speech have used a manual method of glossing, part-of-speech tagging, and clause-splitting to prepare the data for analysis. In our article, we present innovative tools developed for the first large-scale corpus study of codeswitching triggered by cognates. A study of this size was only possible due to the automation of several steps, such as morpheme-by-morpheme glossing, splitting complex clauses into simple clauses, and the analysis of internal and external codeswitching through the use of database tables, algorithms, and a scripting language.
UR - http://www.scopus.com/inward/record.url?scp=85045044426&partnerID=8YFLogxK
U2 - 10.1093/llc/fqw065
DO - 10.1093/llc/fqw065
M3 - Article
AN - SCOPUS:85045044426
VL - 33
SP - 21
EP - 28
JO - Digital Scholarship in the Humanities
JF - Digital Scholarship in the Humanities
SN - 2055-7671
IS - 1
ER -