Presenting the Bangor Autoglosser and the Bangor Automated Clause-Splitter

D. M. Carter* (Corresponding Author), M. Broersma, K. Donnelly, A. Konopka

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)


Until recently, corpus studies of natural bilingual speech and, more specifically, codeswitching in bilingual speech have used a manual method of glossing, partof- speech tagging, and clause-splitting to prepare the data for analysis. In our article, we present innovative tools developed for the first large-scale corpus study of codeswitching triggered by cognates. A study of this size was only possible due to the automation of several steps, such as morpheme-by-morpheme glossing, splitting complex clauses into simple clauses, and the analysis of internal and external codeswitching through the use of database tables, algorithms, and a scripting language.

Original languageEnglish
Pages (from-to)21-28
Number of pages8
JournalDigital Scholarship in the Humanities
Issue number1
Early online date27 Feb 2017
Publication statusPublished - Apr 2018


Dive into the research topics of 'Presenting the Bangor Autoglosser and the Bangor Automated Clause-Splitter'. Together they form a unique fingerprint.

Cite this