Presenting the Bangor Autoglosser and the Bangor Automated Clause-Splitter

D. M. Carter* (Corresponding Author), M. Broersma, K. Donnelly, A. Konopka

*Corresponding author for this work

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

Until recently, corpus studies of natural bilingual speech and, more specifically, codeswitching in bilingual speech have used a manual method of glossing, partof- speech tagging, and clause-splitting to prepare the data for analysis. In our article, we present innovative tools developed for the first large-scale corpus study of codeswitching triggered by cognates. A study of this size was only possible due to the automation of several steps, such as morpheme-by-morpheme glossing, splitting complex clauses into simple clauses, and the analysis of internal and external codeswitching through the use of database tables, algorithms, and a scripting language.

Original languageEnglish
Pages (from-to)21-28
Number of pages8
JournalDigital Scholarship in the Humanities
Volume33
Issue number1
Early online date27 Feb 2017
DOIs
Publication statusPublished - Apr 2018

    Fingerprint

ASJC Scopus subject areas

  • Information Systems
  • Language and Linguistics
  • Linguistics and Language
  • Computer Science Applications

Cite this