Abstract
This article provides an account of the steps involved in adapting IBM's Languageware natural language processing software to a large corpus of highly non-standard 17th century documents. It examines the challenges encountered as part of this process, and outlines the approach adopted to provide a robust and reusable tool for the linguistic analysis of early modern source texts.
Original language | English |
---|---|
Pages (from-to) | 39-54 |
Number of pages | 16 |
Journal | Literary and Linguistic Computing |
Volume | 27 |
Issue number | 1 |
Early online date | 15 Dec 2011 |
DOIs | |
Publication status | Published - Apr 2012 |
Keywords
- linguistics
- digital humanities
- 1641 depositions
- corpus linguistics