TY - JOUR
T1 - Relational data modelling of textual corpora
T2 - the Skaldic Project and its extensions
AU - Wills, Tarrin
N1 - This work was supported by the Australian Research Council and the College of Arts and Social Sciences, University of Aberdeen.
PY - 2015/6/1
Y1 - 2015/6/1
N2 - Skaldic poetry is a highly complex textual phenomenon both in terms of the intricacy of the poetry and its contextual environment. XML applications such as that of the Text Encoding Initiative provide a means of semantic representation of some of these complexities. XML, however, has limitations in representing semantic relationships that do not conform to the tree model. This paper presents the relational data model as a way of representing the structure of skaldic texts and their contextual environment. The relational data model raises both problems and possibilities for this type of project. The main problem addressed here is the representation of the syntagmatic structures of texts in a data model that is not intrinsically ordered. The advantages are also explored, including networked data editing and management, quantitative linguistic analysis, dynamic representation of the data, and the ability to extend the structure and reuse data for related projects without creating redundancy.
AB - Skaldic poetry is a highly complex textual phenomenon both in terms of the intricacy of the poetry and its contextual environment. XML applications such as that of the Text Encoding Initiative provide a means of semantic representation of some of these complexities. XML, however, has limitations in representing semantic relationships that do not conform to the tree model. This paper presents the relational data model as a way of representing the structure of skaldic texts and their contextual environment. The relational data model raises both problems and possibilities for this type of project. The main problem addressed here is the representation of the syntagmatic structures of texts in a data model that is not intrinsically ordered. The advantages are also explored, including networked data editing and management, quantitative linguistic analysis, dynamic representation of the data, and the ability to extend the structure and reuse data for related projects without creating redundancy.
U2 - 10.1093/llc/fqt045
DO - 10.1093/llc/fqt045
M3 - Article
VL - 30
SP - 294
EP - 313
JO - Literary and Linguistic Computing
JF - Literary and Linguistic Computing
SN - 0268-1145
IS - 2
ER -