[Back]


Talks and Poster Presentations (with Proceedings-Entry):

A. Frank, A. Dittrich:
"Flexible annotation of digital literary text corpus with RDF";
Talk: Workshop on Corpus-Based Research in the Humanities (CRH), Warschau; 2015-12-10; in: "Proceedings of the Workshop on Corpus-Based Research in the Humanities (CRH)", F. Mambrini, M. Passarotti, C. Sporleder (ed.); Polish Academy of Sciences, (2015), ISBN: 978-83-63159-19-1; 49 - 58.



English abstract:
A corpus of text to be used for literary analysis, but generaly for other corpora in many other document oriented fields of digital humanities, must provide methods
- to add and remove texts from the corpus,
- to inquire about the texts in the corpus and
- to query the content of the corpus.
RDF to structure document storage and SPARQL as a flexible query language are suitable to build, maintain and use corpora.
We report on a system to prepare text for inclusion in a corpus for literary analysis, where text structure is annotated partially automatic and the result of linguistic analysis included in the same corpus. Analysis of the text can use SPARQL to extract text parts or produce statistics.Three experiments have shown us that requirements for (i) literary analysis of a single authors work, (ii) the analysis of specific aspects of a limited set of texts - the ontology of fairy tales - or (iii) a wide ranged, mostly statistical, analysis of a large number of texts pose the same fundamental requirements.

German abstract:
A corpus of text to be used for literary analysis, but generaly for other corpora in many other document oriented fields of digital humanities, must provide methods
- to add and remove texts from the corpus,
- to inquire about the texts in the corpus and
- to query the content of the corpus.
RDF to structure document storage and SPARQL as a flexible query language are suitable to build, maintain and use corpora.
We report on a system to prepare text for inclusion in a corpus for literary analysis, where text structure is annotated partially automatic and the result of linguistic analysis included in the same corpus. Analysis of the text can use SPARQL to extract text parts or produce statistics.Three experiments have shown us that requirements for (i) literary analysis of a single authors work, (ii) the analysis of specific aspects of a limited set of texts - the ontology of fairy tales - or (iii) a wide ranged, mostly statistical, analysis of a large number of texts pose the same fundamental requirements.

Keywords:
Text Corpus, RDF Triples, SPARQL

Created from the Publication Database of the Vienna University of Technology.