TNA Fellow Botond Szemes

New Metrics for Computational Drama Analysis

The aim of the project was to create a method that compares the utterances of characters in dramatic texts according to whether a character tends to talk innovatively compared to others, or vice versa: to what extent she or he tends to repeat others. Another important part of the project was the development of the Hungarian sub-corpus of the DraCor database (HunDraCor: https://dracor.org/hun). Previously, 41 dramas were available, but the underlying corpus, which is being developed at Eötvös Loránd University (ELTE-DH), has since grown significantly. Furthermore work was carried out towards the alignment of these two corpora, the cleaning of the TEI XML files, and the creation of a workflow that can be used automatically when changes are made to one of the corpora (ELTE-DH or DraCor).