TNA Fellow Petr Porizka

CapekDraCor & CzeDraCor: Capek Drama Corpus & Czech Drama Corpus

This project sets out to create CapekDraCor, a pilot database containing all five plays by Karel Čapek, as the foundation for a broader CzechDraCor corpus, an extension of the international DraCor (Drama Corpora) platform developed by the University of Potsdam. DraCor hosts structured, multilingual drama corpora for literary and linguistic research, but Czech drama is not yet represented. Unlike existing resources, such as the Czech National Corpus, DraCor allows for detailed structural segmentation, character-level analysis, and integration with digital tools (such as API, SPARQL, and Shiny DraCor). The resulting Czech database will support research in corpus linguistics and digital humanities education, while also being accessible to the broader academic and educational community. By incorporating Czech drama into DraCor, the project enhances Czech participation in international digital literary research and offers a unique, richly annotated resource for the study of dramatic texts.

The Czech Drama Corpus is available by clicking on the button below.