The CLS INFRA Zotero collection is listed below, courtesy of the Zotpress plugin. You can also join our Group to add CLS INFRA resources to your own Zotero library.

Agić, Ž., Aranzabe, M., Atutxa, A., Bosco, C., Choi, J., Marneffe, M.-C. de, Dozat, T., Farkas, R., Foster, J., Ginter, F., Goenaga, I., Gojenola, K., Goldberg, Y., Hajič, J., Johannsen, A., Kanerva, J., Kuokkala, J., Laippala, V., Lenci, A., … Zeman, D. (2015). Universal Dependencies 1.1. LINDAT/CLARIN digital library at Institute of Formal and Applied Linguistics, Charles University in Prague.
Akbik, A., Blythe, D., & Vollgraf, R. (2018). Contextual String Embeddings for Sequence Labeling. 1638–1649.
Akbik, A., Bergmann, T., Blythe, D., Rasul, K., Schweter, S., & Vollgraf, R. (2019). FLAIR: An Easy-to-Use Framework for State-of-the-Art NLP. Proceedings of the 2019 Conference of the North, 54–59.
Bastian, M., Heymann, S., & Jacomy, M. (2009). Gephi: An Open Source Software for Exploring and Manipulating Networks. International AAAI Conference on Weblogs and Social Media.
Béranger, M., Heiden, S., & Lavrentiev, A. (2015). Reengineering Akkadian Tablets with TEI and TXM for Linguistic Analysis. TEI Conference and Members’ Meeting, 36.
Birkholz, J., Börner, I., Chambers, S., Charvat, V., Cinková, S., Dejaeghere, T., Dudar, J., Ďurčo, M., Eder, M., Edmond, J., Fileva, E., Fischer, F., Heiden, S., Křen, M., Kunda, B., Mrugalski, M., Murphy, C., Odebrecht, C., Raciti, M., … Van Rossum, L. (2022). Computational Literary Studies Infrastructure (CLS INFRA): a project to connect people, data, tools, and methods. Digital Humanities 2022: Conference Abstracts, 624–627.
Birkholz, J. M., Börner, I., Byszuk, J., Chambers, S., Charvat, V. M., Cinková, S., Dejaeghere, T., Dudar, J., Ďurčo, M., Eder, M., Edmond, J., Fileva, E., Fischer, F., Garnett, V., Heiden, S., Křen, M., Kunda, B., Laszakovits, S., Mrugalski, M., … van Rossum, L. (2023, June 30). Computational Literary Studies Infrastructure (CLS INFRA): Initial Findings and Conclusions for the Field: Literature by Numbers. DH2023.
Bloomfield, L., & Hockett, C. F. (1984). Language. University of Chicago Press.
Börner, I., & Trilcke, P. (2023). CLS INFRA D7.1 On Programmable Corpora. 10.5281/zenodo.7664964
Börner, I., Charvat, V. M., Ďurčo, M., Mrugalski, M., & Odebrecht, C. (2022). Computational Literary Studies Data Landscape Review.
Börner, I., Trilcke, P., Milling, C., Fischer, F., & Sluyter-Gäthje, H. (2023, July 10). Dockerizing DraCor. Digital Humanities 2023. Collaboration as Opportunity (DH2023), Graz, Austria.
Börner, I., Charvat, V. M., Ďurčo, M., Mrugalski, M., & Odebrecht, C. (2022). Computational Literary Studies Data Landscape Review. 272–273.
Bresnan, J., Asudeh, A., Toivonen, I., & Wechsler, S. (2015). Lexical-Functional Syntax. Wiley.
Calvo Tello, J., Rißler-Pipka, N., Barth, F., Jung, K., & Schöch, C. (2023). Questionnaire of the survey: How do you Compose your Literary Corpus or Literary Collection?
Carroll, S. R., Garba, I., Figueroa-Rodríguez, O. L., Holbrook, J., Lovett, R., Materechera, S., Parsons, M., Raseroka, K., Rodriguez-Lonebear, D., Rowe, R., Sara, R., Walker, J. D., Anderson, J., & Hudson, M. (2020). The CARE Principles for Indigenous Data Governance. Data Science Journal, 19, 43.
Chomsky, N., & Lightfoot, D. W. (2002). Syntactic Structures. Mouton de Gruyter.
Christ, O. (1994). A modular and flexible architecture for an integrated corpus query system. Proceedings of COMPLEX’94: 3rd Conference on Computational Lexicography and Text Research, 23–32.
Christ, O., & Schulze, B. M. (1995). Ein flexibles und modulares Anfragesystem für Textcorpora. Tagungsbericht Des Arbeitstreffen Lexikon Und Text.
Deliverable 8.1: Report of the Tools for Basic Natural Language Processing (NLP) Tasks. (2023, May 19).
Cinková, S., & Janssen, M. (2023). Towards a sustainable community effort: training NLP data for under-resourced language domains in CLS. TwinTalks 4: Understanding and Facilitating Remote Collaboration in Digital Humanities.
Cinková, S., Cvrček, V., Janssen, M., & Křen, M. (2022, June). CLS-INFRA TRAINING SCHOOL ON DATA AND ANNOTATION (Vol. 1) [Recorded lectures + pdf resources].
Cinková, S., Janssen, M., Cvrček, V., & Křen, M. (2022). CLS INFRA Prague Training School Hand-outs.
Cinková, S., Birkholz, J. M., Börner, I., Dejaeghere, T., Heiden, S., Janssen, M., Křen, M., & Pozo, A. P. (2023). CLS INFRA D8.1 Report of the tools for the basic Natural Language Processing (NLP) tasks in the CLS context.
Cvrček, V. (2021). Calc: Corpus Calculator (1.04).
Cvrček, V., Čech, R., & Kubát, M. (2020). QuitaUp – a tool for quantitative stylometric analysis. Czech National Corpus and University of Ostrava.
de la Rosa, J., Pérez, Á., Hernández, L., Díaz, A., Ros, S., & González-Blanco, E. (2021). PoetryLab as Infrastructure for the Analysis of Spanish Poetry. 75–82.
de la Rosa, J., Pozo, Á. P., Ros, S., & González-Blanco, E. (2023). ALBERTI, a Multilingual Domain Specific Language Model for Poetry Analysis.
de Marneffe, M.-C., MacCartney, B., & Manning, C. D. (2006). Generating Typed Dependency Parses from Phrase Structure Parses. Proceedings of the IEEE / ACL 2006 Workshop on Spoken Language Technology.
Dejaeghere, T. (2022). Beyond Babylonian Confusion: a case-study based approach for multilingual NLP on historical literature. 1.
Díaz, A., de la Rosa, J., Pérez, Á., Lorenzo, L. H., González-Blanco, E., & Ros, S. (2020). Averell a management tool to transform XML/TEI poetic corpora in JSON POSTDATA ontology compliant. Zenodo.
Dimitrova, L., Erjavec, T., Ide, N., Kaalep, H. J., Petkevic, V., & Tufis, D. (1998). Multext-East: Parallel and Comparable Corpora and Lexicons for Six Central and Eastern European Languages. COLING 1998 Volume 1: The 17th International Conference on Computational Linguistics.
Ďurčo, M., Charvat, V. M., Börner, I., Mrugalski, M., & Odebrecht, C. (2022). CLS INFRA D6.1 Inventory of existing data sources and formats.
Ďurčo, M., Charvat, V. M., Resch, S., Börner, I., & Plank, L. (2023). CLS INFRA D6.3 Standards beyond TEI / Extended Transformation Matrix / Alternative Formats.
Eder, M. (2021, February 11). A quick tour around the CLS INFRA project for computational literary studies.
Eder, M. (2022). CLS INFRA: an infrastructural project.
Eder, M. (2023). Digital Humanities Infrastructures: The Case of Computational Literary Studies. IAUPE 2023, Rome.
Eder, M. (2023). Computational Literary Studies Infrastructure: challenges in the post-COVID era. TwinTalks 4: Understanding and Facilitating Remote Collaboration in Digital Humanities, Graz.
Eder, M., Rybicki, J., & Kestemont, M. (2016). Stylometry with R: A Package for Computational Text Analysis. The R Journal, 8(1), 107.
Edmond, J. (2023, November 9). Literary Studies: Close, Distant, and Everywhere in Between [Lecture]. ARIANE Meeting, Lyon, France.
Edmond, J., & Yakupova, V. (2023, July 10). What’s the use? Exploring academic applications of (computational) literary studies. Digital Humanities 2023. Collaboration as Opportunity (DH2023), Graz, Austria.
Understanding user requirements beyond academic research CLS infrastructure (DARIAH): Task 3.5. (2023, October 26).
Evert, S., & Hardie, A. (2011). Twenty-first century Corpus Workbench: Updating a query architecture for the new millennium. Proceedings of the Corpus Linguistics 2011 Conference. Corpus Linguistics, Birmingham, UK.
Firth, J. (1957). A Synopsis of Linguistic Theory 1930-1955. In Studies in Linguistic Analysis. Philological Society, Oxford.
Fischer, F., Börner, I., Göbel, M., Hechtl, A., Kittel, C., Milling, C., & Trilcke, P. (2019, July 10). Programmable Corpora: Introducing DraCor, an Infrastructure for the Research on European Drama. DH2019: »Complexities«. 9–12 July 2019. Book of Abstracts. DH2019 »Complexities«, Utrecht.
CLS INFRA TNA Fellow Khanim Garayeva. (2023, May 18).
Gerdes, K., Hajičová, E., Wanner, L., & Press, I. (2013). Computational Dependency Theory. IOS Press.
Ghajari, A., & Fresno, V. (n.d.). Platform for exploring Semantic Composition from pre-trained Language Models and static embeddings. Retrieved December 3, 2022, from
Ghajari, A., Fresno, V., & Amigó, E. (2022). Plataforma de exploración de la Composición Semántica a partir de Modelos de Lenguaje pre-entrenados y embeddings estáticos. 52–56.
Giovannini, L., Skorinkin, D., Trilcke, P., Börner, I., Fischer, F., Dudar, J., Milling, C., & Pořízka, P. (2023, June 30). Distributed Corpus Building in Literary Studies: The DraCor Example.