Universidad Nacional de Educación a Distancia, Madrid
National Distance Education University, Madrid


Digging for Gold:
Knowledge Extraction From Text

9-11 May 2023

Eager to kick start your research in Digital Humanities and Computational Literary Studies? Join CLS INFRA in Madrid for our three day international training school in Knowledge Extraction from Text. From 9-11 May 2023, we will offer a crash course in how to “Dig for Gold” in a corpus of texts. From Stylometry to Natural Language Processing, you will be completing your own analyses and visualizing the results in a hands-on way. We will work with existing code that is plug and play, so it is not necessary to have existing experience in Python or R. Most of all, it will be a fun and safe environment to boost your textual analysis skills. You’ll be surprised what precious information you can extract from your own data set!

How to apply

To apply for the CLS INFRA Madrid Training School 2023, please fill out the registration form. Applications open in January 2023 and close on 1 March 2023. After the deadline, we will be in touch with you to notify you of the result of your application. We ask you to read and complete the sign-up form carefully.

Event Details

Training School Schedule

The preliminary schedule for the training school is now available. Note that it still may be subject to changes.

Venue & Remote Access Pathways

This event will be taking place in person and online. Details of how to attend this hybrid event can be found below.

Teacher profiles

Get to know the CLS INFRA Training School Team by clicking on their profiles:

Meet the UNED Training School Team by clicking on their profiles:

Previous training school

Would you like to get a sense of what you can expect from a CLS INFRA Training School? Check out these videos featuring participants from our first Training School at Charles University, Prague in June 2022.


The learning materials from our inaugural CLS INFRA Training School on data and annotation are hosted on the DARIAH-Campus platform. Click on the button opposite to view all materials and recordings from the Training School.


Most frequent questions and answers

When attending the physical workshops, please bring a laptop computer.

Corpus building is an integral part of this workshop. Therefore, we ask you to bring a few text files of your own interest for your individual hands-on experience with building a corpus. These files do not necessarily have to be in English.

To visit the city of Madrid, CLS INFRA cannot supply travel or childcare grants. However, the three days training can be followed free of charge – and lunch is included. Moreover, we are currently working on accommodation arrangements and advice for participants. More information about this will be released in due course.

This is an elementary course for beginners. If you are already very experienced in Named Entity Recognition or Stylometry, or if you are well-versed in programming in R or Python, this course may be too slow for you. Check out our preliminary schedule for more details.

In our sign-up form we will ask you to indicate your experience level, but in case you would like a personalised recommendation, please reach out to our local organiser Salvador Ros: sros@scc.uned.es.

If you do have prior experience with the skills offered in this training school, you might still be interested in applying for an transnational access (TNA) fellowship to pursue your individual research or teaching project with the on-site support by our CLS INFRA project partners: https://clsinfra.io/opportunities/tnafellowships/

The Training School takes place at UNED, Madrid and full details of how to attend in person or remotely can be found here.

You can read the teacher profiles above this section, under ‘Teacher Profiles.’

Due to the personalized and tailor-made nature of this workshop, there will be no public live-stream of the sessions. However, the DARIAH-Campus platform will host training materials and recordings of plenary sessions post-event. See here for training materials from our Prague training school.

NB: When registering for the training school, participants will be given the opportunity to allow or deny permission to appear in recordings of the event.

The UNED cafeteria will be able to accommodate dietary requirements. Please indicate these in the sign-up form.

The preliminary schedule is available here and is subject to change. 

We ask you to attend all three workshop days to deliver you the most complete and integrated learning experience.

On Tuesday (9 May) evening, the CLS INFRA will host complimentary drinks in the city of Madrid. On Wednesday (10 May), we are hoping to host a local excursion after workshops end. Further details will be announced as they are confirmed.

Applications open in January 2023 and close on 1 March 2023. Once applications open, you can register by clicking the button at the top of this page.

Read more about our access pathways here.

Note that we will be keeping physical and hybrid attendance relatively limited in order to provide a personalised training experience.

Before the Training School takes place we will reach out to you via e-mail to confirm your placement and mode of participation.

Please find our project at @CLSinfra. When tweeting about the Madrid training school, please use hashtag #CLSInfraTraining!

Madrid Training School is the first of a training school series hosted by the European infrastructure project CLS INFRA. Read more about our objectives here.

Please reach out to our Training School contact Bartlomiej Kunda: bartlomiej.kunda@ijp.pan.pl.