PhD Studentship — Natural Language Processing for Historical Corpora
Posted: 16 May 2022
Closes: 12 June 2022
Location: UCD


The EU-funded VICTEUR project is working with the SFI-funded Insight Centre for Data Analytics at UCD to explore large-scale historical text corpora from a computational perspective. 

As part of this research project, the successful candidate for this PhD role will develop new Natural Language Processing (NLP) and Machine Learning methods to extract insights and trends from large collections of historical texts, combining automated content analysis with “human-in-the-loop” expertise. The PhD project will also look at applying these models to textual data from other sources and domains.

This is a fully funded 4-year full-time PhD position, which will be based in the Insight Centre labs at the UCD Belfield campus, commencing in September 2022.

Supervisor: Dr. Derek Greene, UCD School of Computer Science

Mandatory Criteria:

  • BSc or MSc degree in Computer Science or another relevant technical discipline.
  • Prior knowledge of machine learning, natural language processing, text mining or computational linguistics.
  • Excellent programming and technical skills.
  • Strong written and verbal communication skills.
  • Excellent organisational skills, including a proven ability to work to deadlines.

Desirable Criteria: 

  • Evidence of prior research activity.
  • Experience of collaborating with researchers from other disciplines.
  • Programming experience with the Python data science stack.

Application Guidelines:

Please email the following details by 12th June 2022 to <>

  • A recent Curriculum Vitae.
  • A personal letter of motivation explaining your background and why you wish to apply for this research project.
  • Evidence of recent technical work (e.g., link to a personal GitHub repository).