Scientific Coordination
Verena Kunz
Administrative Coordination
Claudia O'Donovan-Bellante
Please wait...
Introduction to Computational Text Analysis with R
About
Location:
Online via Zoom
Online via Zoom
General Topics:
Course Level:
Format:
Software used:
Duration:
Language:
Fees:
Students: 330 €
Academics: 495 €
Commercial: 990 €
Keywords
Additional links
Lecturer(s): Lea Kaftan, Jens Wäckerle, Jan Schwalbach
Course description
Computational text analysis is a fast-growing and widely used methods field, allowing researchers to structure and study large corpora of texts in-depth from a multitude of theoretical backgrounds. The workshop introduces key concepts, standard methods, and the research logic underlying computational text analysis. It is designed for researchers with no or limited experience with computational text analysis who want to analyze text data for their own research.
Participants will learn to pre-process their text data and to apply and validate basic supervised and unsupervised methods such as dictionaries (e.g. sentiment), classification methods (e.g. support vector machines or random forest), and clustering methods (e.g. topic models) to their own text data using the programming language R. Participants will additionally learn the benefits and pitfalls of research designs based on the quantitative analysis of large text corpora, including ethical issues, potential biases, and implications of choices regarding document selection and pre-processing. After the course, participants will be able to choose appropriate designs for their own research questions and apply standard methods of text analysis in R.
Organizational structure of the course
The workshop contains both lectures and practical exercises. The lectures aim at providing a general understanding of the research logic and methods. The practical exercises deepen the participants' understanding of the methods and teach them how to apply the methods to their own data. During the exercises, the lecturers will be available for support, troubleshooting, and questions. Participants can bring their own data or work with data provided by the lecturers.
Target group
Participants will find the course useful if:
- they want to work with text data in their research for the first time.
- they want to refresh prior basic knowledge in text analysis.
- they want to learn how to preprocess text data for quantitative analyses.
- they want an overview of standard computational approaches to analyzing text data.
Learning objectives
By the end of the course participants will:
Prerequisites
Software requirements
R (at least version 4.0.0) and RStudio. Prior to the workshop, participants will receive an R script to install all required packages.