GESIS Training Courses
user_jsdisabled
Search

Scientific Coordination

Julia Leesch
Tel: +49 221 47694-169

Administrative Coordination

Laura Rüwe

Topic Modeling in R

Lecturer(s):
Dr. Wouter van Atteveldt, Dr. Kasper Welbers

Date: 13.11 - 15.11.2019 ics-file

Location: Cologne / Course language: Englisch

About the lecturer - Dr. Wouter van Atteveldt

About the lecturer - Dr. Kasper Welbers

The first workshop will give an introduction to topic modeling using R. The first day of the workshop we will introduce R, Rstudio, and tidyverse. This day can be safely skipped by students or researchers with experience in using R.
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
In the first day of the second workshop, we will introduce topic modeling and the principles of automatic text analysis and topic modeling. We will explain the basic assumptions of bag-of-words analysis, unsupervised clustering, and the dirichlet distribution. We will use the quanteda and topicmodels packages for doing the analyses and LDAviz and corpustools for visualization and validation.

The second day of the second workshop we will first look in depth at how fitting an LDA model with Gibbs sampling actually works and look at the various parameters and choices. We will also look at linguistic preprocessing using the spacy package. Finally, we will introduce alternative topic models, from Dynamic and Correlated topic models to Structural Topic Models. We will use the stm package to show how to estimate a structural topic model with time or source as covariates, and show how to analyse and interpret the results.


More Information