GESIS Training Courses
user_jsdisabled
Search

Scientific Coordination

Verena Kunz

Administrative Coordination

Janina Götsche

Preprocessing and Analyzing Web Tracking Data

About
Location:
hybrid (online via Zoom / Unter Sachsenhausen 6-8)
 
General Topics:
Course Level:
Format:
Software used:
Duration:
Language:
Fees:
Students: 220 €
Academics: 330 €
Commercial: 660 €
 
Keywords
Additional links
Lecturer(s): Frank Mangold, Helena Rauxloh, Sebastian Stier

About the lecturer - Frank Mangold

About the lecturer - Helena Rauxloh

About the lecturer - Sebastian Stier

Course description

The internet has fundamentally changed people's lives. From a scientific perspective, human behavior has diversified significantly and is becoming increasingly volatile in the digital age. Therefore, self-reports in surveys cannot validly reflect the diversity and content of individual use of digital media. As a reaction, web tracking, i.e., the collection of the web browsing behavior of study participants through web browser plugins, has emerged as a new data collection method. The collection and linkage of web tracking with panel surveys offers the most comprehensive methodology for explaining and investigating online media use and information-seeking behaviors and their effects.
Still, web tracking data poses theoretical, infrastructural, ethical, and legal challenges for researchers. When data is successfully acquired, efficient and scalable pipelines need to be produced to handle the resulting large datasets. And before starting an analysis, researchers have to take manifold preprocessing and data aggregation decisions that need to be theoretically justified and transparently reported.
This workshop introduces participants to the preprocessing and analysis of web tracking data. Participants will get to know foundational studies, theories, and methods for handling web tracking data. We will show how to navigate challenges and how to devise appropriate research designs in computational social science. Primarily, the course will be application oriented. Participants will familiarize themselves with the main preprocessing steps and routines when handling web tracking data and learn how to implement these in R. The hands-on applications will be based on example web tracking datasets collected by GESIS. Equipped with this theoretical and methodological toolkit, participants will be able to develop their own research projects based on web tracking data and can discuss research ideas with the lecturers.
 
Organizational structure of the course
The workshop will be structured around three hours of classroom instruction and three hours of hands-on exercises per day. We will mix these elements, aiming to create an engaging and varied learning environment. The instruction sessions will introduce theories, concepts and methods. The hands-on sessions will be structured around exercises and group work. The instructors will support work on assignments and exercises, structure and facilitate discussions within group work assignments and be available for individual consultations on participants' projects.


Target group

You will find the course useful if:
  • You are interested in the analysis of digital behavioral data
  • Have an interest in individual-level research such as the exposure to online content
  • Have an interest in the analysis of complex, longitudinal, nested data


  • Learning objectives

    By the end of the course, you will:
  • Have an advanced understanding of concepts, methods, and challenges related to web tracking data
  • Be able to critically reflect upon researcher degrees of freedom and know about best practices when working with web tracking data
  • Be equipped with an appropriate toolbox and best practices for efficiently analyzing web tracking data in R


  • Prerequisites

  • Experience in wrangling, preprocessing and joining data using the R tidyverse
  • Please bring your own laptops for use in the course
  •  
    Software and hardware requirements
    Participants should have the latest versions of R and RStudio installed on their own laptop.


    Schedule

    Recommended readings