Skip to main content

Text Mining YouTube Comment Data with Wordfish in R

This lesson will introduce readers to a method for conducting research on internet discourse by performing data analysis on comments posted under YouTube videos by viewers.

You will learn how to download YouTube video comments and analyze their textual data using the natural language processing algorithm Wordfish. Designed for scaling textual data using unsupervised machine learning, Wordfish brings into relief the salient dimensions of latent meaning within a corpus.

This lesson will guide you through three key phases of 1) data collection, 2) data preparation (cleaning and modeling), and 3) data analysis (including generating visualizations).

Reviewed by:

  • Janna Joceli Omena
  • Heather Lang

Learning outcomes

After completing this lesson, you will be able to:

  • Use YouTube Data Tools to download video comments and metadata
  • Use R to sort and clean the comment data
  • Use Wordfish to analyze and visualize the comment data to search for underlying meaning and ideological positioning within
Interested in learning more?

Check out this lesson on Programming Historian's website

Go to this resource

Cite as

Alex Wermer-Colan, Nicole Lemire Garlic and Jeff Antsen (2024). Text Mining YouTube Comment Data with Wordfish in R. Version 1.0.0. Edited by Nabeel Siddiqui. ProgHist Ltd. [Training module]. https://doi.org/10.46430/phen0120

Reuse conditions

Resources hosted on DARIAH-Campus are subjects to the DARIAH-Campus Training Materials Reuse Charter

Full metadata

Title:
Text Mining YouTube Comment Data with Wordfish in R
Authors:
Alex Wermer-Colan, Nicole Lemire Garlic, Jeff Antsen
Domain:
Social Sciences and Humanities
Language:
en
Published to DARIAH-Campus:
1/30/2025
Originally published:
8/7/2024
Content type:
Training module
Licence:
CCBY 4.0
Sources:
Programming Historian
Topics:
Data management, Data visualisation
Version:
1.0.0