Computer Vision for the Humanities: An Introduction to Deep Learning for Image Classification (Part 1)

Over the last ten years, the field of computer vision, which seeks to gain a high-level understanding of images using computational techniques, has seen rapid innovation. For example, computer vision models can locate and identify people, animals and thousands of objects included in images with high accuracy. This technological advancement promises to do the same for image recognition that the combination of OCR/NLP techniques has done for texts. Put simply, computer vision opens up a part of the digital archive for large-scale analysis that has remained mostly unexplored: the millions of images in digitised books, newspapers, periodicals, and historical documents. Consequently, historians will now be able to explore the ‘visual side of the digital turn in historical research’.

This two-part lesson provides examples of how computer vision techniques can be applied to analyse large historical visual corpora in new ways and how to train custom computer vision models. As well as identifying the contents of images and classifying them according to category — two tasks which focus on visual features — computer vision techniques can also be used to chart the stylistic (dis)similarities between images.

Reviewed by:

Michael Black
Catherine DeRose

Learning outcomes

After completing this lesson, you will be able to:

Know what steps are needed to train a deep learning model
Understand some of the specific considerations around using deep learning and computer vision for humanities research

Cite as

Daniel van Strien, Kaspar Beelen, Melvin Wevers, Thomas Smits and Katherine McDonough (2024). Computer Vision for the Humanities: An Introduction to Deep Learning for Image Classification (Part 1). Version 1.0.0. Edited by Nabeel Siddiqui and Alex Wermer-Colan. ProgHist Ltd [Training module]. https://doi.org/10.46430/phen0101

Full metadata

Title:

Computer Vision for the Humanities: An Introduction to Deep Learning for Image Classification (Part 1)

Authors:

Daniel van Strien, Kaspar Beelen, Melvin Wevers, Thomas Smits, Katherine McDonough

Domain:

Social Sciences and Humanities

Language:

English

Published to DARIAH-Campus:

06/02/2024

Originally published:

17/08/2022

URL:

https://doi.org/10.46430/phen0101

Content type:

Training module

License:

CC BY 4.0

Sources:

Programming Historian

Topics:

Python, Machine Learning

Version:

1.0.0

PID:

https://hdl.handle.net/21.11159/019595c4-2d4a-74d9-8c78-08852b58fd9a

Learning outcomes

Cite as

Reuse conditions

Full metadata