Course Description

470.643 - Text as Data

Text is not straightforward. In this course, students develop the tools necessary to collect, analyze, and visualize large amounts of text. The course begins with a hands-on introduction to the programming concepts needed to collect and process textual data, then proceeds to the key concepts from statistics and machine learning used to analyze text as data. Throughout the course, students develop a research project that culminates in the online display of results from a large-scale textual analysis. Prerequisite: 470.681 Statistics and Political Analysis.