Skip to main content

Introduction to Statistics and Data Science (202-DL-20)

Instructors

John Michael Bailey
Professor Bailey is a well known teacher and researcher of human sexuality. He has taught at Northwestern since 1989, and in the School of Professional Studies since 2000. His research has focused on sexual orientation, including its genetic and evolutionary aspects.

Meeting Info

Online: TBA

Overview of class

This course focuses on the analysis of data using computer software. The approach is conceptual—I want you to understand, not to memorize. Important concepts include samples versus populations, normal curves and the central limit theorem, sampling distributions, standard errors, statistical inference, correlation and regression, t-tests, analysis of variance (ANOVA), and the chi-squared test. We will also discuss the replication crisis in science and how bad statistics helped cause it. Students who take this class will learn to avoid these problems. This course is conducted completely online. A technology fee will be added to tuition.

Registration Requirements

There are no formal prerequisites for this course.

Recommended skills include comfort with basic algebra and some experience with spreadsheet software, such as Microsoft Excel or Google Spreadsheet.

Learning Objectives

By the end of this course, you will be able to:

Identify situations in which statistical tests are appropriate and the general rationale for their application.
Distinguish situations in which each of the following statistical tests are appropriate: Z-test, one-sample t-test, paired t-test, two-sample t-test, correlation, regression, multiple regression, chi-squared.
Conduct the statistical tests from the second bullet point above using computer software and identify the key aspects of the output.
Using the results of the statistical tests, write informative results for these tests.
List, and conduct, the steps necessary for statistical inference.
Distinguish Type 1 and Type 2 errors and factors that influence (and do not influence) them.
Identify some of the statistical reasons for the current replication crisis in science.

Class Materials (Required)

Materials available in Canvas.

Class Attributes

Asynchronous:Remote class-no scheduled mtg time