Data Science Project (390-0-20)

Instructors

Han Liu

Meeting Info

Technological Institute M128: Mon 5:00PM - 7:50PM

Overview of class

In this course, students will work in groups to solve a real-world data science problem, gaining experience that will prepare them for their future careers. Students will learn to use version control tools such as Git and GitHub to collaborate effectively with their teammates, and they will gain hands-on experience using Google Cloud and the big data platform Spark for data science.

At the end of the course, each group will present its project and findings to the class, showcasing the skills and techniques its members have learned. The course is designed to prepare senior students for the workforce by giving them hands-on experience with the entire project lifecycle and exposing them to real-world challenges and collaborative environments using modern data science tools and technologies.

Registration Requirements

STAT 301-3 or STAT 303-3 or consent of instructor

Learning Objectives

The project should leave you with a data product to show potential employers or educational programs as a strong indicator of your expertise in data science.

Students will learn to collaborate closely as a team using the version control system Git and GitHub.
Students will be able to apply essential methods for exploratory data analysis and modern machine learning methods for predictive modeling.
Students will be able to carefully document their analysis results.

Teaching Method

Students will be organized into small teams. At each class, each team is expected to meet with the instructor to review progress and to complete a Meeting Notes page.

Evaluation Method

Each student will be evaluated based on the performance of their project team.

Class Materials (Required)

No required textbook.

Class Materials (Suggested)

In-class lecture notes will be provided.