Skip to the content.

CS534 - Machine Learning

This repository contains course materials for CS 534 - Machine Learning provided in Fall 2020 at Emory University.

Course syllabus, slides, and homework are largely borrowed/copied from Professor Joyce Ho’s CS534 class.

Overview

“Software is eating the world,” said Marc Andreessen in 2011. By then, he was referring to companies like Amazon, Netflix, Apple, and Google who are massively transforming the traditional industries with the power of the internet and software. It was true in some sense, as apparent in their enormous growth in market capitalization.

How did these companies become so successful? There may be many reasons, one may say. Unfortunately, the answer to this question is beyond the scope of this course. However, I find that one commonality of these companies remarkably stands out: “their insatiable appetite for data”. Six years later, Marc Andreessen’s famous quote was wittily supplemented with “…, but AI is going to eat software” by Jensen Huang.

We are living in an exciting time. Artificial intelligence and its key engine, machine learning, are impacting every aspect of our lives. From shopping groceries to treating diseases, people are experimenting and adopting machine learning algorithms, achieving the level of productivity that was never possible before.

In this course, students will learn some of the fundamental concepts, theories, and algorithms of machine learning. Students will have a chance to implement and test some of the time-tested machine learning algorithms. At the end of the class, students will be able to 1) identify and formulate machine learning problems, 2) implement and test appropriate machine learning algorithms, and 3) understand and articulate the limitations of the approaches.

Prerequisites:

Logistics

Schedule

WEEKLY OBJECTIVE: Most of the time, each lecture focuses on a different machine learning algorithm, starting from very foundational yet simple ones to more complex and sophisticated ones. Students will be able to understand 1) how such machine learning algorithms are motivated/developed, 2) when such models will be useful and avoided, and 3) how to implement and modify such models to deal with more custom settings.

GUEST LECTURES: We are excited to have a stellar array of guest lecturers:

Date Topic Reference Assignment
08/19 Introduction . Homework #0
08/24 Prerequisite - Statistics . .
08/26 Linear Model - Regression (1) ESL Chapter 3 Homework #1
08/31 Linear Model - Regression (2) ESL Chapter 3 .
09/02 Linear Model - Classification ESL Chapter 4 .
09/07 Linear Model - Other Distributions ESL Chapter 4 and GLMNET .
09/09 Evaluation Metrics ESL Chapter 7 Homework #2
09/14 Evaluation Strategies ESL Chapter 7 .
09/16 Bias-Variance Tradeoff (1) Chapter 20-27 of Machine Learning Yearning (Optional) .
09/21 Bias-Variance Tradeoff (2) Chapter 20-27 of Machine Learning Yearning (Optional) .
09/23 Decision Tree . Homework #3
09/28 Boosting . .
09/30 Boosting . .
10/05 Bagging . .
10/07 Random Forests . .
10/12 Nearest Neighbors Chapter 3 of Mining of Massive Dataset (Optional) .
10/14 Dimensionality Reduction (1) . Homework #4
10/19 Dimensionality Reduction (2) . .
10/21 Convex Optimization . .
10/26 Project Elevator Pitch . .
10/28 Support Vector Machine . .
11/02 Neural Networks . Homework #5
11/04 Guest Lecture (Yen Sia Low) . .
11/09 Guest Lecture (TBD) . .
11/11 Guest Lecture (TBD) . .
11/16 Presentations . .
11/18 Presentations . .
11/23 Presentations . .

NOTE: Course materials with the md extension are Markdown files. To read the Markdown files, you can use various “free” Markdown viewer applications out there e.g:

Grading

Project

The goal of the final project is to:

  1. identify and formulate a machine learning problem,
  2. come up with appropriate solutions,
  3. implement the solutions,
  4. analyze the results and limitations, and
  5. suggest any algorithmic/engineering improvements.

You are free to find any datasets and problems of your research interest.

You are encouraged to work in groups of 2-3 people for the project. Teams are required to hand in a project proposal, a final report, and presentation slides of their work.

Note that the final project contributes to 40% of your final grade. The final report should include a section that details each member’s contribution to the project.

Grading for the Final Project:

Policy

Honor Code

All classwork is governed by the College Honor Code, Emory Laney Graduate School, and Computer Science Departmental Policy. It is acceptable and encouraged to discuss homework with other students. However, this should be noted on your submitted homework and all code and writeup must be written by yourself. Any code and writeup that is found to be similar are grounds for an honor code investigation by the Director of Graduate Studies, Laney Graduate School, and the honor council.