Home

Awesome

DAT4 Course Repository

Course materials for General Assembly's Data Science course in Washington, DC (12/15/14 - 3/16/15).

Instructors: Sinan Ozdemir and Kevin Markham (Data School blog, email newsletter, YouTube channel)

Teaching Assistant: Brandon Burroughs

Office hours: 1-3pm on Saturday and Sunday (Starbucks at 15th & K), 5:15-6:30pm on Monday (GA)

Course Project information

MondayWednesday
12/15: Introduction12/17: Python
12/22: Getting Data12/24: No Class
12/29: No Class12/31: No Class
1/5: Git and GitHub1/7: Pandas<br>Milestone: Question and Data Set
1/12: Numpy, Machine Learning, KNN1/14: scikit-learn, Model Evaluation Procedures
1/19: No Class1/21: Linear Regression
1/26: Logistic Regression,<br>Preview of Other Models1/28: Model Evaluation Metrics<br>Milestone: Data Exploration and Analysis Plan
2/2: Working a Data Problem2/4: Clustering and Visualization<br>Milestone: Deadline for Topic Changes
2/9: Naive Bayes2/11: Natural Language Processing
2/16: No Class2/18: Decision Trees<br>Milestone: First Draft
2/23: Ensembling2/25: Databases and MapReduce
3/2: Recommenders3/4: Advanced scikit-learn<br>Milestone: Second Draft (Optional)
3/9: Course Review3/11: Project Presentations
3/16: Project Presentations

Installation and Setup

Class 1: Introduction

Homework:

Optional:

Class 2: Python

Homework:

Optional:

Resources:

Class 3: Getting Data

Homework:

Optional:

Resources:

Class 4: Git and GitHub

Homework:

Optional:

Resources:

Class 5: Pandas

Homework:

Optional:

Resources:

Class 6: Numpy, Machine Learning, KNN

Homework:

Resources:

Class 7: scikit-learn, Model Evaluation Procedures

Homework:

Optional:

Resources:

Class 8: Linear Regression

Homework:

Optional:

Resources:

Class 9: Logistic Regression, Preview of Other Models

Resources:

Class 10: Model Evaluation Metrics

Homework:

Optional:

Resources:

Class 11: Working a Data Problem

Class 12: Clustering and Visualization

Homework:

Resources:

Class 13: Naive Bayes

Resources:

Homework:

Class 14: Natural Language Processing

Resources:

Class 15: Decision Trees

Homework:

Resources:

Installing Graphviz (optional):

Class 16: Ensembling

Resources:

Class 17: Databases and MapReduce

Resources:

Class 18: Recommenders

Resources:

Class 19: Advanced scikit-learn

Homework:

Resources:

Class 20: Course Review

Resources:

Class 21: Project Presentations

Class 22: Project Presentations