STA 199 - STA 199: Introduction to Data Science and Statistical Thinking
Lab 0: Hello, World and STA 199!
š» lab 0
Thu, Jan 11
Welcome to STA 199
š„ļø slides 00
āØļø ae 00
2 Mon, Jan 15
No lab - Martin Luther King Jr. Day holiday
Tue, Jan 16
š r4ds - intro
š ims - chp 1
Meet the toolkit
š„ļø slides 01
āØļø ae 01
Thu, Jan 18
š r4ds - chp 1
š„ Data and visualization
š„ Visualising data with ggplot2
Grammar of graphics
š„ļø slides 02
āØļø ae 02
ā
ae 02
3 Mon, Jan 22
š r4ds - chp 2
Lab 1: Data visualization
Tue, Jan 23
š ims - chp 4
š ims - chp 5
š„ Visualizing numerical data
š„ Visualizing categorical data
Visualizing various types of data
š„ļø slides 03
āØļø ae 02 (cont.)
ā
ae 02
Thu, Jan 25
š ims - chp 6
Data visualization overview
š„ļø slides 04
āØļø ae 03
ā
ae 03
4 Mon, Jan 29
š„ Grammar of data wrangling
š r4ds - chp 3.1-3.5
Lab 2: Data wrangling
Lab 1 at 8 am
Tue, Jan 30
š„ Working with a single data frame
š r4ds - chp 3.6-3.7
š r4ds - chp 4
Grammar of data wrangling
š„ļø slides 05
āØļø ae 04
ā
ae 04
Thu, Feb 1
š„ Tidying data
š r4ds - chp 5
Tidying data
š„ļø slides 06
āØļø ae 05
ā
ae 05
5 Mon, Feb 5
š„ Working with multiple data frames
Lab 3: Data tidying and joining
Lab 2 at 8 am
Tue, Feb 6
š r4ds - chp 19.1-19.3
Joining data
š„ļø slides 07
āØļø ae 06
ā
ae 06
Thu, Feb 8
š„ Data types
š„ Data classes
š r4ds - chp 16
Data types and classes
š„ļø slides 08
āØļø ae 07
ā
ae 07
6 Mon, Feb 12
Work on Exam 1 Review
š exam 1 review
ā
exam 1 review
Lab 3 at 8 am
Tue, Feb 13
Exam 1 Review
š„ļø slides 09
Thu, Feb 15
Exam 1 - In-class + take-home released
7 Mon, Feb 19
Project milestone 1 - Working collaboratively
š project milestone 1
Exam 1 take-home at 8 am
Tue, Feb 20
š„ Importing data
š„ Recoding data
š r4ds - chp 7
š r4ds - chp 17.1 - 17.3
Importing and recoding data
š„ļø slides 10
āØļø ae 08
ā
ae 08
Thu, Feb 22
š„ Web scraping
š„ Scraping top 250 movies on IMDB
š„ Web scraping considerations
š r4ds - chp 24.1 - 24.6
Web scraping
š„ļø slides 11
āØļø ae 09
āØļø ae 09
ā
ae 09
8 Mon, Feb 26
Lab 4: Web scraping and ethics
Project milestone 1 at 8 am
Tue, Feb 27
š„ Functions
š„ Iteration
š r4ds - chp 25.1 - 25.2
Working with Chat GPT
š„ļø slides 12
āØļø ae 09
ā
ae 09
Thu, Feb 29
š„ Misrepresentation
š„ Data privacy
š„ Algorithmic bias
š mdsr - chp 8
š„ Alberto Cairo - How charts lie
š„ Joy Buolamwini - How Iām fighting bias in algorithms
Data science ethics
š„ļø slides 13
9 Mon, Mar 4
Lab 5: Topic TBA
Lab 4 at 8 am
Tue, Mar 5
š„ The language of models
š ims - chp 7.1
The language of models
š„ļø slides 14
āØļø ae 10
ā
ae 10
Thu, Mar 7
š„ Fitting and interpreting models
š„ Modeling nonlinear relationships
š ims - chp 7.2
Linear regression with a single predictor
š„ļø slides 15
āØļø ae 11
ā
ae 11
10 Mon, Mar 11
š“ No lab - Spring Break
Tue, Mar 12
š“ No lecture - Spring Break
Thu, Mar 14
š“ No lecture - Spring Break
11 Mon, Mar 18
Project milestone 2 - Project proposals
š project milestone 2
Lab 5 at 8 am
Tue, Mar 19
š„ Models with multiple predictors
š„ More models with multiple predictors
š ims - chp 8.1-8.2
Linear regression with multiple predictors I
š„ļø slides 16
āØļø ae 12
ā
ae 12
Thu, Mar 21
š ims - chp 8.3-8.5
Linear regression with multiple predictors II
š„ļø slides 17
12 Mon, Mar 25
Lab 6: Modeling I
Project milestone 2 at 8 am
Tue, Mar 26
š„ Logistic regression
š„ Prediction and overfitting
Model selection and overfitting
š„ļø slides 18
āØļø ae 13
ā
ae 13
Thu, Mar 28
š ims - chp 9
Logistic regression
š„ļø slides 19
āØļø ae 14
ā
ae 14
13 Mon, Apr 1
Lab 7: Modeling II
Lab 6 at 8 am
Tue, Apr 2
š„ Quantifying uncertainty
š„ Bootstrapping
š ims - chp 12
Quantifying uncertainty with bootstrap intervals
š„ļø slides 20
āØļø ae 15
ā
ae 15
Thu, Apr 4
š ims - chp 11
Making decisions with randomization tests
š„ļø slides 21
āØļø ae 16
ā
ae 16
14 Mon, Apr 8
Work on Exam 2 Review
š exam 2 review
ā
exam 2 review
Lab 7 at 8 am
Tue, Apr 9
Exam 2 Review
š„ļø slides 22
Thu, Apr 11
Exam 2 - In-class + take-home released
15 Mon, Apr 15
Project milestone 3 - Peer review
š project milestone 3
Exam 2 take-home at 8 am
Project milestone 3 at the end of lab session
Tue, Apr 16
š„ Tips for effective data visualization
š ims - chp 6
š r4ds - chp 10
Communicating data science results effectively
š„ļø slides 23
āØļø ae 17
ā
ae 17
Thu, Apr 18
š„ Doing data science
Customizing Quarto reports and presentations
š„ļø slides 24
āØļø ae 18
16 Mon, Apr 22
Project milestone 4 - Project presentations
š project milestone 4
Project presentations at the beginning of lab session
Tue, Apr 23
Looking further: Interactive web applications with Shiny
š„ļø slides 25
āØļø ae 19
Wed, Apr 24
Project writeup at 8 am