STA 199 - STA 199: Introduction to Data Science and Statistical Thinking

1 Wed, Jan 10

Lab 0: Hello, World and STA 199!

šŸ’» lab 0



Thu, Jan 11

Welcome to STA 199

šŸ–„ļø slides 00
āŒØļø ae 00


2 Mon, Jan 15

No lab - Martin Luther King Jr. Day holiday




Tue, Jan 16

šŸ“— r4ds - intro
šŸ“˜ ims - chp 1

Meet the toolkit

šŸ–„ļø slides 01
āŒØļø ae 01



Thu, Jan 18

šŸ“— r4ds - chp 1
šŸŽ„ Data and visualization
šŸŽ„ Visualising data with ggplot2

Grammar of graphics

šŸ–„ļø slides 02
āŒØļø ae 02
āœ… ae 02


3 Mon, Jan 22

šŸ“— r4ds - chp 2

Lab 1: Data visualization

šŸ’» lab 1
āœ… lab 1



Tue, Jan 23

šŸ“˜ ims - chp 4
šŸ“˜ ims - chp 5
šŸŽ„ Visualizing numerical data
šŸŽ„ Visualizing categorical data

Visualizing various types of data

šŸ–„ļø slides 03
āŒØļø ae 02 (cont.)
āœ… ae 02



Thu, Jan 25

šŸ“˜ ims - chp 6

Data visualization overview

šŸ–„ļø slides 04
āŒØļø ae 03
āœ… ae 03


4 Mon, Jan 29

šŸŽ„ Grammar of data wrangling
šŸ“— r4ds - chp 3.1-3.5

Lab 2: Data wrangling

šŸ’» lab 2
āœ… lab 2

Lab 1 at 8 am


Tue, Jan 30

šŸŽ„ Working with a single data frame
šŸ“— r4ds - chp 3.6-3.7
šŸ“— r4ds - chp 4

Grammar of data wrangling

šŸ–„ļø slides 05
āŒØļø ae 04
āœ… ae 04



Thu, Feb 1

šŸŽ„ Tidying data
šŸ“— r4ds - chp 5

Tidying data

šŸ–„ļø slides 06
āŒØļø ae 05
āœ… ae 05


5 Mon, Feb 5

šŸŽ„ Working with multiple data frames

Lab 3: Data tidying and joining

šŸ’» lab 3
āœ… lab 3

Lab 2 at 8 am


Tue, Feb 6

šŸ“— r4ds - chp 19.1-19.3

Joining data

šŸ–„ļø slides 07
āŒØļø ae 06
āœ… ae 06



Thu, Feb 8

šŸŽ„ Data types
šŸŽ„ Data classes
šŸ“— r4ds - chp 16

Data types and classes

šŸ–„ļø slides 08
āŒØļø ae 07
āœ… ae 07


6 Mon, Feb 12

Work on Exam 1 Review

šŸ“ exam 1 review
āœ… exam 1 review

Lab 3 at 8 am


Tue, Feb 13

Exam 1 Review

šŸ–„ļø slides 09



Thu, Feb 15

Exam 1 - In-class + take-home released



7 Mon, Feb 19

Project milestone 1 - Working collaboratively

šŸ““ project milestone 1

Exam 1 take-home at 8 am


Tue, Feb 20

šŸŽ„ Importing data
šŸŽ„ Recoding data
šŸ“— r4ds - chp 7
šŸ“— r4ds - chp 17.1 - 17.3

Importing and recoding data

šŸ–„ļø slides 10
āŒØļø ae 08
āœ… ae 08



Thu, Feb 22

šŸŽ„ Web scraping
šŸŽ„ Scraping top 250 movies on IMDB
šŸŽ„ Web scraping considerations
šŸ“— r4ds - chp 24.1 - 24.6

Web scraping

šŸ–„ļø slides 11
āŒØļø ae 09
āŒØļø ae 09
āœ… ae 09


8 Mon, Feb 26

Lab 4: Web scraping and ethics

šŸ’» lab 4
āœ… lab 4

Project milestone 1 at 8 am


Tue, Feb 27

šŸŽ„ Functions
šŸŽ„ Iteration
šŸ“— r4ds - chp 25.1 - 25.2

Working with Chat GPT

šŸ–„ļø slides 12
āŒØļø ae 09
āœ… ae 09



Thu, Feb 29

šŸŽ„ Misrepresentation
šŸŽ„ Data privacy
šŸŽ„ Algorithmic bias
šŸ“• mdsr - chp 8
šŸŽ„ Alberto Cairo - How charts lie
šŸŽ„ Joy Buolamwini - How I’m fighting bias in algorithms

Data science ethics

šŸ–„ļø slides 13


9 Mon, Mar 4

Lab 5: Topic TBA

šŸ’» lab 5
āœ… lab 5

Lab 4 at 8 am


Tue, Mar 5

šŸŽ„ The language of models
šŸ“˜ ims - chp 7.1

The language of models

šŸ–„ļø slides 14
āŒØļø ae 10
āœ… ae 10



Thu, Mar 7

šŸŽ„ Fitting and interpreting models
šŸŽ„ Modeling nonlinear relationships
šŸ“˜ ims - chp 7.2

Linear regression with a single predictor

šŸ–„ļø slides 15
āŒØļø ae 11
āœ… ae 11


10 Mon, Mar 11

🌓 No lab - Spring Break




Tue, Mar 12

🌓 No lecture - Spring Break




Thu, Mar 14

🌓 No lecture - Spring Break



11 Mon, Mar 18

Project milestone 2 - Project proposals

šŸ““ project milestone 2

Lab 5 at 8 am


Tue, Mar 19

šŸŽ„ Models with multiple predictors
šŸŽ„ More models with multiple predictors
šŸ“˜ ims - chp 8.1-8.2

Linear regression with multiple predictors I

šŸ–„ļø slides 16
āŒØļø ae 12
āœ… ae 12



Thu, Mar 21

šŸ“˜ ims - chp 8.3-8.5

Linear regression with multiple predictors II

šŸ–„ļø slides 17


12 Mon, Mar 25

Lab 6: Modeling I

šŸ’» lab 6
āœ… lab 6

Project milestone 2 at 8 am


Tue, Mar 26

šŸŽ„ Logistic regression
šŸŽ„ Prediction and overfitting

Model selection and overfitting

šŸ–„ļø slides 18
āŒØļø ae 13
āœ… ae 13



Thu, Mar 28

šŸ“˜ ims - chp 9

Logistic regression

šŸ–„ļø slides 19
āŒØļø ae 14
āœ… ae 14


13 Mon, Apr 1

Lab 7: Modeling II

šŸ’» lab 7
āœ… lab 7

Lab 6 at 8 am


Tue, Apr 2

šŸŽ„ Quantifying uncertainty
šŸŽ„ Bootstrapping
šŸ“˜ ims - chp 12

Quantifying uncertainty with bootstrap intervals

šŸ–„ļø slides 20
āŒØļø ae 15
āœ… ae 15



Thu, Apr 4

šŸ“˜ ims - chp 11

Making decisions with randomization tests

šŸ–„ļø slides 21
āŒØļø ae 16
āœ… ae 16


14 Mon, Apr 8

Work on Exam 2 Review

šŸ“ exam 2 review
āœ… exam 2 review

Lab 7 at 8 am


Tue, Apr 9

Exam 2 Review

šŸ–„ļø slides 22



Thu, Apr 11

Exam 2 - In-class + take-home released



15 Mon, Apr 15

Project milestone 3 - Peer review

šŸ““ project milestone 3

Exam 2 take-home at 8 am
Project milestone 3 at the end of lab session


Tue, Apr 16

šŸŽ„ Tips for effective data visualization
šŸ“˜ ims - chp 6
šŸ“— r4ds - chp 10

Communicating data science results effectively

šŸ–„ļø slides 23
āŒØļø ae 17
āœ… ae 17



Thu, Apr 18

šŸŽ„ Doing data science

Customizing Quarto reports and presentations

šŸ–„ļø slides 24
āŒØļø ae 18


16 Mon, Apr 22

Project milestone 4 - Project presentations

šŸ““ project milestone 4

Project presentations at the beginning of lab session


Tue, Apr 23

Looking further: Interactive web applications with Shiny

šŸ–„ļø slides 25
āŒØļø ae 19



Wed, Apr 24


Project writeup at 8 am