GitHub - TZstatsADS/ADS_Teaching: Teaching repo for Applied Data Science @ Columbia, a project-based course for data science skills (statistical thinking, machine learning, data engineering, team work, presentation, endurance of frustration, etc).
Stat GU4243/GR5243 Applied Data Science
Spring 2024 - Teaching Materials (Syllabus)
Project cycle 1: (Individual) R notebook for exploratory data analysis
(starter codes)
Week 1 (January 15)
Week 2 (January 22)
Week 3 (January 29)
Project cycle 2: Shiny App Development
(starter codes)
Week 3 (January 29)
- Project 2 starts.
- Check Piazza for your project team and follow the video instructions to clone the starter codes.
- After you join project 2, you can clone your team's GitHub repo to your local computer.
- You can find in the starter codes:
- the project description,
- an example toy shiny app.
Week 4 (February 5)
Week 5 (February 12)
Week 6 (February 19)
Finished student projects
Project cycle 3: Weakly Supervised Learning
(starter codes)
Week 6 (February 19)
- Project 3 starts.
- Check Piazza for your project team and follow the video instructions to clone the starter codes.
- After you join project 3, you can clone your team's GitHub repo to your local computer.
- You can find in the starter codes
Week 7 (February 26)
Week 8 (March 4)
Spring Break (March 11)
Week 9 (March 18)
- Project 3 submission and presentations
Project cycle 4: Algorithm implementation and evaluation
(starter codes)
Week 9 (March 18)
Week 10 (March 25)
- Talk on fairness (see slides)
- Overview on the methods
- Method assignment on Piazza
Week 11 (April 1)
- Live class canceled
- Feel free to have group brainstorm and meetings offline
Week 12 (April 8)
Week 13 (April 15)