Jump to Block: (About) 01 02 03 04 05 06 07 08 09 10 11 12
Coursebook for Data Science Toolbox
Lectures
- 1.1 - Introduction to Data Science (26.59)
- 1.2 - Exploratory Data Analysis (26.45)
- Slides
- Reference R code (NB: See 1.3.1 for explanation)
- 2.1.1 Overview of Regression (35.15)
- 2.1.2 - Modern Regression (28.30)
- 2.2.1 Statistical Testing - Classical Testing (25.50)
- 2.2.2 - Statistical Testing - Empirical Testing (24.53)
- 2.2.3 - Model Selection (36.26)
- 3.1 Latent Structures and PCA (41:12)
- 3.2.1 Clustering Part 1 (32:34)
- 3.2.2 Clustering Part 2 (34:32)
Worksheets
- Worksheet 01.2 Exploratory Data Analysis
- Worksheet 02.1 Regression and Correlation
- Worksheet 02.2 Statistical Testing
- Worksheet 03.1 Latent Spaces and PCA
- Worksheet 03.2 Clustering
- Worksheet 04.1 Non-parametrics
- Worksheet 04.2 Outliers and Missing Data
- Worksheet 05.1 Introduction to Classification
- Worksheet 05.2 Ensemble Learning
- Worksheet 06.1 Trees, Forests, Decisions
- Worksheet 07.1 Topic Models
- Worksheet 08.1 Algorithms
- Worksheet 09.1 Neural Networks
- Worksheet 10.1 Parallel algorithms
- Worksheet 11.1 Parallel Infrastructure
- Worksheet 12.1 Ethics and Privacy in Data Science
Preparation
See Appendix 1: Preparation for details.
Workshops
- 1.3.1 - Workshop Lecture for RStudio (29.05)
- 1.3.2 - Workshop Lecture for Exploratory Data Analysis (18.13)
- 1.3.3 Workshop Lecture on Assessments, split into the following parts:
- 2.3 - Workshop Lecture on Regression (28.58)
- 3.3 - Workshop Lecture (21:55)
Assessments
Semester 1:
- Example Assessment
- Assessment 0 (Formative, i.e. does not contribute to grade)
- Assessment 1
- Assessment 2
Semester 2:
See Appendix 2: Assessment for details, and Appendix 3: Replicability for notes on how to ensure that code works for everyone, by documenting versions of packages.