Want to learn data science in 2021? Here’s the internet’s best curriculum

Curated by David Venturi for the #NotARealDegree community

Image for post
Image for post
The process you’ll use to build your new data skills.
Image for post
Image for post

Curriculum overview

Term 1: Data Analysis

Term 2: Machine Learning & More

Learn: courses, books, and tutorials

Term 1: Data Analysis

Introduction to Data Science

Image for post
Image for post
A video from Data Science for Everyone on DataCamp.

Introduction to Python Programming

Image for post
Image for post
Dr. Joyner teaching in Georgia Tech’s Introduction to Python Programming series on edX.

Setting Up Your Computer

Image for post
Image for post
The JupyterLab interface, where you can interact with the command line, conda, and Git, as well as do fancy data science as displayed in the notebooks.

Python Data Science Toolbox

Importing Data

Preparing Data

Exploratory Data Analysis

Statistics

Image for post
Image for post
An exercise from Introduction to Statistics in Python on DataCamp.

Data Visualization

More Statistics

Image for post
Image for post
The ModernDive website.

Databases & SQL

Image for post
Image for post
An exercise from Joining Data in PostgreSQL on DataCamp.

Data Engineering

Data Warehouses & Cloud Computing

Image for post
Image for post
Snowflake is a cloud-based data warehousing company.

Analytics Engineering

Image for post
Image for post
dbt is pioneering modern analytics engineering.

Term 2: Machine Learning & More

Objects & Algorithms

Introduction to Machine Learning

More Python Programming

Supervised Learning

More Data Visualization

Image for post
Image for post
An exercise from Intermediate Data Visualization with Seaborn on DataCamp.

Unsupervised Learning

Introduction to Neural Networks

Data Science Ethics

Scalable Data Science

Image for post
Image for post
Built with the PyData ecosystem in mind, Dask and BlazingSQL work nicely together.

Time Series Analysis

Text Analysis

Other Fun Stuff

Frame: blog posts and YouTube videos

Assess: adaptive tests

Image for post
Image for post
From the DataCamp Signal white paper: “Assessment results include a score (0–200), a percentile (0%-100%), and an associated knowledge level (Novice, Intermediate, Advanced).”
Image for post
Image for post
The screen before you start DataCamp’s Python Programming assessment.
Image for post
Image for post
My Python skills measured over time. June 8th: A little rusty. June 9th: After refreshing my skills, I scored 149 (95th percentile). December 24th: Rusty again (plus a little tired). Just like any skill, your data skills can erode over time if you don’t keep them sharp!

Create: self-directed projects

Image for post
Image for post
DataCamp Signal telling me my current strengths and skill gaps for Python programming.
How we’ll collaborate in Deepnote.

Career services

Image for post
Image for post

How I created the curriculum

Next steps

Curating the internet’s best data science program.

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store