Video Index - StatQuest!!!

This page contains links to playlists and individual videos on Statistics, Statistical Tests, Machine Learning, The StatQuest Musical Dictionary, Webinars, Live Streams, and The AI Buzz, organized, roughly, by category. Generally speaking, the videos are organized from basic concepts to complicated concepts, so, in theory, you should be able to start at the top and work you way down and everything will make sense.

Playlists:

Statistics Fundamentals – These videos give you a general overview of statistics as well as a be a reference for statistical concepts. Topics include:
- Histograms
- What is a statistical distribution?
- And many more!!!
Linear Regression and Linear Models – These videos teach the basics relating to one of statistics most powerful tools. Linear Regression and Linear Models allow us to use continuous values, like weight or height, and categorical values, like favorite color or favorite movie, to predict a continuous value, like age.
Logistic Regression – These videos pick up where Linear Regression and Linear Models leave off. Now, instead of predicting something continuous, like age, we can predict something discrete, like whether or not someone will enjoy the 1990 theatrical bust Troll 2.
Machine Learning – Linear Models and Logistic Regression are just the tips of the machine learning iceberg. There’s tons more to learn, and this playlist will help you trough it all, one step at a time.
Neural Networks, Deep Learning, and AI – Everything you need to know, from the basics, to image classification with Convolutional Neural Networks, to the state of the art transformers used for large language models like ChatGPT, presented one step at a time so that it is easily understood.
- I also have a complete course on Attention, a learning mechanism in Large Language Models like ChatGPT at deeplearning.ai
High Throughput Sequence Analysis – If you do high-throughput sequence analysis, this playlist is for you!
Statistics in R – If you want to do any of this stuff in R, this playlist is for you, and you only. No one else is allowed to watch it.
#66DaysOfData – If you want to participate in Ken Jee’s #66DaysOfData and are having trouble thinking of new stuff to learn, here’s a playlist that covers everything from the basics to the fancy stuff.
The StatQuest Musical Dictionary – Short songs to help you remember all of the data science terminology.
Human Stories in AI with StatQuest and Lighting AI is about the career journeys of passionate AI experts. From their humble beginnings to conquered challenges, we’ll be inspired by the real-world experiences of professionals thriving in the ever-evolving AI landscape.
The AI Buzz – A conversation between me and Luca Antiga (and guests) about AI and how it works and how it is changing everything.

Statistics Fundamentals:

Statistical Tests:

Machine Learning and Dealing with large datasets that have lots and lots of measurements per sample:

(NOTE: All of the linear model and curve fitting stuff in the “Basics” section is also considered Machine Learning, so make sure you check out those videos).

Neural Networks

NOTE: All of the neural network videos and more are covered in The StatQuest Illustrated Guide to Neural Networks and AI (with hands-on tutorials in PyTorch).

The StatQuest Musical Dictionary:

Webinars:

Human Stories in AI:

The AI Buzz:

Live Streams:

2020-01-06
- 0:00 Introduction
- 1:52 Comment #1 – What’s the difference between a single sample of 20 measurements and 4 samples of 5 measurements each.
- 6:20 Comment #2 – Is machine learning a subset of statistics?
- 9:59 Comment #3 – A poem!!!
- 10:51 Viewer Questions/Comments Also, here’s the link to the Introduction to Statistical Learning: http://faculty.marshall.usc.edu/gareth-james/ISL/
2020-01-20
- 0:00 Introduction
- 1:04 Comment #1 – What is your favorite machine learning algorithm
- 4:40 Comment #2 – What is data leakage in machine learning?
- 8:39 Comment #3 – Where do you learn these nitty gritty details?
- 13:37 Live Question #1 – R-squared and Adjusted R-squared
- 17:23 Live Question #2 – How are the videos arranged on https://statquest.org/video-index/ (simple to complex)
- 18:26 Live Question #3 – Is it important to learn all of the formulas and equations even though we have advanced software that does the work?
2020-02-03
- 0:00 Silly Song and Introduction
- 0:18 A big huge announcement
- 3:14 Question #1 – Do we use statistical models to predict or explain stuff?
- 8:31 Question #2 – Can you show the effects of regularization?
- 9:42 My cat, Poe
- 15:04 Question #3 – How do I choose the best machine learning algorithm for my data?
- 21:17 Live Questions
2020-02-17
- 0:00 Silly Song and Introduction
- 0:46 Question #1 – What do we do with imbalanced data?
- 13:24 Question #2 – Post-Hoc Tests for ANOVA
- 22:11 Live Questions
2020-03-02
- 0:00 Silly Song
- 1:35 Question #1 – What is the best model for the corona virus epidemic?
- 4:34 Question #2 – Two types of p-hacking explained and what to do about them:
- 20:31 Live Questions:
2020-03-16 – Naive Bayes
- 0:00 Silly Song and Introduction
- 2:02 Naive Bayes
2020-04-06 – Gaussian Naive Bayes
- Gaussian Naive Bayes
- Why Naive Bayes Is Naive
2020-04-20 – Expected Values (NOTE: There is now a full StatQuest video on Expected Values that revises and updates this material).
2020-05-04 – Conditional Probability
2020-05-18 – Bayes’ Theorem
2020-06-01 – Hypothesis Testing
2020-06-15 – Bootstrapping Main Ideas
2022-10-04 – One-Hot Encoding, Label Encoding and Target Encoding
2022-10-18 – Target Encoding without Leakage and CatBoost, Part 1
2022-11-01 – Cosine Similarity for Natural Language Processing (NLP) and CatBoost, Part 2
2024-05-10 – Luis Serrano and Josh Starmer, Episode #1

High-throughput Sequencing Analysis: