An epic journey through statistics and machine learning
StatQuest: Random Forests Part 1: Building, using and evaluating.
4 thoughts on “StatQuest: Random Forests Part 1: Building, using and evaluating.”
Hi Joshua I saw you video on decision tree and its one of the best i have seen till date. i wanted to look into the data if possible..is that something I can download?
Thanks…looked into it..i could not locate the exact dataset that has the variables you discussed in your video – chest pain, blood circulation, blocked arteries – and how they are related to heart disease. I am trying to work out the whole gini index math as you showed in the video looking into the relevant data . I see a lot of data here https://archive.ics.uci.edu/ml/machine-learning-databases/heart-disease/
but not the exact one that has the variables you discussed
Hi Joshua I saw you video on decision tree and its one of the best i have seen till date. i wanted to look into the data if possible..is that something I can download?
Yep! You can get the data from UCI’s machine learning archive: https://archive.ics.uci.edu/ml/datasets/Heart+Disease
Thanks…looked into it..i could not locate the exact dataset that has the variables you discussed in your video – chest pain, blood circulation, blocked arteries – and how they are related to heart disease. I am trying to work out the whole gini index math as you showed in the video looking into the relevant data . I see a lot of data here
https://archive.ics.uci.edu/ml/machine-learning-databases/heart-disease/
but not the exact one that has the variables you discussed
This is the link to the data:
http://archive.ics.uci.edu/ml/machine-learning-databases/heart-disease/processed.cleveland.data
This is the link to the description of the data:
https://archive.ics.uci.edu/ml/machine-learning-databases/heart-disease/heart-disease.names