Data Analysis 6: Principal Component Analysis (PCA) - Computerphile
Video Overview & Insights
PCA - Principle Component Analysis - finally explained in an accessible way, thanks to Dr Mike Pound. This is part 6 of the Data Analysis Learning Playlist: https://www.youtube.com/playlist?list=PLzH6n4zXuckpfMu_4Ff8E7Z1behQks5ba
Check out the full Data Analysis Learning Playlist: https://www.youtube.com/playlist?list=PLzH6n4zXuckpfMu_4Ff8E7Z1behQks5ba
This Learning Playlist was designed by Dr Mercedes Torres-Torres & Dr Michael Pound of the University of Nottingham Computer Science Department. Find out more about Computer Science at Nottingham here: https://bit.ly/2IqwtNg
This series was made possible by sponsorship from by Google.
I just watched a statquest video on this, then I watched this video. I fully understand PCA and got all questions right on my assignment!
The music dataset can be found here: https://github.com/mdeff/fma
https://www.facebook.com/computerphile
Very nice explanation!!!!! â€â€â€â€ Greets from Germany
https://twitter.com/computer_phile
This video was filmed and edited by Sean Riley.
the only way you save money is by not spending it
Computer Science at the University of Nottingham: https://bit.ly/nottscomputer
Computerphile is a sister project to Brady Haran's Numberphile. More at http://www.bradyharan.com
Itâs not necessary to scale your data if you use scale=TRUE in the prcomp function.
More User Perspectives
itâs a dimension reduction technique (not a data reduction technique, thatâs a misnomer)
@redmotherfiveWhy the PCs are always orthogonal to each other ?
@EW-mb1ihI understand why itâs necessary to reduced our data (divide by the std). But Why is it necessary to center our dataset?
@EW-mb1ihCan someone explain to me how the PC1 and linear regression line differ? I understand they serve different purposes, but if both get calculated by minimizing the squared sum of the datapoints to a fitting line it technically always is the same line right?
@Carlos848awesome Explanation thank you
@kilogrammhunger961How to project data from an n dimensional space to an m dimensional space. n > m
@rishidixit7939How do you get column names of that 133 features that make up PCA1 for submitting that as a data frame for Kmeans?
@breadandcheese1880Thank you for this brilliant video. In a less then a half an hour I developed intuition that it would take me a month to do from a book.
@HitAndMissLabWhy even have lectures? This fella explained why we "maximize the variance" so clearly in the first 5 minutes.. Lecturers should just make us watch this video in class... great stuff!
@kanewilliams1653Cool đ
@framm703Thank you for explaining this! Very good quality of the video
@ăăŁăȘăČăăč-313Excellent video
@djstr0b3A very nice explanation! Thanks!
@kirar2004don't watch the video if you know nothing about pca , come back after you know what is it from StatQuest or other channels
@donfeto7636Genius explanation
@tapanbasak1453@ 9:45 starts r
@HamromerochannelI gotta say I enjoy this video so much and kinda started to under stand what PCA is and what it is used for. Totally a new and different angle to look at this concept. Thank you again Dr. Mike.
@harpercfc_Good stuff. Is the "weighted sum" the frobenius norm or related? I'm following a book and I'm trying to compare how it is teaching this to how it is explained in other forms of media like youtube videos.
@Rockyzach88Thank you for this video.
@melikaelwadany4524But how do we make use of principle components afterwards, despite the fact that we canât interpret the components since they no longer represent the original variables? Without interpretability, can PC still be useful? What can PC still tell us?
@user-wr4yl7tx3wDr. Mike, you are a genius.
@asgharbeigi9718What a simple way to explain PCA! Thank you so much for the video.
@gzuzchuy505i fall in love:D
@leksa8845Thanks Dr. Mike, really helpful!
@reacher3232I'm late to the party but this playlist is gold. Thanks guys :)
@demonblood88415:25 Why you have not constructed a center of data? Project points to both X and Y axis, calculate both averages and then draw perpendiculars where these averages will intersect which will be a center of dataset
@m22d52Great video. I also enjoyed the throwback stripey dot-matrix printer paper :)
@annprong5052"A new principal component is gonna come out orthogonal to the ones before, until you run out of dimensions and you can't do it anymore."
- poetry
ridiculously understandable explained! thank you very much!
@paull923Best explanation I've come upon as of yet. Thanks!
@soupsOffUpon first hearing the phrase "principal component analysis", I thought it sounded very analogous to finding principal stress axes in a body under load. As Dr. Pound gave a more detailed explanation later, I realized that is exactly what it is - just expanded to take place in n-dimensional space instead of 3D space. May be a helpful way to visualize for any mechanical engineers out there.
@tlniecCongratulations again for a great video. Thank you!
@ec92009yThank you Dr. Pound, finally someone who can explain pca in easy words. Really helpful in my thesis - and by a strange accident I ended up writing both my thesis about pca. First time in my Bachelors I used it for data reduction, this time I use it to categorize data.
@ZilfalonI agree with most of what is being taught in this video . Using a new basis to maximize variance or minimize the projection error is why PCA is used . What I can't agree with however is the lecturer telling that PCA is used to cluster data . I don't think this is necessarily true . PCA clusters those features which are highly correlated together . It doesn't cluster the data points when they are represented using the new basis vectors . I hope I am not wrong
@alexandros27.Very nice explanation. I almost never subscribe but you got me. Thank you.
@VG-bi9swDude, you're better at explaining this than our uni professor :""D
please keep doing what you're doing.
Thank you.
"sponsorship from by Google" - was this piece of English generated by Google's AI?
@charlieangkor8649That maximizes the variance=r2? Bc it seems like p1 was tvhere to minimize the variiance between the linne and the points no?
@ControlTheGuhSo the idea behind it, is a finding a right angle to look at all data, where we can see clearly all data and distances between them. Looks more like support vector machine or SVM, where we increase dimensionality to fit the line on some other dimension.
@8eckThank you for the explanation.
@trafalgarlaw9919And what is the benefit of doing PCA? Are we training our neural networker quicker or why would I do this? I still have to collect all the variables, so what is the point?
@CenthihiIt is very pleasant to listen to you. Thanks!
@simaykazc15085:34 why when we rotate the axis data also split out as 2 clusters?
@sdeitymbrilliant, thanks!
@proprius