Using SPSS to carry out Principal components analysis (2018)

Поділитися
Вставка
  • Опубліковано 6 вер 2024

КОМЕНТАРІ • 67

  • @rajibkanti9949
    @rajibkanti9949 Рік тому

    Don't know how I would show gratitude for this kind of great work you have done. Take a prostrated bow, Sir.

  • @TheKubbas
    @TheKubbas 4 роки тому +6

    This is insane! Feels like going to class with a great lecturer. Thank you so much! Helped me with my final year project!

  • @mehdibadami5942
    @mehdibadami5942 2 роки тому

    Best practical video for PCA, Millions of thanks!

  • @Chandrajith100
    @Chandrajith100 3 місяці тому

    The best video. thank you very much and love from sri lanka!!!

  • @briannabrown6173
    @briannabrown6173 5 років тому +5

    This video helped me get through my stats hw. Thank you so much for your time in crafting it I'm sure it took a long time!

  • @nafisimtiazhossain9335
    @nafisimtiazhossain9335 3 роки тому

    i dont kow why i am loving your interpretation

  • @czaitabaniag9852
    @czaitabaniag9852 5 років тому +5

    Hello, may I know the variables exactly? And what is the data about?

  • @ZzKhosh-vy7xj
    @ZzKhosh-vy7xj 11 місяців тому

    Thank you so much for this useful video. I have a question. After doing the final step and creating the factors, for one of the subjects that I have, all factors become zero. Could you explain why this happens? Thank you.

  • @ckikic1
    @ckikic1 4 роки тому

    Very good and clear video explaining how to run a PCA analysis! Thank you very much!!

  • @peggy_evergreen1370
    @peggy_evergreen1370 Місяць тому

    Thank you so much for this helpful video ✨

  • @kalpananatarajan7370
    @kalpananatarajan7370 4 роки тому

    Wow...great lecture I ever found in you tube. Thank you so much sir.

  • @mushyydahabid7845
    @mushyydahabid7845 2 роки тому +1

    Thank you so much for this amazing lecture!

  • @IanIsaacLaw
    @IanIsaacLaw 4 місяці тому

    Love your video man...

  • @sumitsourav838
    @sumitsourav838 2 роки тому

    Where is the principal components in this video??

  • @chaitanyalodha3948
    @chaitanyalodha3948 2 роки тому

    That was a great video!! learnt and understood a lot. Thanks!

  • @patriciaborja8198
    @patriciaborja8198 3 роки тому

    Muito obrigada, aqui do Brasil, Salvador-Bahia!

  • @latiefahmad920
    @latiefahmad920 2 роки тому

    Sir i did same but unfortunately kmo box didn't come in results! What could b the reason even though i clicked on kmo ??

  • @kmahim82
    @kmahim82 5 років тому +1

    what to do when our independent variables are of mixed scale…. For example i have likert scale responses and also discrete( 2 or 3 scale) responses…..
    is there any way to perform PCA/EFA on such mixed data?????

  • @mariaaverina2247
    @mariaaverina2247 3 роки тому

    Thank you very much for this video. It helped me a lot!

  • @qasimahmad6714
    @qasimahmad6714 3 роки тому

    Is it important to show 95% confidence ellipse in PCA? why it is so? If my data is not drawing it then what should i do ? can i used PCA score graph without 95% confidence ellipse?

  • @DrEktaPathakMishra
    @DrEktaPathakMishra 2 роки тому

    Thank you for this informative video. 😇😇

  • @charlesngobidi4165
    @charlesngobidi4165 4 роки тому

    Thanks for your video. It was very helpful.

  • @DrHanjabamBarunSharma
    @DrHanjabamBarunSharma 3 роки тому

    like the loading plot easy explanation :-)

  • @andygillam2292
    @andygillam2292 5 років тому

    Very helpful. Great detail!

  • @munibahmad6227
    @munibahmad6227 3 роки тому

    I don't know how to set data for KMO and PCA test. Is there any one to help?

    • @mikecrowson2462
      @mikecrowson2462  3 роки тому

      Hi there, Munib. This is a bit older video on PCA. I have a newer one where I talk about KMO here: ua-cam.com/video/6Ycw3_iQHD0/v-deo.html

  • @ejeducate
    @ejeducate 4 роки тому

    I have a large number of scale items and I need to narrow it down to factors in order to test the hypothesis, so I used factor analysis, I noticed that I can have 3 component, however 1 component when I ran the reliability test for it it was very low (0.4) it is understandable since it only has 2 items, I can't use it because of this. I decided to do what you did as in limit the components to 2 and in the transformation matrix the first component is 0.682 (correlated, I think the word?) with the second, while the second is the negative of that number (-0.682) with the first component and with each other 1 with 1 and 2 with 2 (0.731). Are the values acceptable and can I continue using 2 components? if yes, then how would I interpret the results in my thesis about the component transformation matrix (the one I just mentioned its values) and the total variance explained table (since it shows 3 component not 2)

  • @keithmannix2221
    @keithmannix2221 4 роки тому

    Hi Mike, what is the best way to decide on which rotation method to use on my data?

  • @stisha6449
    @stisha6449 3 роки тому

    Hello Mike, great video! A small question. If we perform an EFA using principal axis factoring and the number of components extracted accounts for 56 % of total variance , is it advisable to consider those 4 ones?

  • @markuggayam6006
    @markuggayam6006 3 роки тому

    Hi Mike! Can I give my questionnaire to my experimental and control groups even BEFORE conducting EFA?
    Due to lack of time to learn and run EFA, the questionnaire I gave to my experimental and control groups is not yet reduced and NO identified factors are determined as of yet.
    My plan is to conduct EFA later on. When reduced items are already available, that's the time that I will be selecting appropriate responses from my study groups for my analysis. Please comment. Thank you

  • @ahmalik566
    @ahmalik566 4 роки тому

    Thank you for very informative video Sir. Furthermore, I have a question. Take for example of this video, results suggest 2 pc's are significant that have eigenvalue more than 1 and it creates two variables in data sheet. How I can transform these two components into one variable?

    • @barhom11eriqat98
      @barhom11eriqat98 2 роки тому

      Yes bro, this is important question, I also looking for the answer!!

    • @ahmalik566
      @ahmalik566 2 роки тому

      @@barhom11eriqat98 Please follow the below mentioned articles
      Malik, A. H., bin Md Isa, A. H., bin Jais, M., Rehman, A. U., & Khan, M. A. (2021). Financial stability of Asian Nations: Governance quality and financial inclusion. Borsa Istanbul Review.
      Malik, A. H., Jais, M. B., Isa, A. H. M., & Rehman, A. U. (2022). Role of social sustainability for financial inclusion and stability among Asian countries. International Journal of Social Economics.

  • @edogankent7047
    @edogankent7047 4 роки тому

    thank you , but I dont understand the difference of factor analysis and principal component analysis. You use varimax here.

    • @mikecrowson2462
      @mikecrowson2462  4 роки тому +4

      Thanks for your question. You have the same question that perplexes many folks learning about these techniques! In the past (and I dare say even today), these two procedures have been used for the same purpose - that is, exploratory factor analysis (EFA). A number of authors (e.g., Russell, 2002) have argued against the use of PCA for performing EFA both for mathematical, conceptual, and interpretative reasons. EFA is based on the common factor model and the analyst attempts to explain only the associations among a set of measured variables - factoring out the unique variance (i.e., specific variance) associated with those variables. This means that with EFA, the analyst is attempting to explain the covariation (not the total variation) by identifying factors that produce that association. PCA centers on breaking down the total variation in a set of measured variables and does not make any demarcation between the variation that is shared and unshared among a set of measured variables. So you can say that EFA is aimed at explaining covariation among a set of variables, whereas PCA is aimed at breaking down the total variation. The common factor model assumes that unobserved factors "produce" the observed covariation among a set of measured variables. PCA does not make this assumption, and is instead used for creating new linear composites of the original variables. Although the two approaches look and behave similarly, their assumptions and aims are quite different. Probably what makes this more confusing is the fact that PCA is sometimes used as a preliminary step (using scree plot or parallel analysis) when attempting to identify number of factors. I could keep going but this is getting long. So, think of it this way: EFA generally is used to explore underlying factors, whereas PCA (as a stand-alone procedure) would be oriented towards data reduction (i.e., reducing a larger number of variables down to a smaller set; which can be useful for addressing multicollinearity when using other procedures). I have a couple of other videos on PCA and EFA with additional learning materials you might check out at ua-cam.com/video/6Ycw3_iQHD0/v-deo.html (PCA) and ua-cam.com/video/M6FUT0h-bhY/v-deo.html (EFA using principal axis factoring). There are also references in the videos. [one other thing: varimax rotation is a strategy for increasing interpretability of factors in EFA or components in PCA. It has nothing to do with the choice of carrying out EFA or PCA, though). I hope this helps. Cheers!

    • @edogankent7047
      @edogankent7047 4 роки тому

      @@mikecrowson2462 Thank you very much fast answer.Finally , I understand main maybe only one difference is using principal component or principal axis , isnt it?Secondly , according to my research social science use usually correlation matrix and chemistry uses covariance matrix , isnt it?

    • @mightyhelen7553
      @mightyhelen7553 4 роки тому +1

      @@mikecrowson2462 Thank you so much!

    • @mikecrowson2462
      @mikecrowson2462  4 роки тому

      Hi there. I'm not clear what you are asking on the first question. On the second, I don't know about disciplinary differences in the use of correlation versus covariance matrix. I'm a social scientist and have always relied on the correlation matrix when factoring. However, I don't know about other disciplines such as chemistry. Best wishes.

    • @mikecrowson2462
      @mikecrowson2462  4 роки тому

      @@mightyhelen7553 Thanks for visiting! I'm glad you found this useful :)

  • @deltami2562
    @deltami2562 4 роки тому

    Hi, thank you for your video. I have a question, concerning the extraction of variables and observations on the same plane! How can I do this please?

    • @mikecrowson2462
      @mikecrowson2462  4 роки тому

      Hi. I'm a bit unclear about exactly you are asking. By the way, you might consider viewing my most recent video on PCA in SPSS (you can also download a Powerpoint underneath the video description): ua-cam.com/video/6Ycw3_iQHD0/v-deo.html
      I have another video on exploratory factor analysis here too: ua-cam.com/video/M6FUT0h-bhY/v-deo.html
      I hope you find these helpful and they clarify things for you. Best wishes.

  • @hectorponce2012
    @hectorponce2012 3 роки тому

    Does the concept of cross loading exist in PCA like it does in EFA? What the criteria are to determine cross loading in case it does exist?

    • @mikecrowson2462
      @mikecrowson2462  3 роки тому

      Hi there. When you are performing PCA or EFA and look into the tables containing component/factor loadings, you will see manifest variables have loadings on all retained components/factors. Since most of the loadings for a manifest variable will be non-zero across components/factors, then the loadings can all be considered cross-loadings. The issue you are speaking to is the question of which component/factor should a manifest variable be used for in its definition. To do that, you should not only be concerned with the issue of meeting a minimum loading criterion for definition, but also that the difference in the magnitude of cross-loadings across components/factors is sufficiently great enough that it makes sense to treat the manifest variable as an indicator of one component/factor as opposed to others. I actually cover this topic in a more recent presentation at ua-cam.com/video/6Ycw3_iQHD0/v-deo.html. Also, the following Powerpoint associated with that video can be download at drive.google.com/file/d/16-arUCU95wrU2G-aAykmiVbMWAzFEsJF/view . I believe in the example, I use a loading differential of .30 as a partial basis in addressing the issue of dealing with cross loadings. I hope this helps! (by the way, my use of 'component/factor' in the above comment is not meant to indicate they are the same. There are mathematical differences between them. But I used that for shorthand since I didn't want to keep saying 'component or factor' over and over again). Cheers!

    • @hectorponce2012
      @hectorponce2012 3 роки тому

      @@mikecrowson2462 I truly appreciate your comment and I will definitely check out those materials. To summarize, if there is a differential of less than .30, let us say .29, then, it would be considered a cross loading for PCA, other things being equal correct?

    • @mikecrowson2462
      @mikecrowson2462  3 роки тому +1

      @@hectorponce2012 If you are using a .30 difference threshold as a basis for deciding whether an item exhibits non-trivial cross-loadings on multiple factors, then any number less (such as .29) would be considered an indication of no real issue. Of course, the use of cutpoints has it's own limitations.
      So when deciding on whether an item is defining one factor and not others, you should really are thinking about loadings in two ways: (a) is there a substantial enough loading (e.g., .40 or >) on a given factor that it is appropriate to treat it as an indicator of that factor? And (b) is there a large enough difference in the loading involving the factor the item is most closely associated with and the loadings on the remaining factors to justify a claim that the item is best treated as an indicator of one factor and not others (if though it will likely have non-zero loadings on the other factors)? Hope this helps!

    • @hectorponce2012
      @hectorponce2012 3 роки тому

      @@mikecrowson2462 Thanks again! One more thing, what book or article should I cite to explain cross loadings in PCA?

    • @mikecrowson2462
      @mikecrowson2462  3 роки тому +1

      @@hectorponce2012 Check out the Powerpoint associated with the video I posted links to in my earlier reply. You should see a reference in there.
      best wishes!

  • @deadlypalms
    @deadlypalms 5 років тому

    Great stuff, thanks! :)

  • @Rimmonin
    @Rimmonin 5 років тому

    The parallel analysis engine website no longer works... :'(

    • @laurencenolan4492
      @laurencenolan4492 5 років тому

      I used it today. It works

    • @mikecrowson2462
      @mikecrowson2462  5 років тому +1

      You can also use Brian O'connor's syntax to carry out parallel analysis: see ua-cam.com/video/xRsiMQ1CLfI/v-deo.html . Under the video description is a link to the page where the syntax can be found.
      .

  • @DrHanjabamBarunSharma
    @DrHanjabamBarunSharma 3 роки тому

    great

  • @hemantsolanki5
    @hemantsolanki5 5 років тому

    can i get the data files

    • @mikecrowson2462
      @mikecrowson2462  5 років тому

      Hello. Links to the data can be found under the video description.

  • @mikecrowson2462
    @mikecrowson2462  4 роки тому

    HI EVERYONE, I HAVE JUST UPLOADED (OCTOBER 2019) AN NEW VIDEO ON THIS TOPIC ALONG WITH ADDITIONAL INSTRUCTIONAL MATERIALS (THAT CAN BE DOWNLOADED) AT: ua-cam.com/video/6Ycw3_iQHD0/v-deo.html . Please be sure to check it out!

  • @rachelcyr4306
    @rachelcyr4306 3 роки тому

    " people just click buttons and don't know what they mean" Sir, I am offended, I have systematic pattern for my button clicking.

    • @mikecrowson2462
      @mikecrowson2462  3 роки тому

      Hi Rachel, to be honest I can't recall that comment as this video was made long ago. Nevertheless, no offense was intended. That said, please allow me to briefly elaborate on the point I believe I was try to make (albeit inarticulately):
      I've come to realize over the years is that while software programs like SPSS do make things easier for folks by prepackaging options and canned output, there is a downside. And that is that all those pre-packaged options and buttons do not encourage users to know why certain options should be preferred or utilized. And sometimes we learn patterns that give us output without really knowing the limitations of the output that is generated or perhaps how to maximize the information that we do get. This is not a sleight on you or anyone who chooses these options. Heck, when I was learning to use SPSS, I learned to rely on patterns and program defaults myself. But the more I have read and thought about what buttons I was pushing, I came to realize that the trade-off for ease-of-use can sometimes be a kind of statistical tunnel vision because program defaults and options only convey the program developer's narrow view of what should be 'in' and what should be left 'out'. [And to be honest, SPSS has seemed to be slow over the years in expanding their options to support best practices based on the most recent literatures.] All this is to say that statistics is a way of thinking systematically about data, and all programs offer a range of options that may or may not be best for accomplishing your aims as a researcher for a given circumstance and for your particular data. My hope is that you, me, and all users of the various statistical programs out there don't simply take what is given to us at face-value, and do not assume that what is given represents the only or necessarily the best way to do things. At any rate, maybe this is a longer comment that it needed to be. However, the stat prof in me just had to respond. Thank you so much for visiting and offering your thoughts. Just an FYI, I do have another video here (ua-cam.com/video/6Ycw3_iQHD0/v-deo.html) on PCA, and you can download data to practice and a supplemental Powerpoint. Best wishes!

    • @rachelcyr4306
      @rachelcyr4306 3 роки тому

      ​@@mikecrowson2462 I am so sorry I meant that as a joke!!! I should have realized it might not come off that way. I found your video incredibly helpful as I am trying to relearn SPSS on the fly!

    • @mikecrowson2462
      @mikecrowson2462  3 роки тому

      Hi Rachel, no worries at all! I was thinking it could be a joke, but decided to answer 'just in case'. Thank you for your clarification, and I'm really happy you are finding my videos helpful! Best wishes as you keep relearning SPSS!

    • @rachelcyr4306
      @rachelcyr4306 3 роки тому

      @@mikecrowson2462 Thank you!!!

  • @oyedirantoyese9049
    @oyedirantoyese9049 5 років тому

    Poor audio

  • @milescooper3322
    @milescooper3322 5 років тому

    Great explanations (but as a speaker, you need to drop the very frequent use of "uh's" and "kind of" because they are distracting to acquiring the content).

  • @vaibhavbangar4992
    @vaibhavbangar4992 4 роки тому

    Thank you for this amazing explanation. Cheers.