How to detect outliers in SPSS

Поділитися
Вставка
  • Опубліковано 19 сер 2024

КОМЕНТАРІ • 43

  • @danielgooner4ever
    @danielgooner4ever 8 років тому +38

    Nice video, SPSS always breaks my brain

  • @JonesAcademy7
    @JonesAcademy7 4 роки тому +1

    Thank you for this video. Years later and it is still helpful.

  • @macacoman
    @macacoman 7 років тому +1

    This is one of my favorite channels on youtube! Thorough yet clear. Keep up the good work man!

  • @lisaheaney31
    @lisaheaney31 4 роки тому +1

    Thanks for providing citations! Really helpful.

  • @yvesburtworthington3244
    @yvesburtworthington3244 7 років тому

    Thanks you for helping me with my homework in Advanced Statistics

  • @anitacarrier9386
    @anitacarrier9386 2 роки тому

    My lecturer told me not to use box plots to check for outliers as it only uses the median and interquatile range rather than the mean, he then advised me to create z-scores to find outliers as this is based on the mean, however, he only showed us how to do that manually and not with spss.

  • @AnelyBek
    @AnelyBek 3 роки тому

    Thank you Dr. How2stats!

  • @gabitroyano
    @gabitroyano 4 роки тому

    Thank you for the explanation! It's very good and simple! Thanks a lot!

  • @dsavkay
    @dsavkay 5 місяців тому

    Thanks, great insight! 💯

  • @snakeyjake7
    @snakeyjake7 5 років тому

    Really helpful, informative and to the point. Thanks!

  • @vbeija
    @vbeija 7 років тому

    Thank you for the instructions and references.

  • @YooBro219
    @YooBro219 3 роки тому

    Sir you the GREAT

  • @saro4761
    @saro4761 7 років тому

    Thanks so much for this valuable information

  • @willjfit9345
    @willjfit9345 3 роки тому +1

    How do you remove the outliers?

  • @mittadileepkumar3756
    @mittadileepkumar3756 7 років тому

    Thank you so much for an amazing explanation. :)

  • @diogotalhinhas1146
    @diogotalhinhas1146 2 роки тому

    muy bueno

  • @kyrank.4321
    @kyrank.4321 6 років тому

    Thanks, this was very helpful

  • @HieuNguyen-ju4zl
    @HieuNguyen-ju4zl 5 років тому

    Thank you

  • @ElizabethPepple
    @ElizabethPepple 5 років тому

    Thank you!

  • @ThePookie25
    @ThePookie25 7 років тому

    Thank you for this!

  • @SaadKhanYousafzai
    @SaadKhanYousafzai 7 років тому +1

    Hi there. First of all I have to thank you for such amazing videos. Secondly I have a problem and I have tried hard to find a solution but all in vain. I had some missing data and on top of it I also removed few outliers. I have multiple variables for single subject. I tried to do a repeated measure ANOVA but just because of one missing variable for a subject, all other variables are also ignored and I am loosing subjects. A had 23 subjects but ANOVA analyze just 14. If I put ZERO in missing varaible's place it gives me lower MEAN value. Please tell me how to fix the missing data so I can analyse all the subjects and it should also not affect my MEANS for all the varaibles.
    P.S: I can not to any computation method (I have seen your MCAR videos) to predict the values. It will mess up my data very bad.

  • @nargisali7298
    @nargisali7298 3 роки тому

    In multivariate analysis, a Zscore = 3.2 would be an outlier if the data set contain 1000 cases?

  • @tsehayneshgedefew5310
    @tsehayneshgedefew5310 2 роки тому

    I do have two questions.first is it mandatory to check normality for individual contnious variables or one by one secondly can we check normality of our data after coding?

  • @RajeshChaudhary
    @RajeshChaudhary 3 роки тому

    It would be great to know about a technique in SPSS to identify an outlier based on standard deviation. Could you please guide on this?

  • @ricardovonschoettler
    @ricardovonschoettler 4 роки тому

    Thanks for the video, it has helped me in my research work. But if I have a query, in the case of time series, if we want to assess normality, should this be done only on the component called "noise"? Thanks

  • @kritikadmonty8991
    @kritikadmonty8991 4 роки тому

    Can we use the method of labelling outliers for non-normal data ? If not how do we identify outlier in non normal data?

    • @how2stats
      @how2stats  4 роки тому

      Depends on how non-normal the distribution is. I'd say skew less than .50 should be fine. There are outlier detection methods for non-normal distributions, but I haven't learned them yet!

  • @diogotalhinhas1146
    @diogotalhinhas1146 2 роки тому

    grazi mile

  • @milenah2227
    @milenah2227 5 років тому

    Good work, thank you for the video! But I've got the problem that my variable is metric with a huge range from 3 to 12 000 000, that is why I can't detect the extreme outliers (multiplier 3.0) visually in the boxplot visualization. The scale is too wide to identify the values that are too low. How can I solve that problem?

    • @how2stats
      @how2stats  5 років тому

      Extreme outliers can distort the visual appeal of a box plot. You might consider simply reporting that the value of 12 000 000 was an outlier and dealt with (either removed or Winsorised). Then, re-do the box plot.

  • @komaljerawla5699
    @komaljerawla5699 4 роки тому

    once you detect an outlier what do you do next? do v remove it from the data set?

    • @how2stats
      @how2stats  4 роки тому

      Good question. I usually winsorize it: ua-cam.com/video/WJuB0vZp6w4/v-deo.html

  • @alexsisccdr
    @alexsisccdr 7 років тому +2

    Great videos. Where can I get the Excel you are using to calculate outliers based on the 2.2 multiplier?

  • @devez7
    @devez7 5 років тому

    so how do u choose the 3 multiplier? u did the same thing

    • @how2stats
      @how2stats  5 років тому +3

      You don't have to "choose" anything. SPSS automatically reports results with the 1.5 and 3.0 multipliers (circles and stars, respectively).

  • @slsmithy8075
    @slsmithy8075 6 років тому

    Hi, probably a dumb questions, but when you go from the Var1 data set to Var2 data set, what would you call the "error bars" in the var2 graph, because technically the top error bar isnt the "maximum" as the "maximum" is the outlier. Thanks.

    • @how2stats
      @how2stats  6 років тому

      It's a fine question. They correspond to the 25th (low bar or lower quartile) and 75th (high bar or upper quartile) percentiles.

    • @slsmithy8075
      @slsmithy8075 6 років тому

      @@how2stats I thought the 25th and 75th percentile were the top and bottom lines of the box?
      Im asking what would you call the error bar above and below the box, given the outlier is the 'maximum'.

  • @shaunlikescheese
    @shaunlikescheese 6 років тому

    Does the 2.2 multiplier break down at all when applied to larger data sets? Say, n = 600?

    • @how2stats
      @how2stats  6 років тому

      Yes. I'd use 2.2 multiplier for samples between 20 and 300. Thereafter, I'd use a multiplier of 3.0.

    • @shaunlikescheese
      @shaunlikescheese 6 років тому

      Is there research supporting this though?

    • @how2stats
      @how2stats  6 років тому

      Yes, check out Hoaglin's research; he might say it in this paper: Hoaglin, D. C., Iglewicz, B., & Tukey, J. W. (1986). Performance of some resistant rules for outlier labeling. Journal of the American Statistical Association, 81(396), 991-999.
      Or another paper in that time period.

  • @Sharpdus
    @Sharpdus 5 років тому

    so how do you delete this damn 12