Checking normality using skewness, kurtosis, Kolmogorov-Smirnov and Shapiro-Wilk tests

Поділитися
Вставка
  • Опубліковано 29 лис 2024

КОМЕНТАРІ • 109

  • @wajahatalighulam6924
    @wajahatalighulam6924 Рік тому +1

    Kind and Soft way of teaching, Thanks Sir

  • @panpanyang8938
    @panpanyang8938 4 роки тому +4

    easy to follow and very informative explanination

  • @laylaboroon1579
    @laylaboroon1579 2 роки тому +2

    Thank you so much for the great presentation. It was really informative.

  • @farzanehtaghaviekrami8154
    @farzanehtaghaviekrami8154 4 роки тому +2

    Thank you for your good explanation.

  • @ahmedaremu2304
    @ahmedaremu2304 5 місяців тому

    Thank you, Prof. I would be glad if you could provide sources for the thresholds.

  • @mudasserabbas9260
    @mudasserabbas9260 3 роки тому +5

    Thank you for this wonderful video, I would really appreciate it if you can provide any reference for skewness value between -2 and +2. Absolute kurtosis value between -7 and +7 Guidelines. thankyou

    • @VahidAryadoust
      @VahidAryadoust  3 роки тому +2

      Please see the following video for a list of references: ua-cam.com/video/Kw5zaxB8Zac/v-deo.html&

    • @mudasserabbas9260
      @mudasserabbas9260 3 роки тому

      @@VahidAryadoust thank you so much

    • @vainbow4632
      @vainbow4632 9 місяців тому

      @@VahidAryadoust The exponential distribution has an excess kurtosis of 6 and a skewness of 2. Those bounds on the skewness and kurtosis for large samples here are so ridiculously wide that one would call almost any distribution normal using them.

  • @crazystatistician
    @crazystatistician Рік тому

    Hi Dr. Vahid,
    Thank you very much for this useful video!
    I have 2 questions about normality:
    1. When we are calculating the skewness and kurtosis values or performing the Kolmogorov and Shapiro-Wilk tests, should we do these tests with the mean of the variables or items of each scale?
    2. Should we also look at the histogram to decide whether our data is normally distributed, apart from these tests and skewness and kurtosis values?
    Cheers,

  • @sudeshchinthaka9120
    @sudeshchinthaka9120 2 роки тому

    Thank you Dr...So much informative and learned lot..

  • @TODM4XNL
    @TODM4XNL 4 місяці тому

    Thanks for the great video. I have a question: Would you happen to have a reference where I could read about the material your presented in the video?

  • @mahirjibril8097
    @mahirjibril8097 2 роки тому +1

    Appreciated for your elaborative video. Just wondering if you citation is available for the assumptions

  • @DP-xo2gh
    @DP-xo2gh 3 роки тому +5

    Thank you for your presentation. Could you please provide a citation for recommending z range of -3.29 - 3.29 for samples between 50-300?

    • @VahidAryadoust
      @VahidAryadoust  3 роки тому

      Typically many multivariate analysis textbooks cover such requirements. Please check out Hair et al's volume.

  • @amirhoseinmontazeri2570
    @amirhoseinmontazeri2570 4 роки тому +7

    thanks for your great and useful video. I have a question:
    can you reference those 4 criteria, please?
    (i mean criteria about skewness, kurtosis and sample size for Kolmogorov-Smirnov and Shapiro-Wilk tests )

    • @VahidAryadoust
      @VahidAryadoust  4 роки тому +2

      Hi, if you mean reference textbooks, you can look here:
      www.routledge.com/Quantitative-Data-Analysis-for-Language-Assessment-Volume-I-Fundamental/Aryadoust-Raquel/p/book/9781138733121

    • @medinachely
      @medinachely 4 роки тому +1

      @@VahidAryadoust I have the same situation with my data. The histogram, skewness and curtosis are in the normal distribution. But the KS test results is .000 of Sig. I'm reading the book you recommended, but I can't find in the discussion of KS and large sample. What is the chapter that discuss this. I need it to report in my paper.

    • @VahidAryadoust
      @VahidAryadoust  4 роки тому

      @@medinachely if your sample size is not too small, you can stick with the results of skewness and kurtosis analysis.

    • @mohamedzeineldin9707
      @mohamedzeineldin9707 3 роки тому

      @@VahidAryadoust could you please provide a reference for this .. as i have to mention in my paper why i did that

    • @VahidAryadoust
      @VahidAryadoust  3 роки тому

      @@mohamedzeineldin9707 Please see this book/ chapter 1:
      www.routledge.com/Quantitative-Data-Analysis-for-Language-Assessment-Volume-I-Fundamental/Aryadoust-Raquel/p/book/9780367671396

  • @yaregaltayelegesse2267
    @yaregaltayelegesse2267 11 місяців тому

    Hello sir, I liked it very much but could you please mention a source for the explanation you have given in relation to kolmogove and Shapiro-wilk tests so that we can cite. I faced the same problem because my data is large which is 600

  • @koshaoza8657
    @koshaoza8657 2 місяці тому

    Hello sir.Thank you for this informative video. my sample is 770. My question arises regarding my skewness and kurtosis values, which suggest that my data is normally distributed. However, the results from the Shapiro-Wilk and Kolmogorov tests, along with histograms and Q-Q Plots, indicate that the data is not normally distributed. Which set of results should I trust?

  • @ayeshakhalil1947
    @ayeshakhalil1947 Рік тому +1

    Is there a reference for these ranges being normal

  • @ahsanqureshi3467
    @ahsanqureshi3467 3 роки тому +3

    hi, is there a reference identifying Absolute skewness value between -2 and +2. Absolute kurtosis value between -7 and +7 Guidelines?

    • @VahidAryadoust
      @VahidAryadoust  3 роки тому

      Please see this video for more information: ua-cam.com/video/Kw5zaxB8Zac/v-deo.html

  • @crazystatistician
    @crazystatistician Рік тому +2

    Hi, my sample is 500. Another question arises regarding my skewness and kurtosis values, which suggest that my data is normally distributed. However, the results from the Shapiro-Wilk and Kolmogorov tests, along with histograms and Q-Q Plots, indicate that the data is not normally distributed. Which set of results should I trust?

    • @MeghaR-v9n
      @MeghaR-v9n 3 місяці тому

      same my sample size is 300+, and i am so confused about this

    • @sabrinazazoun3733
      @sabrinazazoun3733 3 місяці тому

      Same here, I have a sample of 20 rats distributed on 4 groups of 5 rats each and found the same problem as you,what to do?

  • @soehartosoeharto8471
    @soehartosoeharto8471 4 роки тому +2

    thanks for your sharing

  • @dsavkay
    @dsavkay Рік тому +1

    THanks! Amazing

  • @bartul5645
    @bartul5645 3 місяці тому

    If there were control and experiment groups, should we perform normality tests for parameters one in the control and one in the experiment? If so, that parameter is normally distributed in control and in experiment then can we say it's normally distributed? Or else we should also check if it is normally distributed in general?

  • @littilgirl3220
    @littilgirl3220 4 місяці тому

    I am ganna try it tommorow ,thank u

  • @shubhamjain1334
    @shubhamjain1334 Рік тому +1

    thank you so much

  • @Drkelvinagbakwuru
    @Drkelvinagbakwuru 2 роки тому

    Thank you for this presentation sir. But I want to ask, why did you use only four variables in the "Dependent List" column when we have other variables in the data?

    • @VahidAryadoust
      @VahidAryadoust  2 роки тому

      The video is only for demonstration. You can use more variables, but in that case, the sample should be larger.

  • @Alhamzah_F_Abbas
    @Alhamzah_F_Abbas 3 роки тому

    Very informative session, thank you very much

  • @doors22903
    @doors22903 2 роки тому +1

    Thank you for the informative helpful video, Doctor! If I may ask few questions, are the four variables that you used in this video aggregate or mean of each representing indicators? For example, the variable "comprehession" is aggregate or mean of indicators "qrc1r, qrc2r, qrc3r, and so on" I see in the video? Would appreciate your response, thank you.

  • @Yudo780
    @Yudo780 5 місяців тому

    thank you very much Sir..

  • @vannphk7834
    @vannphk7834 4 роки тому +1

    Very helpful, thank you !

  • @tanzeemaruhi1148
    @tanzeemaruhi1148 3 роки тому +1

    Hi, why are some of the skewness and kurtosis values different for the first table (from where you found z value) and the second table ( the more detailed one)?

    • @VahidAryadoust
      @VahidAryadoust  3 роки тому +1

      It is probably due to the rounding methods used in the two tables.

  • @mennatullah2551
    @mennatullah2551 3 роки тому +1

    Can you please state reference for normality if z range between 3 and -3 it is normal

    • @VahidAryadoust
      @VahidAryadoust  3 роки тому

      Please see Kline 2015 for further information.

  • @sritama12
    @sritama12 3 роки тому +1

    Hello Dr, very informative lecture. Could you give some references for the ranges mentioned for the absolute value of skewness and kurtosis? Thanks

    • @VahidAryadoust
      @VahidAryadoust  3 роки тому

      sure, please see Kline (2015) (a book about structural equation modeling).

    • @sritama12
      @sritama12 3 роки тому

      @@VahidAryadoust Thank you Dr Aryadoust. I have the third edition of the book, where Kurtosis cut off is mentioned as +3to -3. And skewness as +.5 to -.5. I will check the particular edition.

    • @chuza2003
      @chuza2003 3 роки тому

      @@sritama12 Did you find it in 5th edition?

    • @sritama12
      @sritama12 3 роки тому

      @@chuza2003 No,

  • @unknowninvictus2520
    @unknowninvictus2520 2 роки тому

    Hello, I have a question, Where could I find the (statistical) theory of this video. More precisely what literature was used for the values and sample sizes for the criteria of skewness and kurtosis. Many thanks!

  • @marchlel
    @marchlel 3 роки тому +1

    Hi Dr. Thank you so much for sharing this. I have a question:
    Do we need both absolute skewness and kurtosis value or only one of them?

    • @VahidAryadoust
      @VahidAryadoust  3 роки тому

      Both of them should be reported in your paper.

    • @marchlel
      @marchlel 3 роки тому

      @@VahidAryadoustI'm sorry, maybe my question is not clear. To consider the data is normal, both absolute value should follow the value given? For example, the data show that absolute skewness is between +2 to -2, but not the absolute kurtosis value. So the data is consider to be normal distributed? Thank you in advanced Dr.

    • @VahidAryadoust
      @VahidAryadoust  3 роки тому

      @@marchlel Yes, both the indices (skewness and Kurtosis) should follow the constraint tenable. Note that Kline (2015) has a more liberal range for Kurtosis: -7 to +7, whereas skewness has a more stringent range in normal data (-2 to +2).

    • @marchlel
      @marchlel 3 роки тому +1

      @@VahidAryadoust thank you so much Dr 😊 I really appreciate it!

  • @fondueeundof3351
    @fondueeundof3351 2 роки тому

    How can I check for normality using skewness and kurtosis estimators that already assume the underlying distribution to be normal? Isn't that going in circles?

    • @VahidAryadoust
      @VahidAryadoust  Рік тому +1

      There are different ways to estimate skewness & Kurtosis. One should consult the literature to choose the best.

  • @rayboat119
    @rayboat119 2 роки тому

    Thank you for the presentation. I want to conduct an SEM, but it seems my data is not normally distributed ... can I proceed with the analysis?

    • @VahidAryadoust
      @VahidAryadoust  2 роки тому

      Do not use ML method of parameter estimation.

    • @rayboat119
      @rayboat119 2 роки тому +1

      Thank you. Your videos are super helpful.

  • @szekhiongfam5958
    @szekhiongfam5958 Рік тому

    Hi doctor, can I ask for my data with sample size of 50 patients, the skewness and kurtosis test is within normal range, however the SW test was not normally distributed, may i know which one to follow, thank you 😊

  • @gracethong3883
    @gracethong3883 3 роки тому

    if i evaluate normality using skewness for 30 variables, all 29 variables are distributed within normal range except one variable is not skew normally. Can I choose parametric test for these 30 variables since all 30 variables are consider as one factor?

  • @user-ho8yj9xo4m
    @user-ho8yj9xo4m 3 роки тому

    thank you Dr

  • @asbinanceasbinance1473
    @asbinanceasbinance1473 3 роки тому

    Thank you for the video. Do you have a reference for the skewness and kurtosis value ranges for larger samples? Where do these ranges come from? Thanx a lot!

    • @VahidAryadoust
      @VahidAryadoust  3 роки тому +1

      Please see Kline (2015), a volume on structural equation modeling.

    • @asbinanceasbinance1473
      @asbinanceasbinance1473 3 роки тому

      @@VahidAryadoust Thank you very much for your answer and for the source. This is a great help. Best regards.

  • @afnanaboras7649
    @afnanaboras7649 4 роки тому +2

    how ca I calculate the Z range please?

    • @kailiangtiew3577
      @kailiangtiew3577 3 роки тому

      The Z value can be get from your computed statistics of skewness and Kurtosis value divide by standard errors.

  • @henokasfaw8254
    @henokasfaw8254 2 роки тому

    My sample size is 180. All the Z scores are below 3.29(except one). But the Kolmogorov-Smirnov and Shapiro-Wilk tests showed the data is not normally distributed(significance less than 0.05). Should I consider it as normally distributed or not?

  • @aqsashabbir1380
    @aqsashabbir1380 2 роки тому +1

    Hello Dr.
    My sample size is less than 300 so (- 3.29 +3.29) range i used.. But if i perform other styles to check normality, then my data is considered not normal distributed.
    Please guide me if my data is normal then which test i slected paramtric or non.pramatric? Thnk u in advance 😊

    • @VahidAryadoust
      @VahidAryadoust  2 роки тому

      For normal data, please use parametric tests.

    • @aqsashabbir1380
      @aqsashabbir1380 2 роки тому

      @@VahidAryadoust thank you sir for guiding me.. One more thing is that my data is normaly distributed and apply parametric test but my data is ordinal because i use likert scale data in questionneir..... Then what can i do?

    • @aqsashabbir1380
      @aqsashabbir1380 2 роки тому

      And i read that for ordinal data apply non.parametric test.... I'm soo much confused about it.

    • @VahidAryadoust
      @VahidAryadoust  2 роки тому

      @@aqsashabbir1380 That is fine. You can still look into its normality.

    • @aqsashabbir1380
      @aqsashabbir1380 2 роки тому +1

      @@VahidAryadoust thank you so much sir 🙂

  • @Cxcxco89
    @Cxcxco89 2 роки тому

    Thank you for sharing professor, would you suggest to use the kurtosis/skeweness rule of thumb in order to check the normality distribution of a likert scale on 5 points? I know there is debate on the possibility to approximate the distribution of this kind of scale to normality

    • @VahidAryadoust
      @VahidAryadoust  2 роки тому +1

      That is a decision to be made by the analyst. To me, skewness and kurtosis rules will be fine.

    • @Cxcxco89
      @Cxcxco89 2 роки тому

      @@VahidAryadoust Thanks again

  • @Katrina86
    @Katrina86 3 роки тому

    Dear Professor, thank you for a good lecture. I would like to ask if I have a small sample (less than 50 cases) and Shapiro-Wilk and Kolmogorov-Smirnov tests show that distribution is normal, however when I inspect the histogram it looks like not normal distribution of the variable. What should I conclude from this? Thank you in advance.

    • @VahidAryadoust
      @VahidAryadoust  3 роки тому +1

      I would go with the formal tests, i.e., Shapiro-Wilk and Kolmogorov-Smirnov tests.

    • @Katrina86
      @Katrina86 3 роки тому +1

      @@VahidAryadoust Thank you!

  • @MoonLight-nc4fi
    @MoonLight-nc4fi 3 роки тому

    Hi Dr. Vahid. I have a question
    about the descriptive stats in your paper ( gender and academic major bias in peer assessment of oral presentations). For the scoring criteria scores, did you run the analyses on raw scores? How many raters rated each student’s performance?
    In my case, each student has been rated twice by 2 different raters using an analytical rubric. Do you have any suggestion for running this type of desc stats? Do I put all the scores in spas? Thanks a lot

    • @VahidAryadoust
      @VahidAryadoust  3 роки тому +1

      You should run agreement analysis or inter-rater reliability first. If these look fine, then average the scores and compute kurtosis, skewness etc of the average scores. Videos on rater agreement analysis etc are available in this channel.

  • @davidbell7361
    @davidbell7361 3 роки тому

    excellent. Thank you...

  • @solomon5955
    @solomon5955 3 роки тому

    what is the reason beyond for Shapiro-Wilk didn't achieve, but the histogram PP plot and other results are well??

    • @VahidAryadoust
      @VahidAryadoust  3 роки тому

      SW is a formal statistical test. I would suggest using skewness and kurtosis for examining normality.

  • @mariasusanoliveratorres4295
    @mariasusanoliveratorres4295 3 роки тому

    Hello, If I have a sample of 301 participants, is it consider unreliable? or it does not change that much. thank you.

    • @VahidAryadoust
      @VahidAryadoust  3 роки тому

      No, the sample seems to be fine (depending on the number of your variables.).

  • @0127550443
    @0127550443 2 роки тому

    HI, I'm having 300 respondents in my research, is the Kolmogorov-Smrinov test suitable for to determine the normality of the samples?

    • @VahidAryadoust
      @VahidAryadoust  2 роки тому +1

      I suggest using skewness and kurtosis the way discussed in the video.

    • @0127550443
      @0127550443 2 роки тому

      @@VahidAryadoust Alright,thank you

  • @OumaymaBarbou
    @OumaymaBarbou 6 місяців тому

    Thanxxx a lot !

  • @ericoti4624
    @ericoti4624 3 роки тому

    very helpfull thank you very much

  • @danielcanoy5803
    @danielcanoy5803 2 роки тому

    Hi, Dr. Can we run a normality test for Likert Scale data? Or Likerts are already understood as not normally distributed?

    • @VahidAryadoust
      @VahidAryadoust  2 роки тому +1

      There are two schools of thought:1) yes, you can (this is practically the dominant school of thought) 2) no, since Likert scales are not normally distributed.

    • @danielcanoy5803
      @danielcanoy5803 2 роки тому +2

      @@VahidAryadoust Thanks, Dr.!

    • @danielcanoy5803
      @danielcanoy5803 2 роки тому

      Another question, Dr., what if the results of the Confirmatory Factor Analysis' CFI, TLI, RMSEA, SMNR were below the significant values, what should a researcher do next?

    • @VahidAryadoust
      @VahidAryadoust  2 роки тому

      @@danielcanoy5803 Look at modification indices, to find out how to improve the fit of the model. For example, you might be prompted to identify the worst (the lowest) path coefficients and remove the path; re-run the analysis iteratively till you arrive at a better model.