


























Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Community
Ask the community for help and clear up your study doubts
Discover the best universities in your country according to Docsity users
Free resources
Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors
Instructions on how to use R packages and techniques to assess the normality and homogeneity of variances for parametric statistics. It covers graphical displays, skewness and kurtosis, and statistical tests such as Shapiro-Wilk and Levene's test. The document also includes examples using the Festival dataset.
Typology: Study notes
1 / 34
This page cannot be seen from the preview
Don't miss anything!
http://www.pelagicos.net/classes_biometry_fa18.htm
Characteristics of Normal Distributions Unimodal, Symmetrical, Bell-shaped
Biologist worried about potential health effects of music festivals. Measured hygiene of 810 concert-goers over the three days of a music festival. Hygiene measured using standardized index (from 0 to 4): 0 = you smell terribly 4 = you smell beautifully Import Download Festival Data (MusicFestival.xlsx) For ease of use, rename the Data Set “Festival”
Festival <- DownloadFestival
day1 day2 day histogram density
The solid red line is the expected pattern a normal distribution with the same mean and SD and the sampled data. Points outside of the dashed line envelope suggest significant deviations day
day 2 day 3
We can also explore the summary statistics describing the three datasets (day1, day2, day3) using RCmdr: What statistics would you use to assess data normality? NOTE: multiple datasets can be analyzed at once
Exploring the summary statistics describing the three datasets (day1, day2, day3) using RCmdr:
numSummary(Festival[,c("day1", "day2", "day3"), drop=FALSE], statistics=c("mean", "quantiles", "skewness", "kurtosis"), quantiles=c(.5), type="2") mean skewness kurtosis 50% n NA day1 1.7933580 8.865312 170.4502658 1.79 810 0 day2 0.9609091 1.095226 0.8222057 0.79 264 546 day3 0.9765041 1.032868 0.7315003 0.76 123 687
Exploring additional datasets using other functions: stat.desc() function in psych package
stat.desc(Festival$day1, basic = FALSE, norm = TRUE) basic argument: Basic statistics included if TRUE (Note: FALSE is the default) norm argument: Statistics relating to normal distribution included if TRUE (Note: FALSE is the default)
stat.desc(Festival$day1, basic = FALSE, norm = TRUE) median mean 1.790000e+00 1.793358e+ SE.mean C.I.mean.0. 3.318617e- 02 6.514115e- 02 var std.dev 8.920705e- 01 9.444949e- 01 coef.var
5.266627e- 01