Chi-Square Analysis: Testing the Distribution of Observed Frequencies | Lecture notes Psychology

Chi Square Analysis

When do we use chi square?

More often than not in psychological research, we find ourselves collecting scores

from participants. These data are usually continuous measures, and might be scores

on a questionnaire or psychological scale, reaction time data or memory scores, for

example. And when we have this kind of data, we will usually use it to look for

mean differences on scores between or within groups (e.g. using t-tests or ANOVAs),

or perhaps to look for relationships between different types of scores that we have

collected (e.g. correlation, regression).

However sometimes we do not have this kind of data. Sometimes data will be a lot

simpler than this, instead consisting only of frequency data. In these cases

participants do not contribute scores for analysis; instead they each contribute to a

“head count” within different grouping categories. This kind of data is known as

categorical data, examples of which could be gender (male or female) or university

degree classifications (1, 2:1, 2:2, 3, pass or fail) – or any other variable where each

participant falls into one category. When the data we want to analyse is like this, a

chi-square test, denoted χ², is usually the appropriate test to use.

What does a chi-square test do?

Chi-square is used to test hypotheses about the distribution of observations in

different categories. The null hypothesis (Ho) is that the observed frequencies are

the same as the expected frequencies (except for chance variation). If the observed

and expected frequencies are the same, then χ² = 0. If the frequencies you observe

are different from expected frequencies, the value of χ² goes up. The larger the

value of χ², the more likely it is that the distributions are significantly different.

…but what does this mean in English?

To try and explain this a little better, let's think about a concrete example. Imagine

that you were interested in the relationship between road traffic accidents and the

age of the driver. We could randomly obtain records of 60 accidents from police

archives, and see how many of the drivers fell into each of the following age-

categories: 17-20, 21-30, 31-40, 41-50, 51-60 and over 60. If there is no relationship

between accident-rate and age, then the drivers should be equally spread across the

different age-bands (i.e. there should be similar numbers of drivers in each

category). This would be the null hypothesis. However, if younger drivers are more

likely to have accidents, then there would be a large number of accidents in the

younger age-categories and a low number of accidents in the older age-categories.

Chi-Square Analysis: Testing the Distribution of Observed Frequencies, Lecture notes of Psychology