To introduce plotting scatter graphs and understanding correlation I ask students to think about the relationships between different variables and to describe how they might be related.
Here’s my starter activity which students discuss in pairs then present to me on mini-whiteboards.
When the students have had time to discuss the matching pairs we talk about how each graph is likely to look if we were to plot eight typical samples.
As we begin the first example we discuss the type of relationship we expect to see when time spent reading is plotted against time spent watching TV for a sample of ten people. The consensus is the more time people spend reading the less time they are likely to spend watching TV. I ask the class to sketch on their mini-whiteboards what the scatter graph might look like if our hypothesis is correct.
For the first couple of examples I provide the scaled axes for the class. In later examples on plotting scatter graphs and understanding correlation I expect students to choose and draw their own axes on A4 graph paper with appropriate scaling.
When we have plotted the points, I introduce the term correlation as a means to describe the relationship between two variables. There are two types of correlation.
If two variables are not related the points will be scattered so no correlation is apparent.
A line of best fit can be used to clearly illustrate the directional trend of the data. The closer the points are to the line of best fit the stronger the correlation. We discuss the strength of the correlation as an indication of how closely two variables are related.
The line of best fit also helps to predict the value of one variable when the other is known. It is noted in several examiners reports by AQA and Edexcel that students are more likely to correctly estimate the value of a missing data point and identify anomalous data points if they use a line of best fit.
In my experience, there are three main misconceptions when drawing lines of best fit.
The line of best fit connects to the origin.
The line of best fit is drawn as a line segment connecting the extreme value to the origin.
The line of best fit passes through each of the points.
As an extended plenary I challenge the students to create a scatter graph based on their own hand and foot size. Before we collect any data, I ask the students to write down their own hypothesis at the top of the A4 graph paper.
This is a fun activity which requires the group to work together so everyone has everybody else’s data. Whenever I do this it amazes me which student steps up to take charge of organising everyone. I do my best to keep out of the way and let the students manage the data collection so it is fair and accurate.
In the next lesson we go on to discuss the limitation of using scatter graphs and correlation to identify causation. We consider examples such as the number of ice creams sold and drownings would correlate in the summer months as increased temperatures causes people to go swimming and eat ice cream. However, eating ice creams do not cause people to drown.
In my experience, students, in general, find the concept of a mean straightforward to calculate and understand. However, the mean alone does not provide a complete picture of a set of data. To achieve this, a measure of spread is also required. The range is the simplest measure that can be used for this. Not […]
Solving problems with angles in parallel lines is like solving a murder mystery. One clue leads on to the next and the next until the murderer is found. However, it doesn’t end there. The detectives need to explain their reasoning in court using the relevant laws and procedures should the murderer plead not guilty. If […]
An equation is when one expression, or term, is equal to another. To solve an equation means to find the value of the variable (represented by a letter) that makes the two expressions equal. There are two types of equations for secondary school mathematics, linear and none-linear. In this blog I write about how I […]