
To introduce plotting scatter graphs and understanding correlation, I ask students to think about pairs of variables and to describe how each pair might be related.

Here’s my starter activity which students discuss in pairs then present to me on mini-whiteboards.

When the students have had time to discuss the matching pairs we talk about how each graph is likely to look if we were to plot eight typical samples.

As we begin the first example, we discuss the type of relationship we expect to see when time spent reading is plotted against time spent watching TV for a sample of ten people. The consensus is that the more time people spend reading, the less time they are likely to spend watching TV. I ask the class to sketch on their mini-whiteboards what the scatter graph might look like if our hypothesis is correct.

For the first couple of examples I provide the scaled axes for the class. In later examples on plotting scatter graphs and understanding correlation I expect students to choose and draw their own axes on A4 graph paper with appropriate scaling.

When we have plotted the points, I introduce the term correlation as a means to describe the relationship between two variables. There are two types of correlation.

- A positive correlation means that as one variable increases, the other tends to increase too; as one decreases, so does the other.
- A negative correlation means that as one variable increases, the other tends to decrease.

If two variables are not related the points will be scattered so no correlation is apparent.
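For readers who want to put a number on the direction of a relationship, here is a minimal Python sketch of Pearson's correlation coefficient. This is a standard statistic that is positive for a positive correlation, negative for a negative correlation, and close to zero when no linear relationship is apparent. It goes beyond what the lesson requires, and the reading and TV figures below are invented purely for illustration.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient for two equal-length lists of numbers."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # How the two variables move together, and how much each varies on its own.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Invented data for ten people (hours per week), for illustration only.
reading = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
watching_tv = [20, 19, 16, 15, 13, 12, 9, 8, 5, 4]

r = pearson_r(reading, watching_tv)
print(round(r, 2))  # negative, because TV time falls as reading time rises
```

Replacing the TV figures with values that rise alongside the reading figures makes the coefficient positive.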

A line of best fit can be used to clearly illustrate the directional trend of the data. The closer the points are to the line of best fit the stronger the correlation. We discuss the strength of the correlation as an indication of how closely two variables are related.

The line of best fit also helps to predict the value of one variable when the other is known. Several examiners' reports by AQA and Edexcel note that students are more likely to estimate the value of a missing data point correctly, and to identify anomalous data points, if they use a line of best fit.
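A line of best fit drawn by eye can be checked against a calculated one. The sketch below uses the least squares method, which is one common way of defining "best fit" and is not something students are expected to do at this stage. The data are invented, and the function names `line_of_best_fit` and `estimate_tv_hours` are just illustrative labels.

```python
def line_of_best_fit(xs, ys):
    """Return (gradient, intercept) of the least squares line y = gradient * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    gradient = sxy / sxx
    intercept = mean_y - gradient * mean_x
    return gradient, intercept


# Invented data for ten people (hours per week), for illustration only.
reading = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
watching_tv = [20, 19, 16, 15, 13, 12, 9, 8, 5, 4]

gradient, intercept = line_of_best_fit(reading, watching_tv)


def estimate_tv_hours(reading_hours):
    """Use the fitted line to estimate TV time from a known reading time."""
    return gradient * reading_hours + intercept


print(round(estimate_tv_hours(5.5), 1))
```

Because a least squares line always passes through the mean point, the estimate for 5.5 hours of reading (the mean reading time) is the mean TV time, 12.1 hours. Estimates are most trustworthy inside the range of the plotted data.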

In my experience, there are three main misconceptions when drawing lines of best fit.

- The line of best fit connects to the origin.
- The line of best fit is drawn as a line segment connecting the extreme value to the origin.
- The line of best fit passes through each of the points.
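To see why forcing the line through the origin is a problem, the following sketch compares two calculated lines on the same invented data: a least squares line that is free to cross the vertical axis anywhere, and the best possible line that must pass through the origin. Least squares is an assumption here, standing in for a line drawn by eye, and the data are made up for illustration.

```python
def sum_squared_residuals(xs, ys, gradient, intercept):
    """Total squared vertical distance between the points and the line."""
    return sum((y - (gradient * x + intercept)) ** 2 for x, y in zip(xs, ys))


# Invented data for ten people (hours per week), for illustration only.
reading = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
watching_tv = [20, 19, 16, 15, 13, 12, 9, 8, 5, 4]

n = len(reading)
mean_x = sum(reading) / n
mean_y = sum(watching_tv) / n
sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(reading, watching_tv))
sxx = sum((x - mean_x) ** 2 for x in reading)

# Line that is free to cross the vertical axis wherever the data suggest.
free_gradient = sxy / sxx
free_intercept = mean_y - free_gradient * mean_x

# Best line of the form y = gradient * x, which is forced through the origin.
origin_gradient = sum(x * y for x, y in zip(reading, watching_tv)) / sum(
    x * x for x in reading
)

free_error = sum_squared_residuals(reading, watching_tv, free_gradient, free_intercept)
origin_error = sum_squared_residuals(reading, watching_tv, origin_gradient, 0)

print(free_gradient < 0, origin_gradient > 0)
print(free_error < origin_error)
```

With this data the line forced through the origin slopes upwards even though the correlation is negative, and its points sit much further from the line.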

As an extended plenary I challenge the students to create a scatter graph based on their own hand and foot size. Before we collect any data, I ask the students to write down their own hypothesis at the top of the A4 graph paper.

This is a fun activity which requires the group to work together so everyone has everybody else’s data. Whenever I do this it amazes me which student steps up to take charge of organising everyone. I do my best to keep out of the way and let the students manage the data collection so it is fair and accurate.

In the next lesson we go on to discuss the limitations of using scatter graphs and correlation to identify causation. For example, the number of ice creams sold and the number of drownings are likely to correlate over the summer months, because higher temperatures lead more people both to go swimming and to eat ice cream. However, eating ice cream does not cause people to drown.
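The ice cream example can be simulated. In the minimal sketch below, temperature drives both quantities and neither one appears in the formula for the other, yet the two still correlate strongly. The model, its coefficients and the variable names are all invented for illustration; they are not real sales or safety figures.

```python
import random
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient for two equal-length lists of numbers."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


rng = random.Random(1)  # fixed seed so the run is repeatable

# Hypothetical model: 200 days with temperatures between 10 and 30 degrees.
temperatures = [rng.uniform(10, 30) for _ in range(200)]

# Each quantity depends on temperature plus its own independent noise.
ice_creams_sold = [5 * t + rng.uniform(-10, 10) for t in temperatures]
swimming_incidents = [0.2 * t + rng.uniform(-1, 1) for t in temperatures]

r = pearson_r(ice_creams_sold, swimming_incidents)
print(round(r, 2))  # strongly positive, despite no direct link in the model
```

The strong positive coefficient comes entirely from the shared dependence on temperature, which is the hidden third variable students are asked to look for.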

My name is Jonathan Robinson and I am passionate about teaching mathematics. I am currently Head of Maths in the South East of England and have been teaching for over 15 years. I am proud to have helped teachers all over the world to continue to engage and inspire their students with my lessons.
