nrennie.rbind.io/MFC-CDT-data-viz

Why visualise data?

Data visualisation has two main purposes:

  • Exploratory: informs data analysis
  • Explanatory: communicating insights and results

book shelf cartoon

What does this data look like?

Statistic                Value
Mean(x)                  54.26
Mean(y)                  47.83
Standard deviation(x)    16.77
Standard deviation(y)    26.94
Correlation(x, y)        -0.06

Summary statistics aren’t enough!
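
To make the point concrete, here is a minimal sketch using a swapped-in example: two of the four sets from Anscombe's quartet, not the data summarised in the table above. The function names (`sample_sd`, `correlation`, `summarise`) are illustrative helpers written for this sketch. After rounding to two decimal places, both sets report the same mean, standard deviation, and correlation, even though one follows a roughly straight line and the other a smooth curve.

```python
from math import sqrt

# Anscombe's quartet, sets I and II (both use the same x values).
# This is an illustrative stand-in, not the dataset behind the slide's table.
x = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
y_linear = [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]
y_curved = [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]


def mean(values):
    """Arithmetic mean."""
    return sum(values) / len(values)


def sample_sd(values):
    """Sample standard deviation (divides by n - 1)."""
    m = mean(values)
    return sqrt(sum((v - m) ** 2 for v in values) / (len(values) - 1))


def correlation(xs, ys):
    """Pearson correlation coefficient."""
    mx, my = mean(xs), mean(ys)
    cov = sum((a - mx) * (b - my) for a, b in zip(xs, ys))
    spread = sqrt(sum((a - mx) ** 2 for a in xs) * sum((b - my) ** 2 for b in ys))
    return cov / spread


def summarise(xs, ys):
    """Summary statistics rounded to two decimal places, as in a typical report."""
    return {
        "mean_y": round(mean(ys), 2),
        "sd_y": round(sample_sd(ys), 2),
        "correlation": round(correlation(xs, ys), 2),
    }


print(summarise(x, y_linear))
print(summarise(x, y_curved))
```

The two printed summaries match, so only a plot of the points would reveal that the underlying patterns differ.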

Communicating insights with data visualisation

Grab attention

Visualisations stand out. If a reader is short on time or uncertain about whether a document is of interest, an attention-grabbing visualisation may entice them to start reading.

Improve access to information

Textual descriptions can be lengthy and hard to read, and are frequently less precise than a visual depiction showing data points and axes.

Summarise content

Visual displays can summarise complex textual content, helping the reader remember key points.

Communicating insights with data visualisation

John Snow collected data on cholera deaths and created a visualisation where the number of deaths was represented by the height of a bar at the corresponding address in London.

This visualisation showed that the deaths clustered around Broad Street, which helped identify the source of the outbreak: the Broad Street water pump.

Snow. 1854.

John Snow cholera map

What’s the purpose of your visualisation?

Data visualisations must serve a purpose.

Ask yourself:

  • What is the purpose?
  • Does the visualisation support the purpose?
  • Is it quick, accurate, and intuitive?

What are you trying to communicate?


Detailed, accurate numbers?


Or the big picture message?

Line chart showing increase in temperature over time


Warming stripes chart showing increase in temperature over time

Why do pie charts have a bad reputation?

Why do 3D charts have a bad reputation?

What value does the bar represent?

Two 3D bar charts

It’s not that 3D charts are always bad!


3D map showing roads around a hillside


Surface plot with viridis colour palette

Longer labels are easier to read on the y-axis, where they can be written horizontally.

How do the two bars compare?

Axes don’t always have to start at zero

Order categories…

…in a sensible way

Badly ordered chart of covid cases

Source: Georgia Department of Public Health

Order categories appropriately

Default:

Magnitude ordered:

Naturally ordered:
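
The three orderings can be sketched in a few lines. This is a minimal illustration with made-up survey counts; the names `responses` and `LIKERT_SCALE` are hypothetical, and insertion order stands in for whatever default a plotting tool would apply.

```python
# Hypothetical survey counts, listed in the order the data happened to arrive.
responses = {
    "Agree": 42,
    "Strongly disagree": 7,
    "Neutral": 18,
    "Strongly agree": 25,
    "Disagree": 8,
}

# Default: whatever order the data came in.
default_order = list(responses)

# Magnitude ordered: largest count first, useful when categories have no inherent order.
magnitude_order = sorted(responses, key=responses.get, reverse=True)

# Naturally ordered: follow the scale's own sequence when the categories are ordinal.
LIKERT_SCALE = ["Strongly disagree", "Disagree", "Neutral", "Agree", "Strongly agree"]
natural_order = sorted(responses, key=LIKERT_SCALE.index)

for label, order in [
    ("Default", default_order),
    ("Magnitude ordered", magnitude_order),
    ("Naturally ordered", natural_order),
]:
    print(f"{label}: {order}")
```

Magnitude ordering suits categories with no inherent sequence, while ordinal categories such as agreement scales, age bands, or days of the week usually read best in their natural order.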

Activity 1

In groups, discuss the following chart. What is good and bad about it?

Bar chart

Discussion

Oil spill data
