More effective data visualisations for improving communication with decision makers

Centre for Marketing Analytics and Forecasting: Friday Forecasting Talks
15 November 2024

Dr Nicola Rennie

About me

Lecturer in Health Data Science within the Centre for Health Informatics, Computing, and Statistics.


Background in statistics, operational research (transport), and data science consultancy.


Collaborate with local NHS trusts on data science projects.


Co-author of Royal Statistical Society’s Best Practices for Data Visualisation guidance.




CHICAS logo

About me

Flowchart of forecasting process

Map of bikesharing stations with some highlighted

Why develop a data visualisation guide?

How did the Royal Statistical Society’s Best Practices for Data Visualisation guide start?

A survey in 2021 asked Royal Statistical Society (RSS) members their views on Significance magazine.

Respondents were asked, “What aspects of content could be improved?”

  • “Better, more consistent charts… I’d like to see a house style like The Economist
  • “The figures are often difficult to read…”
  • “The plots sometimes look amateurish…”

Help wanted

“RSS publications seek data visualisation expert to develop best practice guidance.”

Photo of Andreas

Andreas Krause, Idorsia Pharmaceuticals

Photo of Nicola

Nicola Rennie, Lancaster University

Photo of Brian

Brian Tarran, Royal Statistical Society

What’s the aim of the guide?

The guide would:

  • Help contributors develop data visualisations that are appropriate for RSS publications.
  • Be a source of information to aid people in developing charts that are high quality, readable, effective at conveying information, and fulfill their intended purpose.
  • Summarise and link to authoritative advice on chart styles and formats for different types of data.
  • Show how to override software defaults in common data visualisation software and packages.

Skip forward quite a few months…

Screenshot of data vis guide homepage

The rest of this talk…

In this session we will cover…

  • why you should visualise data;

  • some guidelines for making better charts;

  • examples of good and bad charts!

The role of visualisation

Why visualise data?

Data visualisation has two main purposes:

  • Exploratory data analysis and identifying data issues
  • Communicating insights and results

book shelf cartoon

Communicating insights with data visualisation

Grab attention

Visualisations stand out. If a reader is short on time or uncertain about whether a document is of interest, an attention-grabbing visualisation may entice them to start reading.

Improve access to information

Textual descriptions can be lengthy and hard to read, and are frequently less precise than a visual depiction showing data points and axes.

Summarise content

Visual displays allow for summarising complex textual content, aiding the reader in memorising key points.

What are you trying to communicate?

Data visualisations must serve a purpose.

Ask yourself:

  • What is the purpose?
  • Does the visualisation support the purpose?
  • Is it quick, accurate, and intuitive?

Why do pie charts have a bad reputation?

“All variations [of pie charts] lead to overestimation of small values and underestimation of large ones.” Kesara and Skau (2016)

Why do 3D charts have a bad reputation?

On the plot on the left, how tall is the bar?

Two 3D bar charts

Designing better visualisations

Elements of charts

  • Layout
  • Aspect ratio
  • Lines
  • Points
  • Colours
  • Axes
  • Symbols
  • Legends
  • Orientation
  • Auxiliary elements
  • Dimensionality

Layouts, aspect ratios, and axes

Layouts, aspect ratios, and axes

Layouts, aspect ratios, and axes

Layouts, aspect ratios, and axes

Longer labels are best on the y-axis, horizontally.

Box plots are just summary statistics in disguise…

Box plots hide information.

Layouts, aspect ratios, and axes

Should the axes start at 0?

Layouts, aspect ratios, and axes

They don’t always have to start at zero…

Layouts, aspect ratios, and axes

Layouts, aspect ratios, and axes

Badly ordered chart of covid cases

Source: Georgia Department of Public Health

Layouts, aspect ratios, and axes

Order categories appropriately…

Default:

Magnitude ordered:

Naturally ordered:

Lines

  • Suggest an order
  • Suggest continuity

Multiple lines

Avoid spaghetti plots!

Multiple lines

Alternatives to spaghetti:

  • Show a smaller number of lines (e.g. compare a few countries to average)
  • Use facets (AKA small multiples)
  • Use colour only to highlight lines

Colours

Why use colours in data visualisation?

  • Colours should serve a purpose, e.g. discerning groups of data.

  • Colours can highlight or emphasise parts of your data.

  • Not always the most effective for, e.g. communicating differences between variables.

Colours

Different types of colour palettes…


… for different types of data.

Examples of sequential, diverging, and qualitative palettes

Colours

Is this a good choice of colour?

Colours

Check for colourblind friendly plots with colorblindr::cvd_grid(g).

Legends

  • Improve accessibility with direct labelling
  • Use logical ordering, and keep it close to the data

Key points

  • Charts should have a purpose

  • Actively design visualisations

  • Default settings aren’t always the best choices

  • Every rule should be broken for some visualisations

Good charts don’t have to be boring!

Cara Thompson (cararthompson.com)

Stacked diverging bar chart of lego colours

Cedric Scherer (cedricscherer.com)

small multiples are charts of college basketball

Good charts don’t have to be boring!

Tanya Shapiro (tanyaviz.com)

Supreme court justice chart

Dan Oehm (gradientdescending.com)

Sloped area chart

Contributing to the guide

Quarto

Quarto is an open-source scientific and technical publishing system that allows you to combine text, images, code, plots, and tables in a fully-reproducible document.

Quarto has support for multiple languages including R, Python, Julia, and Observable. It works for a range of output formats such as PDFs, HTML documents, websites, presentations,…

quarto hex sticker logo

GitHub

The source code for the guide is stored on GitHub.

If you want to contribute to the guide, the easiest way is via GitHub.

Resources

Read the guide: rss.org.uk/datavisguide