Scatterplot

Scatterplot

From Business Intelligence

Jump to: navigation, search


Description

Also known as a scatter graph or dot chart. Scatterplots show relationships and correlations between two quantitative variables. Common scatterplots are mapped on a Cartesian coordinate system as dots scattered across the plane. Use this to visualize relationships between variables. It can represent large datasets with exceptions, such as outliers. It is especially useful for seeing trends in the data.



Considerations

For scatterplots

  • It is especially important to eliminate or minimize grid lines in scatter plots, as the data points are dots that can be hard to see.
  • To increase clarity of the graphic, try a dot-dash plot, which puts dashes on the axes corresponding to where the points are.
  • Try showing just the range of the plotted data on the axes rather than the whole range.

For bubble plots

  • Does not work well with a range that includes negative numbers
  • Does not convey exact values
  • People have a hard time visually distinguishing size differences between circles

Related techniques

Bubble chart: This variant uses area size to encode a value. Commonly, it is shown on an x-y axis. However, another way to employ the bubble chart is as a chart that displays values as circles of varying size but arranged arbitrarily. Good for comparing a set of values, especially when the values differ greatly. It can also be interactive - if there is more than one dimension the user can pick which dimension to display, such as at Gapminder.

Links

  • Learn more about the scatterplot at ManyEyes at and Wikipedia.
  • More information about the bubble chart is available at ManyEyes, and directions for creating a bubble chart in Excel are available from Microsoft.
  • Step-by-step instructions for designing a Tufte-compatible bubble chart and scatterplot in Excel 2007.
  • A spreadsheet with working examples of Tufte's dot-dash plot and a multi-series plot is available from Juice Analytics.