Every once in a while I check D3’s webside and find several new online data visualization that could be put to good use by water systems engineers. The two tools that will be presented here were found at D3’s website and adapted from Kai Chung’s (interactive parallel axis plot, licensed under the BSD licence) and Benjiec’s (interactive pair-wise scatter plot, licensed under the MIT licence) codes.
Interactive Parallel Axis Tool
The current version of the interactive parallel axes tool can be found here. Its features are:
- Colors by group
- Data brushing
- Pie charts indicating group breakdown and total brushed, updated as brushing is performed
- Export, keep, and remove data manipulation tools for brushed data
- Variable line opacity for clean visualization of large data sets
The first step to use the tool is to create a column in your data file called “group” (all in small letters), and assign a group to each data point in the file. All columns must have header names. Then open the parallel axes tool website and click on the “choose file” button:
A dialog should open for you to upload your data file:
After uploading your data file, your screen should look similar to this:
In the above screenshot you see:
- The group coloring legend
- The parallel axes plot
- Your data in a tabular format
- Data manipulation buttons
- Pie chart breakdown of your data
More details about each feature are given next. If your data set is large (dozens of points or more) it may be hard to identify an individual in the plot so that you can compare it with the others. One way to get around this is by hoovering the mouse over the data in tabular format. If you place the mouse over a certain row in the table, the corresponding line in the plot will be highlighted, as shown below:
In order to brush off points out of a given interval on one of the axes, just hold click on a point on the axis and drag the mouse up and down. Multiple axes can be brushed at a given time, although only one interval per axis is allowed. The data breakdown pie charts automatically update to reflect the data left in the plot. After brushing, the chart should look like the one below.
If you want to explore the tradeoffs between two axes that are not next to each other, you can hold click on an axis title and drag it to the position next to the axis you want to compare it to.
After brushing, you may want to create a new data set off the remaining data points in the plot. If you click on the “Export” button on the top right corner of the chart, a new CSV (Comma Separated File) file containing only the remaining points will be download to your computer.
If, on the other hand, you are using brushing to select points you want to get rid of, just click on the “Remove” button after brushing your data.
The resulting data set will consist of the points that were brushed off. The keep button has the opposite effect.
Interactive Pairwise Scatter Plot Tool
The features of the interactive pairwise scatter plot tool are:
- Colors by data group
- Data brushing
- Display only certain data groups
- Group breakdown scatter plots
The first step to use the tool is to create a column in your data file called “group” (all in small letters), and assign a group to each data point in the file. After this is done, open the parallel axis tool website and click on the “choose file” button:
A dialog should open for you to upload your data file:
After uploading your data file, your screen should look similar to this:
In the above screenshot you see:
- A “+” and a “-” links to increase or decrease the size of the scatter plots.
- Two links for how to color the plot, if uniform color or by group
- Which data columns to plot.
- Which groups to breakdown in scatter plots
- The scatter plots
More details about each are given next. If your file has too many data columns and you only want to visualize pairwise scatter plots of a few of them, this can be done by unchecking the boxes under “Include variables” corresponding to the columns you want to hide.
In order to brush off points out of a given a pairwise interval in one of the subplots, just hold click on a point on the axis and drag the mouse to select the interval. Only one subplot can be brushed at a given time. After hiding one of the variables and brushing the plot, the chart should look like the one below.
If you click on the “group” link under “select a variable to color”, your data points will be assigned colors according to the group they belong to, as in the screenshot below.
If you check any two boxes under “Drill and Expand”, the first checked box will be made the x-axis of all subplots and the variables corresponding tot he unchecked boxes will be in the y-axes of each row of subplots. The second checked box will be broken into multiple smaller intervals, which effectively brushes the data into tiny intervals, each represented in a subplot.
I hope you find these tools useful!