The bixplot: A variation on the boxplot suited for bimodal data
By: Camille M. Montalcini, Peter J. Rousseeuw
Potential Business Impact:
Shows hidden groups in data.
Boxplots and related visualization methods are widely used exploratory tools for taking a first look at collections of univariate variables. In this note an extension is provided that is specifically designed to detect and display bimodality and multimodality when the data warrant it. For this purpose a univariate clustering method is constructed that ensures contiguous clusters, meaning that no cluster has members inside another cluster, and such that each cluster contains at least a given number of unique members. The resulting bixplot display facilitates the identification and interpretation of potentially meaningful subgroups underlying the data. The bixplot also displays the individual data values, which can draw attention to isolated points. Implementations of the bixplot are available in both Python and R, and their many options are illustrated on several real datasets. For instance, an external variable can be visualized by color gradations inside the display.
Similar Papers
ggskewboxplots: Enhanced Boxplots for Skewed Data in R
Methodology
Finds real outliers in messy data better.
Unifying Boxplots: A Multiple Testing Perspective
Methodology
Makes charts better at finding weird data points.
Bi-dendrograms for clustering the categories of a multivariate categorical data set
Methodology
Simplifies complex data by grouping similar information.