×

Better biplots. (English) Zbl 1453.62046

Summary: The elements of a biplot are (i) a set of axes representing variables, usually concurrent at the centroid of (ii) a set of points representing samples or cases. The axes are (approximations to) conventional coordinate axes, and therefore may be labelled and calibrated. Especially when there are many points (perhaps several thousand) the whole effect can be very confusing but this may be mitigated by:
1.
Giving a density representation of the points.
2.
While respecting the calibrations, moving the axes to new positions more remote from the points, and possibly jointly rotating axes and points.
3.
The use of colour – when permissible.
4.
Choosing more than one centre of concurrency.
The principles are quite general but we illustrate them by examples of the Categorical Principal Component Analysis of the responses to questions concerning migration in Germany. This application introduces the additional interest of representing ordered categorical variables by irregularly calibrated axes.

MSC:

62-08 Computational methods for problems pertaining to statistics
62H25 Factor analysis and principal components; correspondence analysis

Software:

R
PDFBibTeX XMLCite
Full Text: DOI

References:

[1] Blasius, J.; Gower, J. C., Multivariate prediction with nonlinear principal components analysis: Application, Quality and Quantity, 39, 373-390 (2005)
[2] Cleveland, W. S., Visualizing Data (1993), Hobart Press: Hobart Press Summit, New Jersey, USA
[3] Cook, R. D.; Weisberg, S., Residuals and Influence in Regression (1982), Chapman & Hall/CRC: Chapman & Hall/CRC London · Zbl 0564.62054
[4] De Leeuw, J., Nonlinear principal component analysis and related techniques, (Greenacre, M.; Blasius, J., Multiple Correspondence Analysis and Related Techniques (2006), Chapman & Hall/CRC: Chapman & Hall/CRC Boca Raton, FL), 107-133 · Zbl 1277.62163
[5] Eilers, P. H.C.; Goeman, J. J., Enhancing scatterplots with smoothed densities, Bioinformatics, 20, 623-628 (2004)
[6] Gabriel, K. R., The biplot-graphic display of matrices with applications to principal component analysis, Biometrika, 58, 453-467 (1971) · Zbl 0228.62034
[7] Gifi, A., Nonlinear Multivariate Analysis (1990), J. Wiley: J. Wiley Chichester · Zbl 0697.62048
[8] Gower, J.; Hand, D., Biplots (1996), Chapman & Hall/CRC: Chapman & Hall/CRC London · Zbl 0867.62053
[9] Heiser, W. J.; Meulman, J. J., Homogeneity Analysis: Exploring the Distribution of Variables and their Nonlinear Relationships, (Greenacre, M.; Blasius, J., Correspondence Analysis in the Social Sciences. Recent Developments and Applications (1994), Academic Press: Academic Press London), 179-209
[10] Le Roux, B.; Rouanet, H., Geometric Data Analysis: From Correspondence Analysis to Structured Data (2004), Kluwer: Kluwer Dordrecht · Zbl 1096.62062
[11] Meulman, J. J.; Heiser, W. J., SPSS Categories 13.0 (2004), SPSS Inc.: SPSS Inc. Chicago
[12] R Development Core Team, 2005. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna. www.R-project.org; R Development Core Team, 2005. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna. www.R-project.org
[13] Scheuch, E.K., 2000. The use of ISSP for comparative research. In: ZUMA-Nachrichten 47. pp. 64-74; Scheuch, E.K., 2000. The use of ISSP for comparative research. In: ZUMA-Nachrichten 47. pp. 64-74
[14] Whittaker, E. T., On a new method of graduation, Proceedings of the Edinburgh Mathematical Society, 41, 63-75 (1923)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.