zbMATH — the first resource for mathematics

Analysis of a European Union election using principal component analysis. (Analysis of an European Union election using principal component analysis.) (English) Zbl 1247.91053
Summary: While studying the results from one European Parliament election, the question of principal component analysis (PCA) suitability for this kind of data was raised. Since multiparty data should be seen as compositional data (CD), the application of PCA is inadvisable and may conduct to ineligible results. This work points out the limitations of PCA to CD and presents a practical application to the results from the European Parliament election in 2004. We present a comparative study between the results of PCA, Crude PCA and Logcontrast PCA (cf., e.g., [J. Aitchison, Biometrika 70, 57–65 (1983; Zbl 0515.62057)]). As a conclusion of this study, and concerning the mentioned data set, the approach which produced clearer results was the Logcontrast PCA. Moreover, Crude PCA conducted to misleading results since nonlinear relations were presented between variables and the linear PCA proved, once again, to be inappropriate to analyse data which can be seen as CD.

91B12 Voting theory
62H25 Factor analysis and principal components; correspondence analysis
62P25 Applications of statistics to social sciences
Full Text: DOI
[1] Aitchison J (1983) Principal component analysis of compositional data. Biometrika 70: 57–61 · Zbl 0515.62057 · doi:10.1093/biomet/70.1.57
[2] Aitchison J (1986) The statistical analysis of compositional data. Springer, London · Zbl 0688.62004
[3] Barceló-Vidal C (2003) When a data set can be considered compositional? In: CoDaWork03: Compositional Data Analysis Workshop, Girona, Spain. Available at http://ima.udg.es/Activitats/CoDaWork03/
[4] Bradu D, Gabriel KR (1978) The biplot as a diagnostic tool for models of two-way tables. Technometrics 20: 47–68 · Zbl 0381.62004 · doi:10.1080/00401706.1978.10489617
[5] Butler A, Glasbey C (2008) A latent Gaussian model for compositional data with zeros. J R Stat Soc C Appl Stat 57: 505–520 · doi:10.1111/j.1467-9876.2008.00627.x
[6] Chayes F, Trochimczyk J (1978) An effect of closure on the structure of principal component. Math Geol 10: 323–333 · doi:10.1007/BF01031737
[7] Gabriel KR (1971) The biplot graphic display of matrices with application to principal component analysis. Biometrika 58: 453–467 · Zbl 0228.62034 · doi:10.1093/biomet/58.3.453
[8] Jolliffe IT (2002) Principal component analysis. Springer, New York · Zbl 1011.62064
[9] Katz JN, King G (1999) A statistical model for multiparty electoral data. Am Polit Sci Rev 93: 15–32 · doi:10.2307/2585758
[10] Kucera M, Malmgren BA (1998) Logratio transformation of compositional data–a resolution of the constant sum constraint. Mar Micropaleontol 34: 117–120 · doi:10.1016/S0377-8398(97)00047-9
[11] Palarea-Albaladejo J, Martín-Fernández JA (2008) A modified EM alr-algorithm for replacing rounded zeros in compositional data sets. Comput Geosci 34: 902–917 · doi:10.1016/j.cageo.2007.09.015
[12] Thi-Henestrosa S, Martín-Fernández JA (2005) Dealing with compositional data: the freeware coDaPack. Math Geol 7: 773–793 · Zbl 1152.86312 · doi:10.1007/s11004-005-7379-3
[13] van den Boogaart KG, Tolosana-Delgado R (2008) Compositions: a unified R package to analyze compositional data. Comput Geosci 34(4): 320–338 · doi:10.1016/j.cageo.2006.11.017
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.