Compositional data are quantitative descriptions of the parts of some whole, conveying exclusively relative information. Examples are found in various fields, including geology, medicine, chemistry, agriculture, economics, social science, etc. This concise book presents a very applied introduction to compositional data analysis, focussing on the use of R for analysis. It includes lots of real examples, code snippets, and colour figures, to illustrate the methods.



Autorentext

Michael Greenacre is Professor of Statistics at the Universitat Pompeu Fabra, Barcelona, Spain, where he teaches a course, amongst others, on Data Visualization. He has authored and co-edited nine books and 80 journal articles and book chapters, mostly on correspondence analysis, the latest being Correspondence Analysis in Practice (Third Edition) in 2016. He has given short courses in fifteen countries to environmental scientists, sociologists, data scientists and marketing professionals, and has specialized in statistics in ecology and social science.



Klappentext

Compositional Data Analysis in Practice is a user-oriented practical guide to the analysis of data with the property of a constant sum, for example percentages adding up to 100%. Compositional data can give misleading results if regular statistical methods are applied, and are best analysed by first transforming them to logarithms of ratios. This book explains how this transformation affects the analysis, results and interpretation of this very special type of data. All aspects of compositional data analysis are considered: visualization, modelling, dimension-reduction, clustering and variable selection, with many examples in the fields of food science, archaeology, sociology and biochemistry, and a final chapter containing a complete case study using fatty acid compositions in ecology. The applicability of these methods extends to other fields such as linguistics, geochemistry, marketing, economics and finance.

R Software

The following repository contains data files and R scripts from the book https://github.com/michaelgreenacre/CODAinPractice. The R package easyCODA, which accompanies this book, is available on CRAN -- note that you should have version 0.25 or higher. The latest version of the package will always be available on R-Forge and can be installed from R with this instruction: install.packages("easyCODA", repos="http://R-Forge.R-project.org").



Inhalt

What are compositional data, and why are they special?

Geometry and visualization of compositional data.

Logratio transformations.

Properties and distributions of logratios.

Regression models involving compositional data.

Dimension reduction using logratio analysis.

Clustering of compositional data.

The problem of zeros, with some solutions.

Simplifying the task: variable selection.

Case study: Fatty acids of marine amphipods.

Appendix A: Theory of compositional data analysis.

Appendix B Bibliography of compositional data analysis

Appendix C Computation of compositional data analysis

Appendix D Glossary of terms

Appendix E Epilogue

Titel
Compositional Data Analysis in Practice
EAN
9780429849015
Format
E-Book (epub)
Veröffentlichung
17.07.2018
Digitaler Kopierschutz
Adobe-DRM
Anzahl Seiten
120