On Bicompositional Correlation
Author
Summary, in English
A composition is a vector of positive components summing to a constant, usually taken to be 1. Hitherto, research on compositional correlation has mainly focused on the correlation between the components of a single composition. This thesis is concerned with modelling the correlation between two compositions.
We introduce a generalization of the Dirichlet distribution that describes two compositions simultaneously, i.e. a bicompositional Dirichlet distribution. The covariation between the two compositions is modelled by a parameter γ. If γ=0, the two compositions are independent. For compositions with two components, we prove for which values of γ the distribution exists. We also give expressions for the normalization constant and for other properties, such as moments and marginal and conditional distributions. For compositions with more than two components, we present expressions for the normalization constant and other properties for all non-negative integer values of γ. We also present a method for generating random numbers from the distribution for all γ≥0 and, if the compositions have two components, for some γ<0. The generator is based on the rejection method.
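As an illustration of the rejection idea for γ≥0, the following Python sketch may help. It is not taken from the thesis: the summary does not state the density, so the sketch assumes a joint density proportional to the product of two ordinary Dirichlet densities and the factor (x·y)^γ, where x·y is the inner product of the two compositions. The function name `sample_bicompositional` and its parameters `alpha`, `beta`, `gamma` and `size` are hypothetical. Under that assumption, pairs drawn from two independent Dirichlet distributions can be accepted with probability (x·y)^γ, which never exceeds 1 because x·y lies in (0, 1].

```python
import numpy as np


def sample_bicompositional(alpha, beta, gamma, size, rng=None):
    """Rejection sampler for pairs of compositions (x, y).

    ASSUMPTION (not stated in the summary): the target density is
    proportional to Dirichlet(x | alpha) * Dirichlet(y | beta) * (x . y)**gamma.
    Only gamma >= 0 is handled, so (x . y)**gamma is a valid acceptance
    probability.
    """
    if gamma < 0:
        raise ValueError("this sketch only covers gamma >= 0")
    alpha = np.asarray(alpha, dtype=float)
    beta = np.asarray(beta, dtype=float)
    if alpha.shape != beta.shape:
        raise ValueError("x . y requires the same number of components")
    rng = np.random.default_rng() if rng is None else rng

    kept_x, kept_y, n_kept = [], [], 0
    while n_kept < size:
        # Proposal: the two compositions drawn independently.
        x = rng.dirichlet(alpha, size=size)
        y = rng.dirichlet(beta, size=size)
        # Accept each pair with probability (x . y)**gamma.
        accept_prob = np.sum(x * y, axis=1) ** gamma
        keep = rng.random(size) < accept_prob
        kept_x.append(x[keep])
        kept_y.append(y[keep])
        n_kept += int(keep.sum())

    return np.concatenate(kept_x)[:size], np.concatenate(kept_y)[:size]


if __name__ == "__main__":
    x, y = sample_bicompositional([1.0, 1.0], [1.0, 1.0], gamma=5.0,
                                  size=2000, rng=np.random.default_rng(0))
    # Sample correlation between the first components of x and y.
    print(np.corrcoef(x[:, 0], y[:, 0])[0, 1])
```

With γ=0 every proposal is accepted and the sketch reduces to two independent Dirichlet draws, in line with the independence statement above. The acceptance rate falls as γ grows, so this naive version would become slow for large γ.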
We use this bicompositional distribution and a general measure of correlation based on the concept of information gain to calculate a measure of correlation between two compositions for a large number of models. Finally, we present an estimator of the general measure of correlation and compare two proposed confidence intervals for it.
Department/s
Publishing year
2010
Language
English
Publication/Series
Doctoral Theses in Statistics
Full text
- Available as PDF - 655 kB
- Available as PDF - 250 kB
Document type
Dissertation
Topic
- Probability Theory and Statistics
Keywords
- Joint correlation coefficient
- Empirical confidence coefficient
- Dirichlet distribution
- Correlation
- Composition
- Compositional data
- Random variate generation
- Simplex
Status
Published
Supervisor
ISBN/ISSN/Other
- ISSN: 1651-7938
- ISBN: 978-91-628-8028-6
Defence date
16 April 2010
Defence time
10:15
Defence place
EC3:207, Holger Crafoords Ekonomicentrum
Opponent
- Peter Guttorp (Professor)