Abstract
Correlation matrices are omnipresent in multivariate data analysis. When the number d of variables is large, the sample estimates of correlation matrices are typically noisy and conceal underlying dependence patterns. We consider the case when the variables can be grouped into K clusters with exchangeable dependence; this assumption is often made in applications, e.g., in finance and econometrics. Under this partial exchangeability condition, the corresponding correlation matrix has a block structure and the number of unknown parameters is reduced from d(d−1)∕2 to at most K(K + 1)∕2. We propose a robust algorithm based on Kendall's rank correlation to identify the clusters without assuming the knowledge of K a priori or anything about the margins except continuity. The corresponding block-structured estimator performs considerably better than the sample Kendall rank correlation matrix when K<d. The new estimator can also be much more efficient in finite samples even in the unstructured case K = d, although there is no gain asymptotically. When the distribution of the data is elliptical, the results extend to linear correlation matrices and their inverses. The procedure is illustrated on financial stock returns.
Original language | English |
---|---|
Pages (from-to) | 400-422 |
Number of pages | 23 |
Journal | Journal of Multivariate Analysis |
Volume | 169 |
DOIs | |
Publication status | Published - Jan 2019 |
Externally published | Yes |
Bibliographical note
Publisher Copyright:© 2018 Elsevier Inc.
Keywords
- Agglomerative clustering
- Constrained maximum likelihood
- Copula
- Kendall's tau
- Parameter clustering
- Shrinkage