Geofizika, Vol. 23 No. 1, 2006.
Original scientific paper
Application of tetrachoric and polychoric correlation coefficients to forecast verification
Josip Juras
Zoran Pasarić
Abstract
The measure of association in 2 x 2 (K x K) contingency tables known as tetrachoric (polychoric) correlation coefficient is recalled. These measures rely on two assumptions: 1) there exist continuous latent variables underlying the contingency table and 2) joint distribution of corresponding standard normal deviates is bivariate normal. It is shown that, in practice, the tetrachoric (polychoric) correlation coefficient is an estimate of Pearson correlation coefficient between the latent variables. Consequently, these measures do not depend on bias nor on marginal frequencies of the table, which implies a natural and convenient partition of information (carried by the contingency table), between association, bias and probability of the event and subsequently enables the analysis of how other scores depend on bias and marginal frequencies. Results extended to K x K tables lead to eventual reduction in dimensionality from K2 to 2K. The theoretical findings are illustrated through analysis of real-life, 6 x 6 contingency tables on verification of quantitative precipitation forecasts.
Keywords
tetrachoric correlation coefficient; contingency table; forecast evaluation
Hrčak ID:
4211
URI
Publication date:
30.6.2006.
Visits: 2.730 *