Skip to the main content

Original scientific paper

Application of tetrachoric and polychoric correlation coefficients to forecast verification

Josip Juras
Zoran Pasarić


Full text: english pdf 284 Kb

page 59-82

downloads: 1.011

cite


Abstract

The measure of association in 2 x 2 (K x K) contingency tables known as tetrachoric (polychoric) correlation coefficient is recalled. These measures rely on two assumptions: 1) there exist continuous latent variables underlying the contingency table and 2) joint distribution of corresponding standard normal deviates is bivariate normal. It is shown that, in practice, the tetrachoric (polychoric) correlation coefficient is an estimate of Pearson correlation coefficient between the latent variables. Consequently, these measures do not depend on bias nor on marginal frequencies of the table, which implies a natural and convenient partition of information (carried by the contingency table), between association, bias and probability of the event and subsequently enables the analysis of how other scores depend on bias and marginal frequencies. Results extended to K x K tables lead to eventual reduction in dimensionality from K2 to 2K. The theoretical findings are illustrated through analysis of real-life, 6 x 6 contingency tables on verification of quantitative precipitation forecasts.

Keywords

tetrachoric correlation coefficient; contingency table; forecast evaluation

Hrčak ID:

4211

URI

https://hrcak.srce.hr/4211

Publication date:

30.6.2006.

Article data in other languages: croatian

Visits: 2.730 *