Interrater reliability: the kappa statistic

McHugh, Mary L.

Biochemia Medica, Vol. 22 No. 3, 2012.

Pregledni rad

Interrater reliability: the kappa statistic

Mary L. McHugh ; Department of Nursing, National University, Aero Court, San Diego, California

Puni tekst: engleski pdf 180 Kb

str. 276-282

preuzimanja: 24.036

citiraj

APA 6th Edition

McHugh, M.L. (2012). Interrater reliability: the kappa statistic. Biochemia Medica, 22 (3), 276-282. Preuzeto s https://hrcak.srce.hr/89395

MLA 8th Edition

McHugh, Mary L.. "Interrater reliability: the kappa statistic." Biochemia Medica, vol. 22, br. 3, 2012, str. 276-282. https://hrcak.srce.hr/89395. Citirano 27.04.2024.

Chicago 17th Edition

McHugh, Mary L.. "Interrater reliability: the kappa statistic." Biochemia Medica 22, br. 3 (2012): 276-282. https://hrcak.srce.hr/89395

Harvard

McHugh, M.L. (2012). 'Interrater reliability: the kappa statistic', Biochemia Medica, 22(3), str. 276-282. Preuzeto s: https://hrcak.srce.hr/89395 (Datum pristupa: 27.04.2024.)

Vancouver

McHugh ML. Interrater reliability: the kappa statistic. Biochemia Medica [Internet]. 2012 [pristupljeno 27.04.2024.];22(3):276-282. Dostupno na: https://hrcak.srce.hr/89395

IEEE

M.L. McHugh, "Interrater reliability: the kappa statistic", Biochemia Medica, vol.22, br. 3, str. 276-282, 2012. [Online]. Dostupno na: https://hrcak.srce.hr/89395. [Citirano: 27.04.2024.]

Sažetak

The kappa statistic is frequently used to test interrater reliability. The importance of rater reliability lies in the fact that it represents the extent to which the data collected in the study are correct representations of the variables measured. Measurement of the extent to which data collectors (raters) assign the same score to the same variable is called interrater reliability. While there have been a variety of methods to measure interrater reliability, traditionally it was measured as percent agreement, calculated as the number of agreement scores divided by the total number of scores. In 1960, Jacob Cohen critiqued use of percent agreement due to its inability to account for chance agreement. He introduced the Cohen’s kappa, developed to account for the possibility that raters actually guess on at least some variables due to uncertainty. Like most correlation statistics, the kappa can range from -1 to +1. While the kappa is one of the most commonly used statistics to test interrater reliability, it has limitations. Judgments about what level of kappa should be acceptable for health research are questioned. Cohen’s suggested interpretation may be too lenient for health related studies because it implies that a score as low as 0.41 might be acceptable. Kappa and percent agreement are compared, and levels for both kappa and percent agreement that should be demanded in healthcare studies are suggested.

Ključne riječi

kappa; reliability; rater; interrater

Hrčak ID:

89395

URI

https://hrcak.srce.hr/89395

Datum izdavanja:

15.10.2012.

Posjeta: 49.939 *

Prijava i registracija

Biochemia Medica, Vol. 22 No. 3, 2012.

Sažetak

Ključne riječi

Hrčak ID:

URI

Datum izdavanja: