了解 Intra-class Correlation Correlation 组内相关系数

什么是intra-class coefficient correlation?翻译过来是组内相关系数。

为什么会需要它呢?

当我们的调查中有许多不同的采访者、评分者或评估者时,通常使用类内相关系数(ICC)。假设我们的调查中有n个参与者(或项目),每个参与者由k个不同的面试官进行评估。我们感兴趣的是知道我们看到的k个面试官之间的一致程度(面试官是否为每个参与者记录了相同的结果)。ICC是我们的数据中总方差的比例,它由采访者之间的方差解释。在大多数情况下,ICC的值在0到1之间,当ICC接近1时,我们看到审查员之间有一个完美的一致,当ICC接近0时,我们看到审查员之间没有一致。

我觉得下边的这个例子解释的比较清楚。


Intraclass correlation measures the reliability of ratings or measurements for clusters — data that has been collected as groups or sorted into groups. A related term is interclass correlation, which is usually another name for Pearson correlation (other statistics can be used, like Cohen’s kappa, but this is rare). Pearson’s is usually used for inter-rater reliability when you only have one or two meaningful pairs from one or two raters. For more, you’ll want to use the ICC. Like most correlation coefficients, the ICC ranges from 0 to 1.

  • A high Intraclass Correlation Coefficient (ICC) close to 1 indicates high similarity between values from the same group.

  • A low ICC close to zero means that values from the same group are *not *similar.

This is best illustrated with an example. In the image below, values from the same group are clustered fairly tightly together. For example, group 3 (on the x-axis) is clustered between about -1.3 and -0.4 on the y-axis. Most of the groups are similarly clustered, giving the entire set a high ICC of 0.91:

A dotplot of a dataset with high intraclass correlation. Image: skbkekas|Wikimedia Commons.

Compare that set to the following graph of a dataset with an extremely low ICC of 0.07. The values within groups are widely scattered without any clusters:

Dataset with a low ICC. Image: Skbkekas|Wikimedia Commons.

Common Uses and Calculation

The ICC is used to measure a wide variety of numerical data from clusters or groups, including:

  • How closely relatives resemble each other with regard to a certain characteristic or traits.

  • Reproducibility of numerical measurements made by different people measuring the same thing.

Calculating the ICC is very complex by hand, in part because of the number of ICC formulas to choose from, and partly because the formulas themselves are complex. The main reason for all of this complexity is that the ICC is very flexible and can be adjusted for inconsistent raters for all ratees. For example, let’s say you have a group of 10 raters who rate 20 ratees. If 9 of the raters rate 15 of the ratees and 1 rater rates all of them, or if 10 raters rate 2 each, you can still calculate the ICC.

Calculating the ICC is usually performed with software, and each program has its own terminology and quirks. For example, in SPSS, you’re given three different options for calculating the ICC.

  • If you have inconsistent raters/ratees, use “One-Way Random.”

  • If you have consistent raters/ratees (e.g. 10 raters each rate 10 ratees), and you have sample data. use “Two-Way Random.”

  • If you have consistent raters/ratees (e.g. 10 raters each rate 10 ratees), and you have population data, use “Two-Way Random.”


参考链接:

https://www.statisticshowto.datasciencecentral.com/intraclass-correlation/

http://www.adasis-events.com/statistics-blog/2013/4/25/what-is-the-intra-class-correlation-coefficient.html

https://www.uvm.edu/~dhowell/StatPages/

20190323

©著作权归作者所有,转载或内容合作请联系作者
平台声明:文章内容(如有图片或视频亦包括在内)由作者上传并发布,文章内容仅代表作者本人观点,简书系信息发布平台,仅提供信息存储服务。

推荐阅读更多精彩内容

  • rljs by sennchi Timeline of History Part One The Cognitiv...
    sennchi阅读 12,160评论 0 10
  • **2014真题Directions:Read the following text. Choose the be...
    又是夜半惊坐起阅读 13,469评论 0 23
  • 上学的时候,同专业有个姑娘很招男孩子喜欢,我想我们很多人都喜欢她吧,漂亮、得体、一切都恰到好处。当然她各方面条件也...
    96b241183f0b阅读 1,560评论 0 2
  • 昨天刚看完大冰的《阿弥陀佛,么么哒》延续《乖,摸摸头》的基调,大冰继续以他的小屋,他的浪荡不羁爱自由的生涯,告他...
    幻美心阅读 2,662评论 0 0
  • 姓名:李丹丹 公司:山东万洁环保科技有限公司 组别:第373期 利他一组 【日精进打卡第310天】 【知~学习】 ...
    小雨淅淅373阅读 708评论 0 0