What are benefits and weaknesses of various binary classification metrics?

Accuracy
Definition - Proportion of instances you predict correctly.
Strengths - Very intuitive and easy to explain.
Weaknesses - Works poorly when the signal in the data is weak compared to the signal from the class imbalance. Also, you cannot express your uncertainty about a certain prediction.

Area under the curve (AUC)
Definition (intuitive) - Given a random positive instance and a random negative instance, the probability that you can distinguish between them.
Definition (direct) - The area under the ROC curve
Strengths - Works well when you want to be able to test your ability to distinguish the two classes.
Weaknesses - You may not be able to interpret your predictions as probabilities if you use AUC, since AUC only cares about the rankings of your prediction scores and not their actual value. Thus you may not be able to express your uncertainty about a prediction, or even the probability that an item is successful.

LogLoss / Deviance
Strengths - Your estimates can be interpreted as probabilities.
Weaknesses - If you have a lot of predictions that are near the boundaries, your error metric may be very sensitive to false positives or false negatives.

F-score, Mean Average Precision, Cohen's Kappa

These are more esoteric and not as often used for general binary classification tasks. You may see them in specific subfields (e.g. F-score in NLP and Precision metrics in information retrieval)

最后编辑于
©著作权归作者所有,转载或内容合作请联系作者
平台声明:文章内容(如有图片或视频亦包括在内)由作者上传并发布,文章内容仅代表作者本人观点,简书系信息发布平台,仅提供信息存储服务。

推荐阅读更多精彩内容

  • PLEASE READ THE FOLLOWING APPLE DEVELOPER PROGRAM LICENSE...
    念念不忘的阅读 13,568评论 5 6
  • **2014真题Directions:Read the following text. Choose the be...
    又是夜半惊坐起阅读 10,130评论 0 23
  • 那天,到朋友的店里,为她办理车险。之后,她打量着我的大衣说:“开春了,你该换短款的衣服穿了。” 我突然想起来,这件...
    爱阅沈阳阅读 174评论 0 0
  • 企业人力资源战略规划 在创业型的企业中,一般会遇到这样的问题: 1、企业发展迅速,在很短的时间里无论...
    Sim2阅读 525评论 0 3
  • 天若有情天亦老,摇摇幽恨难禁。 天若有情天亦老,此意如何,细似轻丝渺似波。 天若有情天亦老,月如无恨月长圆。 天若...
    枫羽326阅读 168评论 0 0