ML Basic #4 | Evaluation Metrics


Introduction

  • Confusion Matrix : is composed of four elements: TP, TN, FP, and FN. The T/F in front indicates whether the model's answer was correct, and the P/N at the back indicates the model's predicted class.

    • Positive and Negative should be read as the model's predicted classes, not as value judgments by the modeler. For example, "has cancer" is bad news for the patient, but to the model it is simply the class whose score is closer to 1 (positive) than to 0 (negative).

    • Because of the word-order difference between English and Korean, it can be easier to read each term from the back: first the predicted class (P/N), then whether that prediction was correct (T/F).

      • TP (True Positive) : The prediction of the model is positive and it is true (the answer is positive)

      • TN (True Negative) : The prediction of the model is negative and it is true (the answer is negative)

      • FP (False Positive) : The prediction of the model is positive and it is false (the answer is negative)

      • FN (False Negative) : The prediction of the model is negative and it is false (the answer is positive)

    • Even with multiple classes, the idea is much the same. For example, for the apple class (see the code sketch below):

      • TP is 7, TN is 1+3+2+3, FP is 8+9, and FN is 1+3
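
A minimal sketch of how these per-class counts fall out of a multi-class confusion matrix. The 3×3 matrix below is a hypothetical reconstruction chosen to reproduce the apple numbers above (rows = actual class, columns = predicted class, class 0 = apple), and the helper name per_class_counts is ours, not a library function.

```python
import numpy as np

# Hypothetical 3-class confusion matrix matching the apple example
# (rows = actual class, columns = predicted class; class 0 = apple).
cm = np.array([
    [7, 1, 3],   # actual apple
    [8, 1, 3],   # actual class B
    [9, 2, 3],   # actual class C
])

def per_class_counts(cm, k):
    """Return TP, FN, FP, TN for class index k."""
    tp = cm[k, k]
    fn = cm[k, :].sum() - tp      # actual k, predicted as something else
    fp = cm[:, k].sum() - tp      # predicted k, actually something else
    tn = cm.sum() - tp - fn - fp  # everything not involving class k
    return tp, fn, fp, tn

print(per_class_counts(cm, 0))   # (7, 4, 17, 9) -> TP=7, FN=1+3, FP=8+9, TN=1+3+2+3
```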



1. True Positive Rate (TPR, Sensitivity, Recall)

  • The probability of a positive prediction, conditioned on truly being positive.

    • Ex1. The proportion of patients predicted to have cancer among those who actually have cancer

$$ \frac{TP}{P} = \frac{TP}{TP+FN} $$
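
A minimal sketch of recall computed directly from this definition, assuming a small set of hypothetical binary labels (1 = cancer, 0 = normal):

```python
import numpy as np

# Hypothetical labels and predictions (1 = cancer, 0 = normal).
y_true = np.array([1, 1, 1, 0, 0, 1, 0, 0])
y_pred = np.array([1, 0, 1, 0, 1, 1, 0, 0])

tp = np.sum((y_pred == 1) & (y_true == 1))
fn = np.sum((y_pred == 0) & (y_true == 1))
recall = tp / (tp + fn)  # TP / P: predicted positive among the actual positives
print(recall)            # 0.75 -> 3 of the 4 actual cancer cases were caught
```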



2. True Negative Rate (TNR, Specificity)

  • The probability of a negative prediction, conditioned on truly being negative.

    • Ex2. The proportion of people predicted as normal among those who are actually normal

$$ \frac{TN}{N} = \frac{TN}{TN+FP} $$
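
Specificity on the same hypothetical labels, this time counting only the actual negatives:

```python
import numpy as np

# Hypothetical labels and predictions (1 = cancer, 0 = normal).
y_true = np.array([1, 1, 1, 0, 0, 1, 0, 0])
y_pred = np.array([1, 0, 1, 0, 1, 1, 0, 0])

tn = np.sum((y_pred == 0) & (y_true == 0))
fp = np.sum((y_pred == 1) & (y_true == 0))
specificity = tn / (tn + fp)  # TN / N: predicted negative among the actual negatives
print(specificity)            # 0.75 -> 3 of the 4 actual normals are correctly cleared
```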



3. False Positive Rate (FPR, Fall-out)

  • The ratio of negative events wrongly classified as positive (false positives) to the total number of actual negative events, regardless of how they were classified.

$$ FPR = \frac{FP}{N} = 1-TNR = 1-Specificity $$
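
A quick check, on the same hypothetical labels, that FPR is exactly 1 - specificity:

```python
import numpy as np

# Hypothetical labels and predictions (1 = cancer, 0 = normal).
y_true = np.array([1, 1, 1, 0, 0, 1, 0, 0])
y_pred = np.array([1, 0, 1, 0, 1, 1, 0, 0])

fp = np.sum((y_pred == 1) & (y_true == 0))
tn = np.sum((y_pred == 0) & (y_true == 0))
fpr = fp / (fp + tn)   # FP / N
tnr = tn / (tn + fp)   # specificity
print(fpr, 1 - tnr)    # 0.25 0.25 -> the two quantities agree
```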



4. Positive Predictive Value (PPV, Precision)

  • The probability of being truly positive, among all positive predictions

    • The proportion of patients who truly have cancer among those predicted to have cancer

    • While Sensitivity is computed from the point of view of the answer (label), Precision is interpreted from the point of view of the model's predictions

$$ \frac{TP}{TP+FP} $$
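
Precision on the same hypothetical labels, this time conditioning on the model's positive predictions:

```python
import numpy as np

# Hypothetical labels and predictions (1 = cancer, 0 = normal).
y_true = np.array([1, 1, 1, 0, 0, 1, 0, 0])
y_pred = np.array([1, 0, 1, 0, 1, 1, 0, 0])

tp = np.sum((y_pred == 1) & (y_true == 1))
fp = np.sum((y_pred == 1) & (y_true == 0))
precision = tp / (tp + fp)  # among the model's positive calls, how many are correct
print(precision)            # 0.75 -> 3 of the 4 predicted cancer cases are real
```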



5. F1-Score

  • F1-Score : The harmonic mean of Precision and Recall (the harmonic mean penalizes imbalance between the two values, so the result is pulled toward the smaller one)
    • Micro (Averaged) F1 : is calculated from the total TP, total FP, and total FN of the model. It does not consider each class individually; it calculates the metric globally.
    • Macro (Averaged) F1 : calculates the metric for each class individually and then takes the unweighted mean of those values.
    • Weighted F1 : takes a weighted mean of the per-class values, where each class is weighted by its number of samples (support). The three averaging modes are illustrated in the sketch after the formula.

$$ 2 \times \frac{Precision \times Recall}{Precision + Recall } $$
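
A small sketch of the three averaging modes using scikit-learn's f1_score; the 3-class labels below are hypothetical and chosen so that the three averages differ:

```python
from sklearn.metrics import f1_score

# Hypothetical 3-class labels and predictions.
y_true = [0, 0, 0, 0, 1, 1, 2, 2, 2]
y_pred = [0, 0, 1, 2, 1, 1, 2, 2, 0]

print(f1_score(y_true, y_pred, average="micro"))     # ~0.667: global TP/FP/FN over all classes
print(f1_score(y_true, y_pred, average="macro"))     # ~0.679: unweighted mean of per-class F1
print(f1_score(y_true, y_pred, average="weighted"))  # ~0.654: mean weighted by class support
```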



6. ROC (Receiver Operating Characteristic) Curve

  • A plot showing how Recall (TPR) and Fall-out (FPR) change together as the classification threshold is varied
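
A minimal plotting sketch using scikit-learn's roc_curve, assuming hypothetical true labels and predicted scores (probabilities of the positive class):

```python
import matplotlib.pyplot as plt
from sklearn.metrics import roc_curve

# Hypothetical true labels and model scores for the positive class.
y_true  = [0, 0, 1, 1, 0, 1, 0, 1]
y_score = [0.1, 0.4, 0.35, 0.8, 0.2, 0.9, 0.6, 0.7]

# roc_curve sweeps the decision threshold and returns one (FPR, TPR) pair per threshold.
fpr, tpr, thresholds = roc_curve(y_true, y_score)

plt.plot(fpr, tpr, marker="o")
plt.xlabel("FPR (Fall-out)")
plt.ylabel("TPR (Recall)")
plt.title("ROC curve")
plt.show()
```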