Tag: Metrics

  • ML0036 Confusion Matrix

    In which scenarios is a Confusion Matrix most useful for evaluating machine learning models, and why?

    Answer

    A Confusion Matrix is a table that visualizes the performance of a classification model by comparing the predicted and actual class labels. It displays the counts of True Positives (correctly predicted positives), True Negatives (correctly predicted negatives), False Positives (incorrectly predicted positives), and False Negatives (incorrectly predicted negatives). While its form is simple, it becomes indispensable whenever you need more insight than overall accuracy. Below are the key scenarios where a confusion matrix shines.

    (1) Imbalanced Datasets: Reveals if the minority class is being predicted well, unlike overall accuracy.
    (2) Understanding Error Types: Shows True Positives, True Negatives, False Positives, and False Negatives, which is crucial when different errors have different costs (e.g., medical tests, fraud detection).
    (3) Multi-Class Classification: Identifies which specific classes are being confused.
    (4) Comparing Models: Enables a detailed comparison of model strengths and weaknesses beyond overall accuracy.

    Here is an example confusion matrix for a binary classifier:

                          Predicted Positive      Predicted Negative
      Actual Positive     True Positive (TP)      False Negative (FN)
      Actual Negative     False Positive (FP)     True Negative (TN)
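    The four counts can be tallied directly from the labels. A minimal pure-Python sketch (the `y_true`/`y_pred` label lists below are made up for illustration):

```python
# Hypothetical labels for a small binary task (1 = positive, 0 = negative).
y_true = [1, 1, 1, 0, 0, 0, 1, 0, 1, 0]
y_pred = [1, 0, 1, 0, 1, 0, 1, 0, 1, 0]

pairs = list(zip(y_true, y_pred))
tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # correctly predicted positive
fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # missed positive
fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false alarm
tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # correctly predicted negative

matrix = [[tp, fn],   # row: actual positive
          [fp, tn]]   # row: actual negative
```

    In practice, `sklearn.metrics.confusion_matrix` computes the same table; note that scikit-learn orders rows and columns by sorted label value, so with 0/1 labels its first row corresponds to the actual-negative class.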



  • ML0016 AUC

    What is AUC?

    Answer

    AUC (Area Under the Curve) is a measure of a model’s ability to distinguish between positive and negative classes, based on the ROC (Receiver Operating Characteristic) curve. It quantifies the area under the ROC curve, where the curve represents the trade-off between the True Positive Rate (TPR) and False Positive Rate (FPR) at various thresholds.

    AUC Range:
    1.0: Perfect classifier
    0.5: Random guessing
    Below 0.5: Worse than random guessing; in practice this usually signals inverted predictions or mislabeled data
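    One way to see what AUC measures: it equals the probability that a randomly chosen positive example receives a higher score than a randomly chosen negative one, with ties counted as half. A minimal pure-Python sketch of that pairwise definition (illustrative only, not an efficient implementation):

```python
def auc(y_true, scores):
    """AUC as the probability that a random positive outranks a random negative.

    Ties count as half a win. O(n_pos * n_neg); fine for a demonstration.
    """
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))
```

    A perfectly separating scorer yields 1.0, identical scores for every example yield 0.5, and perfectly inverted scores yield 0.0, matching the range above. `sklearn.metrics.roc_auc_score` computes the same quantity efficiently.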


  • ML0015 ROC Curve

    What is the ROC Curve, and how is it plotted?

    Answer

    The ROC (Receiver Operating Characteristic) curve is a graphical representation used to evaluate the performance of a binary classification model by comparing its True Positive Rate against its False Positive Rate at various threshold settings.

    Key Concepts:
    True Positive Rate (TPR): Also called sensitivity or recall, it measures the proportion of actual positives correctly identified.
    {\large \text{TPR} = \displaystyle\frac{\text{True Positives}}{\text{True Positives} + \text{False Negatives}}}
    False Positive Rate (FPR): The proportion of negatives incorrectly classified as positive.
    {\large \text{FPR} = \displaystyle\frac{\text{False Positives}}{\text{False Positives} + \text{True Negatives}}}
    Thresholds: Classification models output scores (often probabilities). A threshold determines the cutoff for labeling a prediction as positive or negative. The ROC curve is built by varying this threshold.

    Steps to Plot the ROC Curve:
    Train a Model: Train the binary classification model on the labelled dataset.
    Generate Probabilities: Instead of predicting class labels directly, generate probability scores for the positive class.
    Calculate TPR and FPR: Calculate the TPR and FPR for various threshold values.
    Plot the Curve: Plot the TPR against the FPR for each threshold, creating the ROC curve.

    In an ROC curve:
    The x-axis shows FPR (1 – Specificity).
    The y-axis shows TPR (Sensitivity or Recall).
    Each point represents a TPR/FPR pair for a specific threshold.
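    The plotting steps above can be sketched in pure Python: each threshold yields one (FPR, TPR) point. The labels, scores, and thresholds below are made-up examples:

```python
def roc_points(y_true, scores, thresholds):
    """Return one (FPR, TPR) point per threshold, as used to draw an ROC curve."""
    points = []
    for thr in thresholds:
        # Step 1-2 are assumed done: `scores` are the model's positive-class probabilities.
        preds = [1 if s >= thr else 0 for s in scores]
        tp = sum(p == 1 and t == 1 for p, t in zip(preds, y_true))
        fn = sum(p == 0 and t == 1 for p, t in zip(preds, y_true))
        fp = sum(p == 1 and t == 0 for p, t in zip(preds, y_true))
        tn = sum(p == 0 and t == 0 for p, t in zip(preds, y_true))
        points.append((fp / (fp + tn), tp / (tp + fn)))  # (FPR, TPR)
    return points
```

    At a threshold of 0 everything is labeled positive, giving the point (1, 1); at a threshold above the maximum score nothing is, giving (0, 0). `sklearn.metrics.roc_curve` performs the same sweep over all distinct score thresholds.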


  • ML0014 Confusion Matrix

    What is the confusion matrix?

    Answer

    A confusion matrix is a table that summarizes the performance of a classification model by comparing its predicted labels against the actual labels. For binary classification, it is typically organized into a 2×2 table containing:

    True Positives (TP): Cases where the model correctly predicts the positive class
    False Positives (FP): Cases where the model incorrectly predicts the positive class.
    False Negatives (FN): Cases where the model incorrectly predicts the negative class.
    True Negatives (TN): Cases where the model correctly predicts the negative class.

    It provides a detailed breakdown of the model’s predictions compared to the actual outcomes, which helps in understanding not only how many predictions were correct, but also the types of errors being made.
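    One way to derive the four cells is to count (actual, predicted) pairs. A small sketch using Python's `collections.Counter` (the `"pos"`/`"neg"` labels and data are hypothetical):

```python
from collections import Counter

# Hypothetical ground-truth and predicted labels for a binary task.
actual    = ["pos", "pos", "neg", "neg", "pos", "neg"]
predicted = ["pos", "neg", "neg", "pos", "pos", "neg"]

counts = Counter(zip(actual, predicted))  # keyed by (actual, predicted)
tp = counts[("pos", "pos")]  # correctly predicted positive
fp = counts[("neg", "pos")]  # predicted positive, actually negative
fn = counts[("pos", "neg")]  # predicted negative, actually positive
tn = counts[("neg", "neg")]  # correctly predicted negative
```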



  • ML0013 Accuracy

    What is accuracy?

    Answer

    Accuracy in machine learning is a metric used to evaluate the performance of a model, particularly in classification tasks. It is the ratio of correct predictions to the total number of predictions made.
    Mathematically, it’s defined as:

    {\large \text{Accuracy} = \displaystyle\frac{\text{TP} + \text{TN}}{\text{TP} + \text{TN} + \text{FP} + \text{FN}}}

    If a model correctly predicts the class for 99 out of 100 samples, its accuracy is 99%.

    True Positives (TP): The model correctly predicts the positive class.
    False Positives (FP): The model incorrectly predicts the positive class (it predicted positive, but it was actually negative).
    True Negatives (TN): The model correctly predicts the negative class.
    False Negatives (FN): The model incorrectly predicts the negative class (it predicted negative, but it was positive).



  • ML0012 F1 Score

    What is F1 Score?

    Answer

    The F1 score is a crucial metric used to evaluate the performance of classification models, particularly when there’s an imbalance between the classes. It provides a balance between Precision and Recall, combining them into a single metric.

    {\large \text{F1 Score} = \displaystyle\frac{2 \times \text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}}}
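    A small sketch of the formula, with a guard for the degenerate case where both precision and recall are zero (the helper name is ours):

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0  # convention: F1 is 0 when both inputs are 0
    return 2 * precision * recall / (precision + recall)
```

    Because the harmonic mean is dominated by the smaller value, a model cannot score a high F1 by excelling at only one of the two metrics.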


  • ML0011 Precision and Recall

    What are Precision and Recall?

    Answer

    Precision and recall are two fundamental metrics used to evaluate the performance of classification models, especially when dealing with imbalanced datasets or when the cost of different types of errors varies.

    Precision
    Precision (also known as positive predictive value) is the ratio of correctly predicted positive observations to the total predicted positives. In other words, it tells you, “When the model predicts a positive, how often is it right?” Mathematically, it’s defined as:

    {\large \text{Precision} = \displaystyle\frac{\text{True Positives (TP)}}{\text{True Positives (TP)} + \text{False Positives (FP)}}}

    For example, if a spam detector labels 100 emails as spam and 99 of them are actually spam, its precision is 99%.

    Recall
    Recall (also known as sensitivity or true positive rate) is the ratio of correctly predicted positive observations to all observations that are actually positive. It answers the question, “Out of all the actual positives, how many did the model capture?” Mathematically, it’s defined as:

    {\large \text{Recall} = \displaystyle\frac{\text{True Positives (TP)}}{\text{True Positives (TP)} + \text{False Negatives (FN)}}}

    For example, if there are 100 spam emails in total and the model correctly identifies 90 of them, its recall is 90%.

    True Positives (TP): The model correctly predicts the positive class.
    False Positives (FP): The model incorrectly predicts the positive class (it predicted positive, but it was actually negative).
    True Negatives (TN): The model correctly predicts the negative class.
    False Negatives (FN): The model incorrectly predicts the negative class (it predicted negative, but it was actually positive).
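    The two spam-detector examples above can be checked with a short sketch (the helper names are ours):

```python
def precision(tp, fp):
    """Of everything predicted positive, what fraction really was positive?"""
    return tp / (tp + fp)

def recall(tp, fn):
    """Of everything actually positive, what fraction did the model catch?"""
    return tp / (tp + fn)

# Spam-detector examples from the text:
# 100 emails flagged as spam, 99 truly spam -> precision 0.99
# 100 spam emails in total, 90 caught       -> recall 0.90
```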

