Error Analysis

Original Source: https://www.coursera.org/specializations/deep-learning

Picking a portion of the misclassified development examples and analyzing them manually to find out what the model is failing at is called ‘error analysis’.

Let’s assume we want to classify cat images. After training a model and validating it, we pick 100 images that classifier predicted as cat but actually was not a cat. It is efficient to draw a table like below to analyze features of misclassified examples.

error analysis

According to the above table, 61% of the misclassified images were blurry. So we should prioritize on having our model do better on blurry images.

Among these 100 images, there can be images that the labelr labeld as cat but actually is not a cat. In the table, there is a column named ‘incorrectly labeled’. This is where we check if the label is incorrect. In this case, only 6% was incorrectly labeld so we do not have to prioritize on correcting labels.

Machine Learning Workflow

Set up train/developement/test sets
Set up metrics
Build an initial system quickly
Train on train set, then validate on development set.
Use bias/variance analysis and error analysis to prioritize next steps
Iterate to get better model

Share on

Twitter Facebook Google+ LinkedIn

YoonSoo

Error Analysis

Machine Learning Workflow

Share on

Leave a Comment

You May Also Enjoy

Generalized Linear Models (GLM)

“ALBERT: A Lite BERT for Self-supervised Learning of Language Representations” Summarized

“Generative Pretraining from Pixels” Summarized

“Language Models are Few-Shot Learners” Summarized