A Quick Intro to Leave-One-Out Cross-Validation (LOOCV) | Online Statistics library

To evaluate the performance of a model on a dataset, we need to measure how well the predictions made by the model match the observed data.

The most common way to measure this is by using the mean squared error (MSE), which is calculated as:

MSE = (1/n)*Σ(y_i – f(x_i))²

where:

n: Total number of observations
y_i: The response value of the i^th observation
f(x_i): The predicted response value of the i^th observation

The closer the model predictions are to the observations, the smaller the MSE will be.

In practice, we use the following process to calculate the MSE of a given model:

1. Split a dataset into a training set and a testing set.

Training vs. testing set in machine learning

2. Build the model using only data from the training set.

LOOCV in machine learning

3. Use the model to make predictions on the testing set and measure the MSE – this is know as the test MSE. Training vs. testing set

The test MSE gives us an idea of how well a model will perform on data it hasn’t previously seen, i.e. data that wasn’t used to “train” the model.

However, the drawback of using only one testing set is that the test MSE can vary greatly depending on which observations were used in the training and testing sets.

It’s possible that if we use a different set of observations for the training set and the testing set that our test MSE could turn out to be much larger or smaller.

One way to avoid this problem is to fit a model several times using a different training and testing set each time, then calculating the test MSE to be the average of all of the test MSE’s.

This general method is known as cross-validation and a specific form of it is known as leave-one-out cross-validation.

Leave-One-Out Cross Validation

Leave-one-out cross-validation uses the following approach to evaluate a model:

1. Split a dataset into a training set and a testing set, using all but one observation as part of the training set:

LOOCV

Note that we only leave one observation “out” from the training set. This is where the method gets the name “leave-one-out” cross-validation.

2. Build the model using only data from the training set.

Build model with LOOCV approach

3. Use the model to predict the response value of the one observation left out of the model and calculate the MSE.

Leave-one-out cross-validation

4. Repeat the process n times.

Lastly, we repeat this process n times (where n is the total number of observations in the dataset), leaving out a different observation from the training set each time.

We then calculate the test MSE to be the average of all of the test MSE’s:

Test MSE = (1/n)*ΣMSE_i

where:

n: The total number of observations in the dataset
MSEi: The test MSE during the i^th time of fitting the model.

Pros & Cons of LOOCV

Leave-one-out cross-validation offers the following pros:

It provides a much less biased measure of test MSE compared to using a single test set because we repeatedly fit a model to a dataset that contains n-1 observations.
It tends not to overestimate the test MSE compared to using a single test set.

However, leave-one-out cross-validation comes with the following cons:

It can be a time-consuming process to use when n is large.
It can also be time-consuming if a model is particularly complex and takes a long time to fit to a dataset.
It can be computationally expensive.

Fortunately, modern computing has become so efficient in most fields that LOOCV is a much more reasonable method to use compared to many years ago.

Note that LOOCV can be used in both regression and classification settings as well. For regression problems, it calculates the test MSE to be the mean squared difference between predictions and observations while in classification problems it calculates the test MSE to be the percentage of observations correctly classified during the n repeated model fittings.

How to Perform LOOCV in R & Python

The following tutorials provide step-by-step examples of how to perform LOOCV for a given model in R and Python:

Leave-One-Out Cross-Validation in R
Leave-One-Out Cross-Validation in Python