Skip to main content

Model Validation

Model validation ensures that machine learning models generalise well to unseen data by assessing their performance using independent validation sets

Selection Model Validation in evoML

  1. Create a New Trial
  2. Under Splitting Options, select Model Validation method. Existing model validation approaches:
    • Holdout (all tasks)
    • K-fold cross validation (classification & regression)
    • Sliding window (timeseries)
    • Expanding window (timeseries)

Model Validation Options

1. Holdout (all models)

The dataset is split into two subsets: one for training and one for validation.

SettingDetails
SizeThe fraction of the training dataset to include in the validation subset.
Keep orderWhether or not to shuffle the data.

2. K-fold Cross Validation (Classification & Regression)

The dataset is divided into K subsets. The model is trained on K-1 subsets and validated on the remaining one, repeating the process for each subset.

SettingDetails
KNumber of subsets into which to divide the training data.
Keep orderWhether or not to shuffle the data.

3. Sliding Window (Timeseries)

The model is trained on a fixed-length training window and validated on a forecast window. Both windows move forward in time by a defined slide length between rounds. An optional gap can be added between them.

SettingDetails
Evaluation WindowNumber of time steps in the forecast window (typically 15% of the dataset).
Train WindowNumber of time steps in the training window (typically 45% of the dataset).
SlideNumber of time steps by which both windows move forward between rounds (typically 15% of the dataset).

4. Expanding Window (Timeseries)

The model is trained on an initially defined training window, which expands over time. The forecast window moves forward by a defined expansion length in each round. An optional gap can be added between them.

SettingDetails
Evaluation WindowNumber of time steps in the forecast window (typically 15% of the dataset).
Initial Train WindowNumber of time steps in the first round of training (typically 45% of the dataset).
Expansion LengthNumber of time steps by which the training window grows in each subsequent round (typically 15% of dataset).