Validate predictive models using train-test splits and cross-validation
You have one or more candidate models and must produce reliable estimates of test error.
Split the data into training and test sets (typically 70/30 or 80/20). Fit and tune on the training set using k-fold CV; report the final held-out error on the test set. For small datasets, lean on k-fold CV directly. Never tune on the test set — that contaminates the error estimate.
Tune on train (via CV); report on test (untouched).