Data Science

Test Data

A separate portion of data held back from training that is used to evaluate a model's performance on unseen examples. Test data provides an unbiased estimate of how well the model will perform in the real world.

Why It Matters

Test data is your reality check — it reveals whether your model actually learned generalizable patterns or just memorized training examples.

Example

After training a spam detector on 80% of your emails, you test it on the remaining 20% to see how accurately it classifies emails it has never seen.

Think of it like...

Like a final exam that covers material the student studied but uses different questions — it tests understanding, not memorization.

Related Terms