Data Science

Data Drift

A change in the statistical properties of the input data over time compared to the data the model was trained on. When data drifts, model predictions become less reliable.

Why It Matters

Data drift is the #1 reason production models degrade. Detecting it early prevents months of poor predictions before anyone notices.

Example

A credit scoring model trained on pre-pandemic data encountering post-pandemic spending patterns — remote work, less travel, more online shopping — that differ from training data.

Think of it like...

Like using a map from 10 years ago to navigate a city where new roads have been built and old ones closed — the guidance becomes less reliable as the world changes.

Related Terms