Data Science

Annotation

The process of adding labels, tags, or metadata to raw data to make it suitable for supervised machine learning. Annotation can involve labeling images, transcribing audio, or tagging text.

Why It Matters

Annotation quality directly determines model quality. High-quality annotations are expensive but essential — they are the ground truth your model learns from.

Example

Annotators drawing bounding boxes around every pedestrian in thousands of dashcam frames, labeling each with attributes like 'walking,' 'standing,' or 'crossing.'

Think of it like...

Like an art teacher who marks up student drawings with corrections and guidance — the annotations show the model what 'correct' looks like.

Related Terms