Data Science

Structured Data

Data organized in a predefined format with clear rows and columns, like spreadsheets and relational databases. Each field has a defined type and meaning.

Why It Matters

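Structured data is usually the easiest kind of data for machine learning to consume, because every record already has the same named, typed fields. Traditional ML algorithms such as gradient-boosted trees (XGBoost) and random forests take tabular data as input with little transformation.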
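
As a rough illustration of why tabular data suits these algorithms, the sketch below turns a few structured records into a fixed-width numeric feature matrix, the input shape tree-based models generally expect. The records, the reference_day value, and the to_features helper are all hypothetical and exist only for this example.

```python
from datetime import date

# Hypothetical structured records: every row has the same typed fields.
records = [
    {"customer_id": 1, "signup_date": date(2023, 1, 15), "total_purchases": 249.90},
    {"customer_id": 2, "signup_date": date(2023, 3, 2), "total_purchases": 75.00},
]

# Assumed reference date used to turn signup_date into a number.
reference_day = date(2024, 1, 1)


def to_features(record):
    """Map one record to a fixed-length list of floats."""
    days_since_signup = (reference_day - record["signup_date"]).days
    return [float(days_since_signup), record["total_purchases"]]


# One row per record, one column per feature.
feature_matrix = [to_features(r) for r in records]
```

Because each record has the same fields, each row of feature_matrix has the same length, so it could be handed to a tabular model without any parsing step.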

Example

A customer database with columns: customer_id (integer), name (string), email (string), signup_date (date), total_purchases (float) — each field clearly defined.
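
The customer table above could be sketched with Python's built-in sqlite3 module as follows. This is a minimal sketch under stated assumptions: the table name customers and the sample rows are invented for illustration, and signup_date is stored as ISO 8601 text because SQLite has no dedicated date type.

```python
import sqlite3

# In-memory database; the table name "customers" is a hypothetical choice.
conn = sqlite3.connect(":memory:")

# Each column gets a declared type. signup_date is stored as ISO 8601 text.
conn.execute(
    """
    CREATE TABLE customers (
        customer_id     INTEGER PRIMARY KEY,
        name            TEXT NOT NULL,
        email           TEXT NOT NULL,
        signup_date     TEXT NOT NULL,
        total_purchases REAL NOT NULL
    )
    """
)

# Illustrative rows, made up for this sketch.
rows = [
    (1, "Ada Example", "ada@example.com", "2023-01-15", 249.90),
    (2, "Bo Sample", "bo@example.com", "2023-03-02", 75.00),
]
conn.executemany("INSERT INTO customers VALUES (?, ?, ?, ?, ?)", rows)

# Known column names and types make filtering and aggregation direct.
big_spenders = conn.execute(
    "SELECT name FROM customers WHERE total_purchases > ? ORDER BY customer_id",
    (100,),
).fetchall()
total = conn.execute("SELECT SUM(total_purchases) FROM customers").fetchone()[0]
```

Here big_spenders holds the names of customers whose total_purchases exceeds 100, and total is the sum over all rows; neither query needs any parsing or guessing about what a field means.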

Think of it like...

A well-organized filing cabinet with labeled folders and standardized forms: everything has its place, and you can find anything quickly.

Related Terms

Unstructured Data

Semi-Structured Data

Data Preprocessing