Data Science

ETL

Extract, Transform, Load — a data integration process that extracts data from source systems, transforms it into a usable format, and loads it into a destination system.

Why It Matters

ETL is the backbone of most data infrastructure. Without it, raw data stays siloed in its source systems, often in inconsistent formats, and cannot readily feed analytics or AI/ML applications.

Example

Extracting customer data from Salesforce, transforming it by cleaning addresses, normalizing phone numbers, and deduplicating records, then loading it into a data warehouse.
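
The three stages in this example can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not a production pipeline: the in-memory RAW_CUSTOMERS list stands in for a Salesforce export, an in-memory SQLite database stands in for the data warehouse, and the field names, the US-style phone rule, and the choice of email as the deduplication key are all hypothetical.

```python
import re
import sqlite3

# Hypothetical rows standing in for a Salesforce export (field names are assumptions).
RAW_CUSTOMERS = [
    {"email": "Ann@Example.com", "phone": "(555) 010-1234", "address": " 12  main st. "},
    {"email": "ann@example.com", "phone": "555.010.1234", "address": "12 Main St."},
    {"email": "bob@example.com", "phone": "+1 555 010 9999", "address": "7 oak  avenue"},
]


def extract():
    """Extract: return raw records. A real pipeline would query the source system here."""
    return list(RAW_CUSTOMERS)


def normalize_phone(phone):
    """Keep digits only and drop a leading US country code (assumed format)."""
    digits = re.sub(r"\D", "", phone)
    if len(digits) == 11 and digits.startswith("1"):
        digits = digits[1:]
    return digits


def clean_address(address):
    """Trim the address, collapse repeated whitespace, and title-case it."""
    return " ".join(address.split()).title()


def transform(rows):
    """Transform: clean each record and deduplicate on lower-cased email."""
    cleaned = {}
    for row in rows:
        email = row["email"].strip().lower()
        if email in cleaned:
            continue  # keep the first occurrence of each customer
        cleaned[email] = {
            "email": email,
            "phone": normalize_phone(row["phone"]),
            "address": clean_address(row["address"]),
        }
    return list(cleaned.values())


def load(rows, conn):
    """Load: write the cleaned records into the destination table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS customers "
        "(email TEXT PRIMARY KEY, phone TEXT, address TEXT)"
    )
    conn.executemany(
        "INSERT OR REPLACE INTO customers VALUES (:email, :phone, :address)", rows
    )
    conn.commit()


def run_etl(conn):
    """Run the three stages in order: extract, then transform, then load."""
    load(transform(extract()), conn)
```

Calling run_etl(sqlite3.connect(":memory:")) leaves two rows in the customers table, because the two spellings of Ann's email collapse into one record. Keeping each stage as its own function makes it easy to swap the source or destination without touching the cleaning rules.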

Think of it like...

A food processing plant: raw ingredients come in, get washed, cut, and packaged, and end up on shelves ready for use.

Related Terms