Skip to main content

ETL

ETL- Extract, Transfer, Load

  • ETL stands for Extract, Transform, Load. It is a process used to extract data from various sources, transform it into a format that is suitable for analysis, and then load it into a data warehouse or data lake for further processing.

  • ETL processing involves tasks such as data extraction, data cleansing, data transformation, and data loading. It plays a crucial role in data integration and ensuring data quality for analytics and reporting purposes.

  • ETL processing

ETL Pipelines

ELT - Extract, Load, Transform

Extract

  • Incremental Extraction: only records that have changes since the last extraction
  • Partial with Update Notification

Load

incremental

  • streaming
  • batch

full

Open Source Tools

  • Talend Open Studio
  • Apache Nifi
  • CloverETL
  • Jaspersoft ETL
  • Kettle (Pentaho Data Integration)