What is Transformation? Explain Different Types of Transformations

Data transformation is the process of extracting required data from a data source and is the most critical SSIS step. Post extraction, the process aids in managing and transferring the data to a specific file destination. There are several rules implemented by this process for loading the extracted data to the destination target file. Based on this, the transformations are classified as:



Blocked
Non Blocked
Semi Blocked


Non-Blocking – No blocking

Partial Blocking – The downstream transformations wait for certain periods, it follows start then stop and start over technique

Full BlockingThe downstream has to be waiting till the data has been released from the upstream transformation.

Non-blocking transformations

  • Audit
  • Cache Transform
  • Character Map
  • Conditional Split
  • Copy Column
  • Data Conversion
  • Derived Column
  • Export Column
  • Import Column
  • Lookup
  • Multicast
  • OLE DB Command
  • Percentage Sampling
  • Script Component
  • Slowly Changing Dimension

Partial blocking transformations

  • Data Mining
  • Merge
  • Merge Join
  • Pivot
  • Unpivot
  • Term Lookup

Fully Blocking Transformations

  • Aggregate
  • Fuzzy grouping
  • Fuzzy lookup
  • Row Sampling
  • Sort
  • Term Extraction

If you clearly observe Sort is a fully blocking transformation, so it’s better to sort your data using the SQL command in OLE DB Source instead of using sort transformation. Merge transform requires Sort but not Union All, so use Union All wherever possible.

Comments

Popular posts from this blog

Difference between ETL vs. ELT

What are the for each loop enumerators available in SSIS?

What are the SSIS package protection levels?