Chapter 49. Common Properties of Transformers

These components have both input and output ports. They can put together more data flows with the same metadata (Concatenate, SimpleGather, and Merge), remove duplicate records (Dedup), filter data records (ExtFilter and EmailFilter), create samples from input records (DataSampler), sort data records (ExtSort, FastSort, and SortWithinGroups), multiplicate existing data flow (SimpleCopy) split one data flow into more data flows (Partition at all, but optionally also Dedup, ExtFilter, also Reformat), intersect two data flows (even with different metadata on inputs) (DataIntersection), aggregate data information (Aggregate), and perform much more complicated transformations of data flows (Reformat, Denormalizer, Pivot, Normalizer, MetaPivot, Rollup, and XLSTransformer).

Metadata can be propagated through some of these transformers, whereas the same is not possible in such components that transform data flows in a more complicated manner. You must have the output metadata defined prior to configuring these components.

Some of these transformers use transformations that have been described above. See Defining Transformations for detailed information about how transformation should be defined.

Here we present an overview of all Transformers:

Table 49.1. Transformers Comparison

Component Same input metadata Sorted inputs Inputs Outputs Java CTL
SimpleCopy-
no
11-n--
ExtSort-
no
11-n--
FastSort-
no
11-n--
SortWithinGroups-
yes
11-n--
Dedup-
yes
11-2--
ExtFilter-
no
11-2--
EmailFilter-
no
10-2--
Concatenate
yes
no
1-n1--
SimpleGather
yes
no
1-n1--
Merge
yes
yes
2-n1--
Partition-
no
11-nyes/no1)yes/no1)
DataIntersection
no
yes
23
yes
yes
Aggregate-
no
11--
Reformat-
no
11-n
yes
yes
Denormalizer-
no
11
yes
yes
Pivot-
no
11
yes
yes
Normalizer-
no
11
yes
yes
MetaPivot-
no
11--
Rollup-
no
11-n
yes
yes
DataSampler-
no
1n--
XSLTransformer-
no
11--

Legend

1) Partition can use either the transformation or two other attributes (Ranges or Partition key). A transformation must be defined unless one of these is specified.