Input Parameters
Parameter | Description |
---|---|
Group-By Option | For a MapReduce job, pass the arguments:
|
Match Rule | Define as many parent and child rules as required, to create a
MatchRule object.For more information, see MatchRule. |
Input File | For text files:
Attention: Invoke the appropriate constructor of
For ORC format files:FilePath .
|
Output File | For text files:
Attention: Invoke the appropriate constructor of
For ORC format files:FilePath .
|
Job Configurations | The Hadoop configurations for the job. For a MapReduce job, the instance must be of type MRJobConfig. For a Spark job, the instance must be of type SparkJobConfig. |
Job Name | The name of the job. |
Express Match Column | The name of the column to be used for express matching of records. |
Setting Collection Number Zero to Unique Records | Set this to true to set the collection number of unique records as 0 (zero). |
Compress Output | Flag to indicate if the output must be compressed. Set this to true to compress the output. |
Match Key Settings | A combination of the columns and the algorithms to be applied to generate the match
key, required to perform the matching. Note: Specify only one match key.
Attention: Set the match key settings only if you wish to generate a match key before
performing the matching.
|