Intraflow Match

Intraflow Match locates matches between similar data records within a single input stream. You can create hierarchical rules based on any fields that have been defined or created in other stages of the dataflow.
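
As a rough illustration of matching within a single stream, the sketch below groups records by a match key and compares the first record in each group (the suspect) against the remaining records (the candidates). The field names MatchKey and Name, the similarity function, the threshold of 80, and the rule that unmatched candidates are retried as new suspects are all assumptions made for this example. They do not represent the product's rule format or scoring algorithm.

```python
from collections import defaultdict
from difflib import SequenceMatcher


def similarity(a, b):
    """Hypothetical scorer: 0-100 string similarity (not the product's algorithm)."""
    return round(SequenceMatcher(None, a.lower(), b.lower()).ratio() * 100)


def intraflow_match(records, threshold=80):
    """Return (suspect, duplicates) pairs found within one stream of records."""
    # Records that share a match key form one match group.
    groups = defaultdict(list)
    for rec in records:
        groups[rec["MatchKey"]].append(rec)

    collections = []
    for group in groups.values():
        remaining = list(group)
        while remaining:
            # The first unprocessed record is the suspect; the rest are candidates.
            suspect, candidates = remaining[0], remaining[1:]
            duplicates, remaining = [], []
            for cand in candidates:
                score = similarity(suspect["Name"], cand["Name"])
                if score >= threshold:
                    duplicates.append(cand)
                else:
                    # Assumption: unmatched candidates are retried as suspects.
                    remaining.append(cand)
            collections.append((suspect, duplicates))
    return collections


records = [
    {"MatchKey": "SMTH", "Name": "John Smith"},
    {"MatchKey": "SMTH", "Name": "Jon Smith"},
    {"MatchKey": "JNS", "Name": "Mary Jones"},
]
result = intraflow_match(records)
```

In this sample, "Jon Smith" is grouped as a duplicate of the suspect "John Smith", while "Mary Jones" is the only record in its match group and so has no duplicates.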

Reporting

Intraflow Match provides counters that you can use to monitor the results of a job. The available counters are:
INPUT_RECORDS
The number of records in the matching stage before the matching sort is performed.
DUPLICATE_RECORDS
The number of duplicate records within a match group. A duplicate can be either a suspect or a candidate record.
UNIQUE_RECORDS
The number of suspect or candidate records that do not match any other record in their match group.

A suspect that is the only record in its match group is automatically unique.

MATCH_GROUPS
(Group By) The number of match groups, which are sets of records grouped together by a match key.
DUPLICATE_COLLECTIONS
The number of duplicate collections, which consist of a suspect and its duplicate records grouped together by a CollectionNumber.
EXPRESS_MATCHES
The number of Express Matches made in a collection.

An Express Match is made when a suspect and candidate have an exact match on the contents of a designated field, usually an ExpressMatchKey provided by the Match Key Generator. If an Express Match is made, no further processing is done to determine if the suspect and candidate are duplicates.

AVERAGE_SCORE
The average match score of all duplicates.

Scores range from 0 to 100, where 0 indicates a poor match and 100 indicates an exact match.

TOTAL_DUPLICATES
The total number of duplicates found.
TOTAL_SCORE
The total match score of all duplicates.
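
To show how several of these counters could relate to one another, the sketch below tallies them from a small list of match output records. The record layout (MatchRecordType, CollectionNumber, MatchScore, ExpressMatched) and the summarize function are hypothetical, and the formula AVERAGE_SCORE = TOTAL_SCORE / TOTAL_DUPLICATES is an assumption inferred from the counter descriptions, not a documented calculation.

```python
def summarize(records):
    """Tally report-style counters from hypothetical match output records."""
    duplicates = [r for r in records if r["MatchRecordType"] == "Duplicate"]
    total_score = sum(r["MatchScore"] for r in duplicates)
    return {
        "INPUT_RECORDS": len(records),
        # Assumption: unique records are those flagged with type "Unique".
        "UNIQUE_RECORDS": sum(r["MatchRecordType"] == "Unique" for r in records),
        # One collection per distinct CollectionNumber that contains duplicates.
        "DUPLICATE_COLLECTIONS": len({r["CollectionNumber"] for r in duplicates}),
        "EXPRESS_MATCHES": sum(r["ExpressMatched"] for r in duplicates),
        "TOTAL_DUPLICATES": len(duplicates),
        "TOTAL_SCORE": total_score,
        # Assumed relationship: average = total score / number of duplicates.
        "AVERAGE_SCORE": total_score / len(duplicates) if duplicates else 0.0,
    }


# Sample data is invented; the express-matched duplicate is given a score
# of 100 purely for illustration.
records = [
    {"MatchRecordType": "Suspect", "CollectionNumber": 1, "MatchScore": 0, "ExpressMatched": False},
    {"MatchRecordType": "Duplicate", "CollectionNumber": 1, "MatchScore": 90, "ExpressMatched": False},
    {"MatchRecordType": "Duplicate", "CollectionNumber": 1, "MatchScore": 100, "ExpressMatched": True},
    {"MatchRecordType": "Unique", "CollectionNumber": 0, "MatchScore": 0, "ExpressMatched": False},
]
counters = summarize(records)
```

With this sample, the two duplicates have scores of 90 and 100, so TOTAL_SCORE is 190 and, under the assumed formula, AVERAGE_SCORE is 95.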