Example: Using Match Analysis

This example demonstrates how to use the Match Analysis tool to compare the lift/drop rates of two different matches. Before the data is sent through a matcher, it is split into two streams using a Broadcaster. Each stream is then sent through an Intraflow Match stage. Each data stream includes identical copies of the processed data. Each Intraflow Match stage uses different matching algorithm and generates Match Analysis data that you can use to compare the lift/drop of various matches.

Dataflow for Match Analysis

This example dataflow is available in Enterprise Designer. Go to File > New > Dataflow > From template and select HouseholdRelationshipsAnalysis. This dataflow requires the following products: Advanced Matching, Data Normalization, and Universal Name. It also requires you to load the Table Lookup core database and the Open Parser base tables.

To use view this example:

  1. Run the dataflow.
  2. Select Tools > Match Analysis.
  3. From Browse Match Results window, expand HouseholdRelationshipAnalysis, select Household Match 1 and Household Match 2 from the Source list, and then click Add.
  4. Select Household Match 1 in the Match Results List and click Compare. This displays results on the Summary tab.
  5. Click the Lift/Drop tab. This displays the Lift/Drop chart.
    Lift/Drop chart in Match Analysis

    This chart shows the differences between the duplicate and unique records generated for the different match rules used.

  6. Click the Match Rules tab. This displays the match rules comparison.

    Match rules comparison on the Match Rules tab

    From this tab you can see that the algorithm has been changed; Character Frequency is omitted and Exact Match has been added.

  7. Click Details.
  8. Select Duplicate Collections from the show list and then click Refresh.
  9. Expand each CollectionNumber to view the Suspect and Duplicate records for each duplicate collection.
    Collection numbers expanded
  10. Compare the collections in the Detail view to the output file created.