Data Normalization Module

The Data Normalization Module examines terms in a record and determines if the term is in the preferred form.

Components

The Data Normalization Module consists of:

  • Advanced Transformer—This stage scans and splits strings of data into multiple fields, placing the extracted and non extracted data into an existing field or a new field.
  • Open Parser—This stage parses your input data from many cultures of the world using a simple, but powerful parsing grammar. Using this grammar, you can define a sequence of expressions that represent domain patterns for parsing your input data. Open Parser also collects statistical data and scores the parsing matches to help you determine the effectiveness of your parsing grammars.
  • Table Lookup—This stage evaluates a term and compares it to a previously validated form of that term. If the term is not in the proper form, then the standard version replaces the term. Table Lookup includes changing full words to abbreviations, changing abbreviations to full words, changing nick names to full names or misspellings to corrected spellings.
  • Transliterator—Transliterator converts a string between Latin and other scripts.