Options

The InformationExtractor stage enables you to select entities for output data. It auto-assigns attributes for the entity types that were brought in to this stage. However, you can use the Quick Add function and select any or all of the 15 attributes:

Parameter

Description

Option.CategorizerName

Specifies which model to use for text categorization.

Option.CategoryCount

Specifies how many matching levels of the category should be output (closest match, closest plus second closest, etc.).

Option.EntityList

Specifies the type of data you want to extract from the unstructured string.

Specify one or more of these. Separate each entity type with a comma.

Address
CreditCard
Date
Email
HashTag
ISBN
Location
Mention
Organization
Person
Phone
ProperNouns
SSN
WebAddress
ZipCode

Option.OutputEntityCount

Specifies whether to return a count of how many times a particular entity occurred in the output.

true
Return a count of the entities found in the unstructured string.
false
Do not return a count of the entities found in the unstructured string.