Input

The stage takes unstructured strings of data as input. To categorize text from an unstructured document, you can also use the Read from Documents stage as an input. The Read from Documents stage reads the document and returns text based on the user-defined settings. The Text Categorizer stage then reads that text and categorizes it.

Table 1. Input Format

Field Name    Description

PlainText     The unstructured string of data from which you want to extract information.
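
As a rough illustration of the input format above, the following Python sketch shows what a batch of input records could look like when each record carries a single PlainText field. It is not part of the product: the helper name to_input_records and the use of plain dictionaries are assumptions made for illustration only, and the stage itself is configured in the dataflow rather than called from code.

```python
def to_input_records(texts):
    """Wrap each non-blank string in a record with a single PlainText field.

    Hypothetical helper: it only models the record shape described in
    Table 1 and does not call any product API.
    """
    records = []
    for text in texts:
        cleaned = text.strip()
        if cleaned:  # skip strings that are empty or only whitespace
            records.append({"PlainText": cleaned})
    return records


records = to_input_records([
    "The quarterly report shows growth in Asia.",
    "   ",
    "Contact support about the billing error.",
])
```

Blank strings are dropped in this sketch because they contain nothing to categorize. Whether the stage itself accepts or rejects empty input is not stated here.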