Categorizing Text

  1. Create a data flow that includes a source stage like Read from File or Read from XML, the Text Categorizer stage, and a sink stage like Write to File or Write to XML.
  2. In the source stage, point to your input file.
  3. In the Text Categorizer stage, select the model in the Categorizer name field. This is the model you trained in the text categorization phase. For information about training a model, see Training the Model.
  4. In the Category count field, select the number of matching levels of category that should be included in the output. For example, the closest match or closest plus the second close match.
    Note: The maximum value corresponds to the number of different categories specified while training the model.
  5. Click OK.
  6. Run the job.