Field Parser
The Field Parser stage extracts fields from XML and delimited data in the specified input column. To configure the Field Parser options, perform the following tasks.
- From the Source field, select the column that contains the XML or delimited data to be parsed. Note: The drop-down list displays all the string input columns.
- Select the XML or Delimited format, depending on the type of data you want to parse, and then set the corresponding options described below.
Field Parser Options for XML Data
Option Name | Description |
---|---|
Server name | Indicates whether the file selected for inferring the schema is located on the computer running Spectrum Enterprise Designer or on the server. If you select a file on the local computer, the server name is My Computer. If you select a file on the server, the server name is Spectrum Technology Platform. |
Schema file | Specifies the path to an XSD schema file. Click the ellipsis button (...) to navigate to the file location. The schema file can reside on the server or on your local system. Alternatively, you can specify an XML file instead of an XSD file. If you specify an XML file, the schema is inferred from the structure of the XML file. Using an XML file instead of an XSD file has the following limitations: Note: If the Spectrum Technology Platform server is running on Linux, remember that file names and paths on that platform are case sensitive. |
Output Fields | This section displays details of the selected schema. It includes the root element followed by the child elements along with their attributes. By default, all the nodes of the schema are selected. However, you can clear the check box of any node that you do not want passed to the next stage. |
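As a rough illustration of what the XML options amount to conceptually, the following Python sketch parses an XML string held in one input column and passes on only the selected child nodes. It is not Spectrum code: the function name `parse_xml_column`, the dictionary-based record, and the sample element names are assumptions made for this example.

```python
import xml.etree.ElementTree as ET


def parse_xml_column(record, source, selected):
    """Hypothetical helper: copy `record` and add the selected child
    elements of the XML stored in record[source] as new fields."""
    root = ET.fromstring(record[source])
    out = dict(record)
    for child in root:
        # Only nodes left selected are passed on to the next stage.
        if child.tag in selected:
            out[child.tag] = child.text
    return out


# Assumed sample row: the "payload" column holds the XML to parse.
row = {"id": "1", "payload": "<customer><name>Ann</name><city>Troy</city></customer>"}
parsed = parse_xml_column(row, "payload", {"name"})
print(parsed)
```

Here `city` is left out of `selected`, which mirrors clearing the check box for a node in Output Fields.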
Field Parser Options for Delimited Data
Option Name | Description |
---|---|
Field separator | From the drop-down list, select the field separator used in the delimited column to be parsed. If the delimited column uses a different character as a field separator, click the ellipsis button to select another character as the field separator. |
Text qualifier | From the drop-down list, select the text qualifier used in the delimited column to be parsed. Note: A text qualifier is the character used to surround text values in delimited data. If the delimited column uses a different text qualifier, click the ellipsis button to select another character as the text qualifier. |
Output type | Select whether you want the parsed output as a List (hierarchical display of values) or as Fields. Note: With List as the output type, you can add only one output field, whereas the Fields option allows you to add multiple fields into which the parsed values are separated. |
Output Fields | This section allows you to add or modify the fields into which the contents of the delimited column are separated. You can also delete any of the added output fields. To add a new field for the parsed output, click the Add button, and perform these steps in the Field Setting pop-up that is displayed: The added field and its details are displayed in the box. Note: To remove any excess space characters from the beginning and end of a field's value string, select the Trim check box. Modify: Click this button to modify details of any of the added output fields. Remove: Click this button to delete any of the added output fields. |
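The delimited options above can be sketched in Python with the standard `csv` module. This is only an approximation of the behavior described in the table, not the stage's implementation: the function `parse_delimited` and its parameter names are hypothetical, chosen to mirror the Field separator, Text qualifier, Trim, and Output type options.

```python
import csv
import io


def parse_delimited(value, separator=",", qualifier='"', trim=False, names=None):
    """Hypothetical helper: split one delimited string into values.

    Returns a list when `names` is None (similar to the List output type),
    otherwise a dict keyed by the given field names (similar to Fields).
    """
    reader = csv.reader(io.StringIO(value), delimiter=separator, quotechar=qualifier)
    values = next(reader)
    if trim:
        # Remove excess spaces from the beginning and end of each value.
        values = [v.strip() for v in values]
    if names is None:
        return values
    return dict(zip(names, values))


as_list = parse_delimited('a|" b c "|d', separator="|", trim=True)
as_fields = parse_delimited('Ann,"Troy, NY",42', names=["name", "city", "age"])
print(as_list)
print(as_fields)
```

In the second call, the text qualifier keeps the comma inside `"Troy, NY"` from being treated as a field separator.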
Runtime: Click this button to specify multiple runtime instances of the parser. Running multiple instances can improve performance.
OK: Click this button to save all the details entered in this stage.
Cancel: Click this button to cancel all the updates you made.
Help: Click this button to refer to the help file for this stage.