Input Parameters

Parameter Description
Input File For text files:
File Path
The path of the input text file.
Record Separator
The record separator used in the input file.
Header Row Fields
An array of the header fields of the input file.
Skip First Row
Flag to indicate if the first row must be skipped while reading the input file records.

This must be true in case the first row is a header row.

Note: Invoke the appropriate constructor of FilePath.
For ORC format files:
ORC File Path
The path of the input ORC format file on the Hadoop platform.
For Parquet format files:
Parquet File Path
The path of the input Parquet format file on the Hadoop platform.
Index Name The name of the search index.
Complex Search Query The type of search query selected.
Note: Only Complex Search Query is supported.
Index Output Fields The field names for the search index output.
Output File For text files:
File Path
The path of the output text file on the Hadoop platform.
Note: Invoke the appropriate constructor of FilePath.
Record Separator
The record separator used in the output file.
For ORC format files:
ORC File Path
The path of the output ORC format file on the Hadoop platform.
For Parquet format files:
Parquet File Path
The path of the output Parquet format file on the Hadoop platform.
Job Configurations The Hadoop configurations for the job.

For a MapReduce job, the instance must be of type MRJobConfig. For a Spark job, the instance must be of type SparkJobConfig.

Job Name The name of the job.
Fetch Batch Size The size of batches in which you want the results to be processed. Default is 10000.
Maximum Results The maximum number of responses you want the index search to return. The default is 10.
Starting Record The record number on which search results should begin. The default is 1.