Write to DB
date
datatype as
String
values. This is the behavior of the jTDS
driver, which is the default driver used by Spectrum. To handle all
date
datatype values as is, use Microsoft's JDBC driver. - Connectivity to Hive from Spectrum on Windows
- Support and connectivity to Hive version 2.1.1 from Spectrum with high availability
- Support to Read and Write from Hive DB (JDBC) via Model Store connection
Also see Best Practices for connecting to HDFS 3.x and Hive 2.1.1.
Using the Write to DB Stage
- On the Spectrum Flow Designer Home page, click New.
- On the New Flow page, click Job, Service, or Subflow, as required and then click the corresponding blank canvas.
- Click OK.
- In the dialog box that appears, give a name to the Flow, Job, Service, or Subflow you are creating.
- Click OK.
- From the Palette Panel, drag the Write to DB stage to the canvas.
- Connect the required Sources and
Sinks to the stage and configure those. Note: For details on how to configure the Sources and the Sinks stages, see the respective sections.
- Click Write to DB and configure the stage as described in the following sections.
Configuring the General tab
Field | Description |
---|---|
Connection | From the drop down list, select the connection for the
database you want to use. The dropdown displays all
the connections you have configured. To configure a new connection or edit any
existing connections, click the Manage Connections link. It
takes you to the Connections page in the
Spectrum Management Console. For details on configuring new
connections, see Connections. Note: Click the Refresh icon to update any modification, deletion or addition
to your list of connections. |
Table/View | Specify the table to which you want to write the
data.
The fields of the selected table are displayed in the
Fields section on the right side of the page. These are
the details displayed:
|
Create Table: The Create Table button supports table
creation in these databases:
Note: For DB2 databases, if you try to create a table and
the page size is smaller than the total length of all string
columns, you will get an error that says "Failed to build
body from content. Serializable class not available to
broker." To create a new table in the
database:
|
Configuring the Runtime tab
Option Name |
Description |
---|---|
Write Mode |
Specify the action to take when writing to the database:
|
Batch commit |
Select this option to commit changes to the database after a specified number of records are processed. By default this option is not selected, which means that changes are committed after each record is processed. Selecting this option can significantly improve the performance of the Write to DB stage. |
Batch size |
If you enable the Batch commit option, this specifies the number of records to commit to the database in each batch. The default is 1,000. For dataflows created in Spectrum Technology Platform 7.0 and earlier, the default is 100. A larger batch size does not always offer better load performance. Consider these factors when choosing a batch size:
|
Commit at the end | Select this option to ensure that the commit to database operation occurs after all the records are transferred to the database. |
Batch count to commit | Specify a value after which the records are to be committed. Records are committed to the database after every (batch count to commit * batch size) number of records are transferred to the database. For example, if Batch size is set as 1000 and Batch count to commit is set as 3, then the commit occurs after every 3000 records are transferred to the database. |
Truncate table before inserting data |
Select this option if you want to clear all data from the table before writing to the database. |
Drop and recreate the table if it already exists |
Select this option to delete and recreate the table before writing the dataflow's output to the table. This option is useful if you want the table's schema to match the fields from the dataflow and not contain any extraneous schema information. The table that will be deleted and recreated is the one specified in the Table/View field on the General tab. For example, if you specify the Customers table in the Table/View field, and you select Drop and recreate the table if it already exists, then the Customers table will be deleted from the database, and a new table named Customers will be created with a schema that matches the actual fields written to the table. |