Configure Data Integration components

Replication Agent

An agent provides the communication between your environment, where your data is located, and the Precisely Cloud. The agent requires access to your Precisely Cloud account and has the same user roles as the user you logged in as. If an agent has not been configured, click Install Replication Agent to download the latest available agent.

Runtime Engine

The Runtime Engine is the main component responsible for replicating data from the source to the target. When working with data replication, you can add or configure an engine. After you add an engine, you can use it to configure data connections. You can also monitor the current status of the runtime servers used in replication.

Replication Data Connections

Data Integration gives you the tools to configure and manage the connections used to access, extract, filter, and transfer your data in bulk, and to process change data capture replications. Add at least one data connection for each type of data source, target database, or Kafka streaming service you plan to connect to.

Metabase

Metabases are repositories of database tables and objects that define, enable, and manage data distribution replication projects. Metabases are tied to a project, and each project has a unique metabase. Metabases contain replication backlogs and metadata about which tables are enabled for capture on the source. Db2 for IBM i, Db2 for LUW, and Oracle data connections used for a replication project must be associated with a metabase. You create metabases in Data Integration, and they are saved on the database server accessed by the JDBC data connection. Metabases are stored in the database on the source system. Each metabase for a Db2 for IBM i connection is assigned a journal, and that journal is associated with a log reader. Each time replication project changes are applied, information is written to the metabase.

Replication Pipeline

A replication pipeline is a collection of mappings from source to target. Every project contains one or more replication pipelines. You configure replication pipeline properties that, when the replication pipeline is started, move source data to a target database in bulk or run data capture and replication processes.

How data flows work depends on the type of project you are working with:

Replication projects. Replication is managed at the replication pipeline level. Configure a replication pipeline to enable or disable data capture on a per database table or per data connection basis, and to start and stop replication for the associated data. You then stage and apply changes for the project configuration into your replication environment.
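
To make the pipeline-level controls described above more concrete, the following sketch models a replication pipeline as a collection of source-to-target mappings with a per-table capture switch. It is only a conceptual illustration, not a Precisely API: the class names, the method names, and the table names are all hypothetical, and the assumption that only capture-enabled tables take part when the pipeline starts is a simplification made for this example.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Mapping:
    """One source-to-target table mapping (hypothetical model)."""
    source_table: str
    target_table: str
    capture_enabled: bool = False


@dataclass
class ReplicationPipeline:
    """Illustrative stand-in for a replication pipeline; not a product API."""
    name: str
    mappings: List[Mapping] = field(default_factory=list)
    running: bool = False

    def set_capture(self, source_table: str, enabled: bool) -> None:
        # Enable or disable data capture for a single source table.
        for mapping in self.mappings:
            if mapping.source_table == source_table:
                mapping.capture_enabled = enabled
                return
        raise KeyError(f"no mapping for source table {source_table!r}")

    def start(self) -> List[str]:
        # Assumption for this sketch: only capture-enabled tables replicate.
        self.running = True
        return [m.source_table for m in self.mappings if m.capture_enabled]

    def stop(self) -> None:
        # Stop replication for all data associated with this pipeline.
        self.running = False


# Hypothetical table names, used only to exercise the model.
pipeline = ReplicationPipeline(
    name="orders_pipeline",
    mappings=[
        Mapping("SALES.ORDERS", "DW.ORDERS"),
        Mapping("SALES.CUSTOMERS", "DW.CUSTOMERS"),
    ],
)
pipeline.set_capture("SALES.ORDERS", True)
active_tables = pipeline.start()
print(active_tables)
```

In this model, capture is toggled per table while start and stop act on the pipeline as a whole, which mirrors the split between table-level and pipeline-level settings described above.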