Components of the SDK Java API

The key components to use a Big Data Quality SDK job using the Java API are:

JAR Files

Hadoop JAR files.

The JAR files of the module to which the desired Big Data Quality SDK job belongs, as indicated in the table:

Module	Job	JAR File
Advanced Matching Module	All AMM jobs	amm.core-12.0.jar
Data Normalization Module	All DNM jobs	dnm.core-12.0.jar
Universal Addressing Module	Validate Address	uam-universaladdress.core-12.0.jar
Universal Addressing Module	Validate Address Global	uam-global.core-12.0.jar
Universal Addressing Module	Validate Address Loqate	uam-loqate.core-12.0.jar
Universal Name Module	All UNM jobs	unm.core-12.0.jar

Configuration Files

Files in XML format containing all parameters and values required to run a job, including match rules, input file details, output file details, MapReduce or Spark configuration details, and the like.

Sample configuration XML files are placed at the location <Big Data Quality bundle>\samples\configuration.

Client Java Application

Java application to use the API to create and run the required Big Data Quality SDK job provided by its Java API.

Hadoop Platform

The created job accesses the configured Hadoop platform to access input data and dump the output data in a file.