Components of the SDK Java API
The key components to use a BDQ SDK job using the Java API are:
- JAR Files
-
- Hadoop JAR files.
- The JAR files of the module to which the desired BDQ SDK job belongs, as indicated
in the table:
Module Job JAR File Advanced Matching Module All AMM jobs amm.core-<JarVersion>.jar Data Integration Module All DIM jobs dim.core-<JarVersion>.jar Data Normalization Module All DNM jobs dnm.core--<JarVersion>.jar Global Addressing Module Global Address Validation gam-global addressvalidation.core--<JarVersion>.jar Universal Addressing Module Validate Address uam-universaladdress.core--<JarVersion>.jar Universal Addressing Module Validate Address Global uam-global.core--<JarVersion>.jar Universal Addressing Module Validate Address Loqate uam-loqate.core--<JarVersion>.jar Universal Name Module All UNM jobs unm.core--<JarVersion>.jar
- Configuration Files
- Files in XML format containing all parameters and values required to run a job,
including match rules, input file details, output file details, and MapReduce or Spark
configuration details.
Sample configuration XML files are placed at the location <Big Data Quality bundle>\samples\configuration.
- Client Java Application
- Java application to use the API to create and run the required BDQ SDK job provided by its Java API.
- Hadoop Platform
- The created job accesses the configured Hadoop platform to access input data and dump the output data in a file.