Installing SDK on Linux

To install the Big Data Quality SDK using command line on a Linux machine, follow the steps below:

  1. Download the Big Data Quality SDK using the download instructions contained in your welcome email or the release announcement email.
  2. Extract all files from the archive to a location on the server where you want to install the Big Data Quality SDK.
  3. Change the directory to the location.
  4. Ensure you have execute permission on the files by typing the following command:
    chmod a+x sdkinst.bin
  5. Run this command:
    ./sdkinst.bin

    Follow the prompts on the command prompt.

  6. When prompted, provide the directory where you want to install the SDK.
    For example, /home/hadoop/BDQ_InstallPath.
    Attention: If you select a non-default folder as the installation directory, ensure that the length of the absolute installation path does not exceed 34 characters.
    The default installation path with 27 characters is admissible:
    /root/PBSpectrum_BigDataSDK 

    A pre-installation summary is displayed.

  7. Review the summary and press ENTER to continue with the installation.
  8. See the installation log file to verify that the Big Data Quality SDK has been installed correctly.
  9. When you are done, press ENTER to finish and exit the installer.

Once you have successfully installed the SDK on your machine, these folders are added in the install directory:

  • API
  • Documentation
  • modules
  • samples
  • utilities
Note: To use the jobs of Data Normalization Module, Universal Name Module or Universal Addressing Module, you must install the respective Reference Data for each module.