Spectrum Data Integration Guide

Content
Search Results
Loading, please wait ...

Loading

  • Welcome
  • Introduction
    • Enterprise Data Management Architecture
    • The Star Schema Data Warehouse Design
      • Advantages of a Star Schema
  • Connecting to Data Sources and Data Warehouses
    • Connections
    • Defining Connections
      • Connecting to Amazon
        • Connecting to Amazon DynamoDB
          • Amazon DynamoDB Limitations
        • Connecting to Amazon S3
          • Using Amazon S3 Cloud Encryption
        • Connecting to Amazon SimpleDB
          • Amazon SimpleDB Limitations
      • Connecting to Apache Cassandra
        • Apache Cassandra Limitation
      • Connecting to Azure Cloud
      • Connecting to Context Graph
        • Context Graph connector limitation
      • Connecting to a Flat File
        • Connecting to a Delimited Flat File
        • Connecting to a Fixed-Width Flat File
        • Date Time Formats in a File Connection
      • Connecting to an FTP Server
      • Connecting to an SFTP Server
      • Connecting to Google Cloud Storage
      • Connecting to Hadoop
        • Compression Support for Hadoop
      • Connecting to Hive
      • Connecting to a JDBC Database
        • Importing a JDBC Driver
        • Manually Adding a JDBC Driver
        • Deleting an Imported JDBC Driver
        • Supported Database Data Types
        • Supported Database Data Types for Spectrum Spatial
        • JDBC Database connector limitations
      • Connecting to Knox
      • Connecting to a Windows Mapped Drive
      • Connecting to Marketo
        • Marketo Limitations
        • Supported Entities and Operations
      • Connecting to Microsoft Dynamics 365
        • Connecting to Microsoft Dynamics 365 Online
        • Connecting to Microsoft Dynamics 365 On Premise
        • Limitations
        • Supported Entities and Operations
      • Connecting to a Model Store
      • Connecting to NetSuite
        • NetSuite Limitations
        • Supported Entities and Operations
      • Connecting to NoSQL
      • Connecting to Salesforce
        • Salesforce Limitation
      • Connecting to SAP NetWeaver
        • SAP NetWeaver Limitations
        • Supported Entities and Operations
      • Connecting to SharePoint
      • Connecting to Splunk
        • Splunk Limitations
        • Supported Entities and Operations
      • Connecting to SugarCRM
        • SugarCRM Limitations
        • Supported Entities and Operations
      • Connecting to Oracle Eloqua
        • Special Operations
        • Limitations
        • Supported Entities and Operations
    • Compression Support for Cloud File Servers
    • Deleting a Connection
  • Populating the Data Warehouse
    • Preparing Your Data
    • Populating a Time Dimension Table
    • Populating a Dimension Table
    • Populating a Fact Table
    • Adding a Time Stamp to Records in a Data Warehouse
  • Updating the Data Warehouse
    • Defining a Data Warehouse Update Schedule
    • Updating a Fact Table
    • Using a Global Cache for Queries
      • Deleting a Cache
    • Using a Local Cache for Queries
  • Accessing Stages Through Enterprise Designer
    • Call Stored Procedure
    • DB Change Data Reader
      • Adding a CDC Resource
      • Editing a CDC Resource
      • Deleting a CDC Resource
      • Selecting Change Data Reader Options
    • DB Loader
      • Oracle Loader
      • DB2 Loader
      • PostgreSQL Loader
      • Teradata Loader
    • Field Parser
    • Field Combiner
    • Field Selector
    • Generate Time Dimension
      • Options
        • Creating a Calendar
      • Output
    • Query Cache
    • Query DB
      • Parameterizing Query DB at Runtime
    • Query NoSQL DB
      • Defining Fields - Query NoSQL DB
      • Configuring Dataflow Options - Query NoSQL DB
    • Read From DB
      • Visual Query Builder
        • Adding Objects to a Query
        • Setting Object Aliases
        • Joining Tables
        • Selecting Output Fields
        • Sorting a Dataset
        • Defining Criteria
        • Grouping Output Fields
        • Defining SQL Query Properties
      • Query Variables
        • Inserting a Query Variable
        • Configuring a Query Variable as a Dataflow Option
        • Configuring a Query Variable for Job Executor
    • Read From File
      • Defining Fields In a Delimited Input File
      • Defining Fields In a Line Sequential or Fixed Width File
      • Sorting Input Records
      • The File Definition Settings File
      • Configuring Dataflow Options
    • Read from Hadoop Sequence File
      • Defining Fields In an Input Sequence File
      • Sorting Input Records
      • Filtering Input Records
    • Read From Hive File
      • Defining Fields for Reading from Hive File
    • Read from HL7 File
      • Flattening HL7 Data
      • Adding a Custom HL7 Message
    • Read from NoSQL DB
      • Defining Fields in a NoSQL Database
      • NoSQL DB Dataflow Options
    • Read from SAP
      • Connecting to SAP
      • Reading Data from a Single SAP Table
      • Reading Data from Multiple SAP Tables
      • Filtering Records in Read from SAP
    • Read from Spreadsheet
    • Read from Variable Format File
      • Defining Fields in Delimited Variable Format Files
      • Defining Fields in a Line Sequential or Fixed Width Variable Format File
      • Flattening Variable Format Data
    • Read From XML
      • Flattening Complex XML Elements
    • SQL Command
      • Specifying SQL Command at Runtime
      • Running A Job from the Command Line
      • Executing SQL Commands Before or After a Dataflow
    • Transposer
    • Unique ID Generator
      • Defining a Unique ID
        • Unique ID Definition Methods
      • Using Algorithms to Augment a Unique ID
      • Defining a Non-Unique ID
    • Write to Cache
      • Clearing a Global Cache
    • Write to DB
      • Database Connection Manager
      • Configuring Error Handling in Write to DB
    • Write to File
      • Defining Fields In a Delimited Output File
      • Defining Fields In a Line Sequential or Fixed Width File
      • Sorting Output Records
      • The File Definition Settings File
      • Configuring Dataflow Options
    • Write to Hadoop Sequence File
      • Defining Fields In an Output Sequence File
    • Write to Hive File
      • Defining Fields for Writing to Hive File
    • Write to NoSQL DB
      • Defining Fields in a NoSQL Database
      • NoSQL DB Dataflow Options
    • Write to Spreadsheet
      • Defining fields in an Output file
    • Write to Variable Format File
      • Writing Flat Data to a Variable Format File
      • Tag Names in Variable Format Files
    • Write to XML
      • Using Namespaces in an XML Output File
      • Creating Complex XML from Flat Data
    • Date and Number Patterns
      • Date and time patterns
      • Number Patterns
  • Accessing Stages through Flow Designer
    • DB Loader
      • Oracle Loader
      • DB2 Loader
      • PostgreSQL Loader
      • Teradata Loader
    • Field Selector
    • Generate Time Dimension
      • Configuring options
        • Creating a Calendar
      • Output
    • Read from DB
      • Visual Query Builder
        • Adding Objects to a Query
        • Joining Tables
        • Selecting Output Fields
        • Sorting a Dataset
        • Defining Criteria
        • Grouping Output Fields
        • Building a Query with SubQueries
          • Working With SubQueries
    • Read From File
      • Configure the Read from File Stage
      • Configure the File tab details - Read from File
      • Configure the Sort tab details - Read from File
      • Configure the Runtime tab details - Read from File
    • Read from Spreadsheet
    • Read From XML
      • Flattening Complex XML Elements
    • Write to DB
      • Configuring Error Handling in Write to DB
    • Write to File
      • Configuring the Output/Write to File stage
        • Configure the File tab details - Write to File
        • Configure the Sort tab details - Write to File
        • Configure the Runtime tab details - Write to File
    • Write to Spreadsheet
    • Write to XML
    • Query DB
    • Record Joiner
    • SQL Command
    • Unique ID Generator
      • Defining a Unique ID
      • Using Algorithms to Augment a Unique ID
      • Defining a Non-Unique ID
  • Configurations
    • Oracle LogMiner Configurations
    • Configuring HDFS Connection for HA Cluster
  • Optimizing Performance
    • Determining an Optimimum Fetch Size
    • Best Practices for connecting to HDFS 3.x and Hive 2.1.1