Escolar Documentos
Profissional Documentos
Cultura Documentos
DATA SYSTEMS
Introduction
Informatica: An ETL Tool
SPICA
DATA SYSTEMS
Informatica
Informatica is industry standard enterprise level ETL tool: Extract Transform Load
SPICA
DATA SYSTEMS
Company Profile
Informatica Corporation is a NASDAQ listed company with ticker INFA. Founded in 1993, its headquarters is in Redwood City, California Sohaib Abbasi from Lahore Pakistan is the company's Chairman and CEO
SPICA
DATA SYSTEMS
Informatica Application
An open, scalable data integration solution addressing the complete life cycle for all data integration projects including data warehouses, data migration, data synchronization, and information hubs.
coordinates and drives a variety of core functions, including extracting, transforming, loading, and managing data
It can extract large volumes of data from multiple platforms, handle complex transformations on the data, and support high-speed loads. (ETL)
SPICA
DATA SYSTEMS
Informatica Components
Web Services Hub SAP BW Service Data Analyzer Metadata Manager PowerCenter Repository Reports
PowerCenter domain PowerCenter repository Administration Console PowerCenter Client Repository Service Integration Service
SPICA
DATA SYSTEMS
Components of Informatica
SPICA
DATA SYSTEMS
Informatica Architecture
Integration Service
Source
Target
Repository
SPICA
DATA SYSTEMS
PowerCenter Client
These client applications are used to manage the repository, design mappings, mapplets, and create sessions to load the data Designer create mappings that contain transformation instructions for the Integration Service. Data Stencil. create mapping template to generate multiple mappings. Repository Manager. create repository users and groups, assign privileges and permissions, and manage folders and locks Workflow Manager create, schedule, and run workflows. Workflow Monitor monitor scheduled and running workflows for each Integration Service.
8
SPICA
DATA SYSTEMS
Repository Service
The Repository Service manages connections to the PowerCenter repository from client applications.
a separate, multi-threaded process that retrieves, inserts, and updates metadata in the repository database tables. ensures the consistency of metadata in the repository.
SPICA
DATA SYSTEMS
Integration Service
The Integration Service reads workflow information from the repository. The Integration Service connects to the repository through the Repository Service to fetch metadata from the repository. The Integration Service runs workflow tasks.
It extracts data from the mapping sources and stores the data in memory It applies the transformation rules that you configure in the mapping to the data in memory. The Integration Service loads the transformed data into the mapping targets.
10
SPICA
DATA SYSTEMS
Source Definition
Source Type Extract Data definition (Data Requirements) Data Extract Method (Push, Pull) Data Extract Logic (New, Modified)
11
SPICA
DATA SYSTEMS
Target Definition
Target Type Target data definition Target Load Logic (Constraint based/Regions/Time Window) Target load method (Bulk/Normal)
12
SPICA
DATA SYSTEMS
Transformation Definition
13
SPICA
DATA SYSTEMS
14
SPICA
DATA SYSTEMS
15
SPICA
DATA SYSTEMS
Source Target
16
Transformation
SPICA
DATA SYSTEMS
Debugger Run
Output Window
Target Instance
Instance Window
17
SPICA
DATA SYSTEMS
Workflow Manager
18
SPICA
DATA SYSTEMS
Workflow Monitor
19
SPICA
DATA SYSTEMS
Repository Manager
20