SPICA

DATA SYSTEMS

Introduction
Informatica: An ETL Tool

Informatica

Informatica is an industry-standard, enterprise-level ETL (Extract, Transform, Load) tool.

Company Profile

Informatica Corporation is a NASDAQ-listed company with the ticker INFA. Founded in 1993, it is headquartered in Redwood City, California. Sohaib Abbasi, from Lahore, Pakistan, is the company's Chairman and CEO.

Informatica Application

Informatica is an open, scalable data integration solution that addresses the complete life cycle of data integration projects, including data warehouses, data migration, data synchronization, and information hubs.
It coordinates and drives a variety of core functions, including extracting, transforming, loading, and managing data.

It can extract large volumes of data from multiple platforms, handle complex transformations on the data, and support high-speed loads (ETL).

Informatica Components
PowerCenter domain
PowerCenter repository
Administration Console
PowerCenter Client
Repository Service
Integration Service
Web Services Hub
SAP BW Service
Data Analyzer
Metadata Manager
PowerCenter Repository Reports

Components of Informatica

Informatica Architecture
[Diagram: Source, Integration Service, Target; Admin Console (Web Server); Repository (Database Server); Repository Service; Client Tools (Designer/Workflow)]

PowerCenter Client

These client applications are used to manage the repository, design mappings and mapplets, and create sessions to load the data:
Designer: create mappings that contain transformation instructions for the Integration Service.
Data Stencil: create mapping templates to generate multiple mappings.
Repository Manager: create repository users and groups, assign privileges and permissions, and manage folders and locks.
Workflow Manager: create, schedule, and run workflows.
Workflow Monitor: monitor scheduled and running workflows for each Integration Service.

Repository Service

The Repository Service manages connections to the PowerCenter repository from client applications.
It is a separate, multi-threaded process that retrieves, inserts, and updates metadata in the repository database tables, and it ensures the consistency of metadata in the repository.

It accepts connection requests from the following PowerCenter applications:

PowerCenter Client
Command line programs
Integration Service
Web Services Hub
SAP BW Service

Integration Service

The Integration Service reads workflow information from the repository, connecting through the Repository Service to fetch the metadata, and then runs the workflow tasks.
It extracts data from the mapping sources and stores the data in memory. It then applies the transformation rules that you configure in the mapping to the data in memory. Finally, it loads the transformed data into the mapping targets.
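
The extract, transform, and load steps above can be illustrated with a minimal Python sketch. This is not Informatica code: `extract`, `transform`, `load`, `uppercase_name`, and the sample rows are hypothetical names chosen for illustration, and plain in-memory lists stand in for real sources and targets.

```python
from typing import Callable, Iterable

Row = dict  # one record, keyed by column name


def extract(source: Iterable[Row]) -> list[Row]:
    """Read every row from the source and hold a copy in memory."""
    return [dict(row) for row in source]


def transform(rows: list[Row], rule: Callable[[Row], Row]) -> list[Row]:
    """Apply one transformation rule to each in-memory row."""
    return [rule(row) for row in rows]


def load(rows: list[Row], target: list[Row]) -> int:
    """Append the transformed rows to the target and return the row count."""
    target.extend(rows)
    return len(rows)


def uppercase_name(row: Row) -> Row:
    """Example rule (an assumption): normalize the 'name' column."""
    return {**row, "name": row["name"].upper()}


source_rows = [{"id": 1, "name": "alice"}, {"id": 2, "name": "bob"}]
target_rows: list[Row] = []
loaded = load(transform(extract(source_rows), uppercase_name), target_rows)
```

Copying rows during extraction keeps the source untouched, so a failed transformation never leaves the source data half-modified.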

Source Definition

Source Type
Extract Data Definition (Data Requirements)
Data Extract Method (Push, Pull)
Data Extract Logic (New, Modified)
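
As an illustration of extract logic that separates new from modified records, here is a minimal Python sketch of a pull-style incremental extract. It assumes each source row carries `created_at` and `updated_at` timestamps and that the time of the previous run is known; these column names, `extract_changes`, and the sample data are hypothetical and not part of Informatica.

```python
from datetime import datetime


def extract_changes(rows, last_run):
    """Split source rows into those created and those modified since last_run."""
    new = [r for r in rows if r["created_at"] > last_run]
    modified = [
        r for r in rows
        if r["created_at"] <= last_run and r["updated_at"] > last_run
    ]
    return new, modified


last_run = datetime(2024, 1, 10)
rows = [
    # unchanged since the last run
    {"id": 1, "created_at": datetime(2024, 1, 1), "updated_at": datetime(2024, 1, 1)},
    # existed before the last run, updated afterwards
    {"id": 2, "created_at": datetime(2024, 1, 5), "updated_at": datetime(2024, 1, 12)},
    # created after the last run
    {"id": 3, "created_at": datetime(2024, 1, 11), "updated_at": datetime(2024, 1, 11)},
]
new_rows, modified_rows = extract_changes(rows, last_run)
```

Here row 3 is treated as new and row 2 as modified, while row 1 is skipped entirely.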

Target Definition

Target Type
Target Data Definition
Target Load Logic (Constraint based/Regions/Time Window)
Target Load Method (Bulk/Normal)
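
One common reading of constraint-based load logic is that parent (primary-key) tables are loaded before the child tables that reference them. The sketch below derives such an order with Python's standard `graphlib` module (Python 3.9+). The table names and the `load_order` helper are hypothetical, and this shows the ordering idea only, not how Informatica implements it.

```python
from graphlib import TopologicalSorter


def load_order(foreign_keys):
    """Return target tables ordered so each parent precedes its children."""
    # Keys are tables; values are the tables they reference (their parents).
    return list(TopologicalSorter(foreign_keys).static_order())


foreign_keys = {
    "customers": set(),          # no foreign keys
    "orders": {"customers"},     # orders reference customers
    "order_items": {"orders"},   # order items reference orders
}
order = load_order(foreign_keys)
```

For this chain of dependencies the result is `customers`, then `orders`, then `order_items`.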

Transformation Definition

Transformation Logic
Input/Output Data Definition
Other Variables
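
The three parts of a transformation definition (logic, input/output data definition, and other variables) can be pictured as a small data structure. The Python sketch below is an illustration under assumptions: `TransformationDef`, `add_tax`, and the column names are hypothetical and do not correspond to any Informatica API.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class TransformationDef:
    name: str
    inputs: list[str]                     # input data definition (column names)
    outputs: list[str]                    # output data definition (column names)
    logic: Callable[[dict, dict], dict]   # transformation logic
    variables: dict = field(default_factory=dict)  # other variables

    def apply(self, row: dict) -> dict:
        """Check the inputs, run the logic, and keep only the declared outputs."""
        missing = [c for c in self.inputs if c not in row]
        if missing:
            raise KeyError(f"missing input columns: {missing}")
        result = self.logic(row, self.variables)
        return {c: result[c] for c in self.outputs}


def add_tax(row, variables):
    """Example logic (an assumption): derive a gross amount from a net amount."""
    return {"id": row["id"], "gross": row["net"] * (1 + variables["tax_rate"])}


gross_price = TransformationDef(
    name="gross_price",
    inputs=["id", "net"],
    outputs=["id", "gross"],
    logic=add_tax,
    variables={"tax_rate": 0.25},
)
result = gross_price.apply({"id": 7, "net": 100.0})
```

Declaring inputs and outputs separately from the logic lets a malformed row be rejected before the logic runs.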

Start Web Server (For Admin Console)

Start Informatica Services

Designer Example Mapping

[Screenshot: example mapping with Source, Transformation, and Target]

Debugger Run

[Screenshot: Debugger run showing the Output Window, Target Instance, and Instance Window]

Workflow Manager

Workflow Monitor

Repository Manager
