Você está na página 1de 12

Collection and Use of

Data

Session9

Data Design & Usage


IIPM

Relations in Data
2

1: M (one to many) 1:1 (one to one) M:M (many to many)

Data Normalization
3

Normalization is a process for evaluating and correcting table structures to minimize data redundancy.

It is a process involves assigning attributes to tables based on the concept of relational database In this,
Each

table represents single subject No unnecessary storage All fields are dependent of Primary Key

Database Lifecycle
4

Initial data study Database design Implementation and loading Testing & Evaluation Operations Maintenance and Evolution

Database Languages
5

SQL (STRUCTURED QUERY LANGUAGE)


DDL

(data definition language): includes commands for creation of database objects, tables, views, index, etc. DML (data manipulation language): includes commands to insert, update, delete, retrieve data in the database tables

Relations between Data


6

And OR Not Between Is Null Like In Exists

Drill Down & Rollup


7

Drill Down: Using OLAP if we go into minute details of data then it is called drilling down. Here data shown in more atomic format. Ex: date wise and employee wise sales report Rollup: Aggregated data shown at higher level. Summary report is an example of this. Ex: total sales report of Ahmedabad region for this week

Data Warehouse
8

An integrated, subject-oriented, time-variant, nonvolatile collection of data that provides support for decision making - Bill Inmon
Integrated:

data warehouse is centralized, consolidated database that integrates data derived from the entire organization Subject-Oriented: data are arranged and optimized to provide answers to questions coming from diverse functional areas within a company

Data Warehouse
9

Time-Variant:

Data focus on current transactions, warehouse data represent the flow of data through time. Nonvolatile: Once data is entered into database it is never removed.

10

Rules to define Data Warehouse


The data warehouse and operational environment are separated The data warehouse data are integrated Data warehouse contains historical data over a long time horizon Data warehouse data are snapshot data captured at a given point in time Data warehouse data are subject oriented No online updates are allowed

11

Rules to define Data Warehouse

It is a data driven process It has several levels of details They are read only data Data warehouse traces data from business sources Data warehouse metadata is a critical component of it. It is required to identify schema (relations) between data It contains chargeback mechanism for resource usage.

Thanks
12

Vatsal Shah Consultant & Mentor Pragmatic Consultancy 99240 68585

Você também pode gostar