Escolar Documentos
Profissional Documentos
Cultura Documentos
Chapter Outline
3.1 Data, Master Data, and Data Management
3.2 File Management Systems
3.3 Database Management Systems
3.4 Data Warehouses, Data Marts, and Data
Centers
3.5 Enterprise Content Management
3.6 Managerial Issues
Learning Objectives
1. Describe how data and document management
impact profits and performance.
2. Understand how managers are supported or
constrained by data quality.
3. Discuss the functions of databases and
database management systems.
4. Understand how logical views of data provide a
customized support and improve data security.
5. Describe the tactical and strategic benefits of
data warehouses, data marts, and data
centers.
3
IT-Performance Model
Business uncertainty
Problem
Another problem is that it is difficult to bring together
huge quantities of data located in different databases in
a way that creates value.
Without efficient processes for managing vast amounts
of customers data and turning data into usable
knowledge, companies can miss critical opportunities to
find insights hidden in the data.
IT Solution
Results
Management can collect and analyze detailed data in
near real-time using EDW.
Regional managers can select the best menu offerings
and operate more efficiently.
From customer satisfaction surveys, be able to
identify regional preferences, predict product demand
and build financial models.
Improved customers experience, satisfaction, and
loyalty
Increase earnings
Total sales increased by 3.9% over prior year and
opened 16 new restaurants.
11
Results: Improved
profitability.
13
14
15
Data Management
Basic rule is that, to maximize earnings, companies
invest in data management technologies that increase
The opportunity to earn revenues. (Customer
relationship management)
The ability to cut expenses (Inventory
management)
To improve business processes and performance,
managers and decision makers need rapid access to
data.
16
Data Management
Data management is about the design of data
infrastructures to provide employees with
complete, timely, accurate, accessible,
understandable, and relevant (
) data.
By definition, Data management is a structured
approach for Capturing, Storing, Processing,
Integrating, Distributing, Securing, and Archiving
data EFFECTIVELY throughout their life cycle.
17
18
20
21
24
Data Visualization
To format data into meaningful
contexts for users, businesses
employ data visualization and
decision support tools.
Present data in ways that are
faster and easier for users for
users to understand.
Data visualization tools are less
expensive and easier to
manipulate.
Read A Closer Look 3.2. The
Royal Shakespeare.
26
27
28
29
30
33
34
36
Document Management
Business records contracts, research, accounting
documents, memos, customer/client communications
and meeting minutes.
Document Management is the automated control of
imaged and electronic documents, spreadsheets, voice
and email, word processing documents from INITIAL
creation to FINAL archiving or destruction.
Document management system (DMS) consist of HW,
SW that manage and archive E-documents and convert
paper document to E-documents and then index and
store them according to policy. Eg. Keep emails for 7
years, Promotions for 1 year and then discards.
38
DMS
DMS have query and search capabilities so they
can be identified and accessed like data in a
database.
Help reduce costs paperless office
DMS help business become more efficient and
productive by
Enabling the company to access and use the content
contained in the documents
Cutting labor cost by automating business processes
Reducing time and effort required to locate information
needed to support decision making
Improving the security of the content, reducing risk of
intellectual property theft
Minimizing the costs associated with printing, storage,
39
and searching for content
DMS
Major Document Management tools are
workflow software (), authoring
tools (
), scanners, databases.
When workflow are digital, productivity
increases, costs decrease, green computing
become possible. (Conserve our valuable
natural resources by reducing the effects of
our computer usage on the environment)
40
41
Figure 3.5
Example of primary and foreign keys.
42
Figure 3.6
Hierarchy of data for a computer-based file.
43
Figure 3.7
Indexed sequential access method
45
Limitations
Data Redundancy Different programmers create
different data-manipulating applications, the same data
could be duplicated in several files.
Data inconsistency Actual data values are not
synchronized across various copies of data. For
example, Customers with serveral loans, for each loan
there is a file containing customer fields (name, address,
email, phone), then a change to a customerss address
in only one file creates inconsistencies.
Data isolation File organization creates silos of data
that make it extremely difficult to access data from
different applications. For example, wants to know which
product customers bought and which customers own
more than 1000, To get results, have to filter and
integrate data manually from multiple files.
46
Limitations
Data security Securing data is difficult in the file
environment because new applications are added to the
system. As the number of applications increases, so
does the number of people who can access data
Lack of data integrity In the file environment it is harder
to enforce data integrity rules (),
which include preventing data input errors, eg. SSN
Data concurrency At the same time, one is updating
record, another may be accessing that record cant
get the most current update. To prevent a concurrency
problem, applications and data need to be independent
of one another. In file environment, they are dependent.
47
Figure 3.8
49
50
Centralized Databases
Stores all related files in one location More
consistent
Files are not accessible except via the
centralized host computer, where they can be
protected more easily from unauthorized access
or modification.
Vulnerable () to a single point of failure.
Computer fails, all users are affected.
When users are widely dispersed and must
perform data manipulations from distances, they
often experience transmission delays.
51
Distributed Databases
A replicated database store complete copies of the entire
database in multiple location. This arrangement provides
backup in case of a failure of problems with the
centralized database.
Improve the response time (local users)
Much more expensive to set up and maintain because
each replica must be updated as records are added to,
modified in and deleted from any of the databases.
The updates may be done at the end of the day,
otherwise the various databases will contain conflicting
data.
52
Distributed Databases
A partitioned database is divided up so that each
location has a portion of the entire database
usually the portion that meets users local needs.
Provide response speed of localized files
without the need to replicate all changes in
multiple locations.
Advantage: data can be entered more quickly
and kept more accurate by the users
immediately responsible for data
53
DBMS
A program that provide access to
databases
DBMS permits an organization to
centralize data, manage them efficiently
and provide access to the stored data.
Range from simple Microsoft Access to
full-featured Oracle and DB2
54
55
56
Data Warehouses
DW is a repository () (a type of
database) in which data are organized so that
they can be readily analyzed using methods
such as data mining, decision support, querying
and other applications.
Examples are revenue management, CRM,
Fraud detection, payroll-management
applications
Databases are designed and optimized to store
data whereas data warehouses are designed
and optimized to respond to analysis questions
that are critical for a business.
58
Data Warehouses
Databases are online transaction processing (OLTP)
systems in which every transaction is recorded quickly.
For example, withdrawns from a bank ATM, these
transactions must be recorded and processed as they
occur in real-time. Databases systems for banking are
designed to ensure that every transaction get recorded
immediately.
Database are volatile () because data
are constantly added, edited or updated.
The volatility caused by transaction processes makes
data analysis too difficult.
To overcome this, data are Extracted from designated
databases, Transformed and Loaded into a data
warehouse.
These data are read-only data. They remain the same
until the next scheduled ETL.
Warehouse data are not volatile so data warehouse are
designed as online analytical processing (OLAP) system 59
61
Example :: Charles
He is calling the customer service center because of
frequent dropped cell calls.
Through the call center application (attached to the
active data warehouse), the CSR accesses not only the
complete history of Charless calls to the company but
also a full view of all services to which he subscribes
DSL, Internet, Cellular along with his customer
profitability score, which lets the CSR know how
profitable he is to the company.
Called Intelligent Customer Service.
The company try to minimize the risk of losing him as a
customer.
62
63
67
11
69
Suitability
Data WH is appropriate for organizations that have some of
following characteristics
End users need to access large amount of data
Operational data are stored in different systems
Organization employ an information-based approach to
management.
Organization serves a large, diverse customer base.
Same data are represented differently in different
systems
Data are store in highly technical formats that are difficult
to decipher.
Extensive end-user computing is performed (many end
users performing many activities)
70
(Data Warehouse)
( )
(Data Base)
( )
(data marts)
71
schema
(schema
primary
key, foreign key )
schema
.
72
schema
()
primary key
( )
(query)
73
74
Data marts
Data warehouse Too expensive for a company
to implement
As an alternative, many firms create a lowercost, scaled down version of a data warehouse
called a data mart.
Data marts are designed for a strategic business
unit, or a single department
Allow for local rather than central control.
Contain less information than Data WH
Respond more quickly and easier to understand
75
Multidimensional Databases
From tables to cubes
A data WH can be based on a
multidimensional model called a
multidimensional database that views data
in the form of a data cube.
76
Example .
nuts, screws,
bolts Washers
(East), (West) (Central)
2001, 2002 2003
(Relational database)
(Product, Region Sales)
Screw
77
78
80
81
82
Data Center
is the name given to the newer facilities
containing mission-critical ISs and components
that deliver data and IT services to the enterprise.
Integrate networks, computer systems and
storage devices.
Insure the availability of power and provide
physical and data security.
Newest data centers include temperature and fire
controls, physical and digital security, redundant
power supplies as uninterruptible power sources
(UPS), redundant data communications
connections.
83
4 key forces
4 key forces are driving organizations to adopt a strategic,
enterprise-level approach to planning and deploying content
systems ( Content Systems).
Compounding growth of content generated by
organizations
The need to integrate that content within business
processes
The need to support increasing sophistication (
) for business user content access and interaction.
The need to maintain governance () and
control over content to ensure regulatory compliance and
preparedness for legal discovery (
).
85
Discovery or
the request for information
Nearly 90 percent of US corporations become engaged
in lawsuits
The average $1 billion company in US faces 147
lawsuits
Each lawsuit will involve discovery or the request for
information (almost always involves the request for email
and other electronic communications).
86
Discovery
Discovery is the process of gathering information in
preparation for trial, legal or regulatory investigations, or
administrative action as required by law.
Electronic information is involved; the process is called
E-Discovery.
Serveral cases where a company incurred huge costs for
not responding to e-discovery are followings:
Failure to save emails results in a $2.75 million fine for Phillip
Morris
Failure to respond to e-discovery requests cost Bank of America
$10 million
Failure to produce backup tapes and deleted emails results in a
$29.3 million jury verdict against USB Warburg.
87
Managerial Issues
Reducing uncertainty Requires a data infrastructure
that can capture, process and report information in near
real-time.
Cost-benefit issues and justification Some solutions
are expensive and justifiable only in large corporations.
Smaller ones can make solutions cost effective if they
make use of existing databases rather than creating new
ones.
Where to store data physically Should data be
distributed close to their users? This arrangement could
speed up data entry and updating, but it could also
generate replication and security risks. Should data be
centralized for easier control, security and disaster
recovery? This offers fewer communications and singlepoint-of failure risks.
88
Managerial Issues
Legal issues Failure to manage electronic records
exposes companies to fines from the courts and regulatory
agencies such as IRS.
Internal or External Should a firm invest internally or
externally?
Disaster recovery Can an organizations business
processes (dependent on databases and data WH)
recover after an information system disaster?
Data security and ethics Are the companys customer
and other competitive data safe from snooping (
) and sabotage (
)? Are confidential
data, such as personnel details safe from improper or
illegal access?
89
Managerial Issues
Privacy Storing data in a WH, and conducting data
mining may result in the invasion () of individual
privacy. What will the company do to protect individuals?
The legacy data problem Data in older, perhaps
obsolete, databases still need to be available to newer
database management systems. Many of legacy
application programs used to access the older data
cannot be converted into new computing environment
without expense. Two approaches to solve this problem
Create a database front end that can act as a translator from the
old system to the new.
Integrate the older applications into the new system so that data
can be seamlessly accessed in the original format.
90
Managerial Issues
Data Delivery Moving data around an enterprise
efficiently is often a major problem.
91
Quiz
What is the goal of data management?
Explain how having detailed real-time
or near real-time data can improve
productivity and decision quality.
How are organizations using their data
warehouses to improve consumer
satisfaction and the companys
profitability?
92