Escolar Documentos
Profissional Documentos
Cultura Documentos
DATA: The term data referred to known as facts that could be recorded and stored on computer
media.
Data consists of facts, text, graphics, images, sound, and video segments that have
meaning in the user environment.
Cust name
Facts Address
Phone no
Text Documents
Numeric Digits
images Photos
INFORMATION : Data that have been processed in such a way as to increase the
knowledge of the person who use the data
EX:
Mon 4th SQL programming
Tue 1st SQL programming
Course id and
Course Alphanumeric 30
name
Here data type, length, minimum and maximum values are consider as metadata
It means metadata describes the properties of data but do not include that data.
1. PROGRAM DATA DEPENDENCE: File descriptions are stored with in each application
program that accesses a given file. If any change to file structure requires changes to the file
description for all programs that access the file.
For ex suppose it is decided to change the customer address field length in the records from 30 to
40 characters the file descriptions in each program it is effected would have to be
modified. It is often difficult even to locate all programs effected by such changes
3. LIMITED DATA SHARING: In the traditional file processing system each application has
its own private files and users have little opportunity to share data outside their own application
for ex users in the accounting dept have access to the invoicing system and its files but they do
not have access the Orderfilling system or the pay roll system. It is very difficult to find a
requested report from several incompatible files in separate systems.
4.LENGTHY DEVELEPMENT TIMES: In the traditional file processing system there is little
opportunity to use previous development efforts. Each new application requires new file formats
and descriptions. So writing the file access logic for each new program required lengthy
development times.
Disadvantages: The data cannot be easily shared with other users. These are limited for very
small organizations
All the workgroup members are linked by a local area network (LAN). The database is
stored on a central device called the database server is also connected to the network. Different
types of group members (developer or project manager) may have different user views of the
shared database
All ERP systems are heavily dependent on databases to store the data required by the ERP
applications.
DATA WAREHOUSE: An integrated decision support database whose content is derived from
the various operational databases.
Data warehouses collect their content from various operational databases including personal,
workgroup and department databases. Data warehouses provide user to work with
historical data.
INTERNET: It is a global network of public computers connects users of multiple plat forms
Telephone wire, cable and satellite connect millions of computers around the world to
each other.
WEB BROWSER: A worldwide network that connects users of multiple platforms easily
through an interface known as a web browser
INTRANET: Use of Internet protocols to establish access to company data and information that
is limited to the organization.
EXTRANET: use of Internet protocols to establish limited access to company data and
information by the company’s customers and suppliers.
The separation of data descriptions (metadata) from the application programs that use the data is
called data independence. With the database approach data descriptions are stored in a
central location called the repository. This property of the database system allows an
organization’s data to change with out changing the application programs that process the
data
USER VIEW
A logical description of some portion of the database that is required by a user to perform some task.
A major advantage of the data base approach is that it greatly reduce the cost and time for
developing new business applications.
Assuming that data database and the related data capture and maintenance applications have
already been designed and implemented the programmer can concentrate on the specific
functions required for new applications.
ENFORCEMENT OF STANDARDS
When the database approach is implemented with full management the database
administration function should be granted single point authority and responsibility for
establishing and enforcing standards. These standards will include naming conventions, data
quality standards, uniform procedures for accessing, updating, and protecting data.
Select * from product where product_name= ”computer desk” is SQL command to display the
information about computer desks
Data base approach need to hire and train individuals to design and implement databases,
provide database administration services and manage a staff of new people. Because of the rapid
changes in technology these new people will have to be retrained or upgraded on a regular basis.
A Multiuser database management system is a large and complex suite of software that has high
initial costs, requires a staff of trained personnel to install and operate. Installing such a system
may also require upgrades to the hardware and data communication systems in the organization
Substantial training is normally required on an ongoing basis to keep up with new releases and
upgrades.
CONVERSION COSTS:
The term legacy system is widely used to refer to older applications in an organization
that ate based on file processing system. The cost of converting these older systems to modern
database technology measured in terms of dollars, time and organizational commitment.
ORGANAIZATIONAL CONFLICT:
A shared database requires a consensus on data definitions and ownership as well as
responsibilities for accurate data maintenance. Experience has shown that conflicts on data
definitions, data formats and coding, rights to update shared data. Handling these issues requires
organizational commitment and organizationally astute database administrators
5. APPLICATION PROGRAM
Computer programs that are used to create and maintain the database and provide
information to users
6. USER INTERFACE
Languages, menus, and other facilities by which users interact with various system
components, such as CASE tools, application programs, the DBMS and the repository
7. DATA ADMINISTRATORS
Persons who are responsible for the overall information resources of an organization.
Data administrators use CASE tools for system requirements analysis and program design
8. SYSTEM DEVELOPERS:
Persons such as systems analysts and programmers who design new application
programs. System developers often use CASE tools for system requirements analysis and
program design
9. END USERS
Persons throughout the organization who add, delete, and modify data in the database and
who request or receive information from it. All user interactions with the database must
be routed through the DBMS.
THE DATA BASE DEVELEPMENT PROCESS
ENTERPRISE DATA MODELING: The first step in database development, in which the
scope and general contents of the organizational databases are specified
Fig 2.1 page no 37
INFORMATION ENGENEERING
It is a data oriented methodology to create and maintain information systems. Because of
the data orientation information engineering can be helpful how databases are identified and
defined. Information engineering follows top down planning
For ex the problem area of in accurate sales forecast might cause information system managers
to place additional historical sales data, new market research data or data concerning results from
test trails of new products in organizational databases
1.Organizational units
The various departments of the organization
Ex: sales, orders, accounting, manufacturing
2.Organizational locations
The places where business operations occur
Ex: corporate head quarters, Durango plant, western regional sales office, Lumber mill
3. Business functions
A related group Business process that support some aspect of the mission of an
enterprise
Ex: Business planning, product development materials management Marketing and sales
4. Entity types
Major categories of data about the people, places and things managed by the organization
5. Information systems: The application software and supporting procedures for handling
sets of data
An enterprise data model shows not only the entity types but also the relation ships
between data entities. A common format for showing the interrelation ship between planning
objects is matrixes
UNIT TO FUNCTION: identifies which business functions are performed by or are the
responsibility of which business units
SUPPORTING FUNCTIONS TO DATA ENTITY: Identifies which data are captured, used,
updated, or deleted with in each function
A traditional process for conducting an information systems development project is called the
systems development life cycle. The SDLC is a complete set of steps that a team of
information systems professionals including database designers and programmers. They
are used specify, develop, maintain, and replace information systems.
PHYSICAL SCHEMA:
A physical schema contains the specifications for how data from a conceptual schema are
stored in a computer’s secondary memory
1. CLIENT TIER: A desktop or laptop computer, which concentrates on managing the user
system interface and localized data also called the presentation tier. Web scripting tasks may
be executed on this tier.
Three tired architecture for databases and information related to the concept of client/server
architecture
It is possible to take advantage of the best data processing features of each computer platform.
You can mix client technologies and share common data
MODELING DATA IN THE ORGANAIZATION
Represent those rules so that they can be unambiguously understood by information systems
developers and users
Ex: A student may register for a section of a course only if he or she has successfully completed
the prerequisites for that course
A preferred customer qualifies for a 10 percent discount unless he has an overdue account
balance
CHARACTERISTIC EXPLANATION
A business rule is a statement of policy, not how policy is enforced or
DECLARATIVE conducted. The rule does not describe a process or
implementation, but rather describes what a process validates
With the related organization, the rule must have only one interpretation
PRECISE
among all interested people, and its meaning must be clear
A business rule marks one statement, not several, no part of the rule can
ATOMIC
stand its own as a rule(the rule is indivisible)
A business rule must be internally consistent. That is not containing
CONSISTENT
conflicting statements.
A business rule must be able to be stated in natural language, but it will be
EXPRESSIBLE stated in a structured natural language, so that there is no
misinterpretation
DISTINCT Business rules are not redundant, but a business rule may refer to other rules
A business rule is stated in terms business people can understand, and since
BUSINESS it is a statement of business policy, only business people can
ORINETED modify or invalidate a rule. Thus a business rule is owned by
business
DATA NAMES
RELATE TO BUSINESS, NOT TECHNICAl (HARDWARE OR SOFTWARE)
CHARACTERISTICS
Customer is a good name but file10, bit7 and payroolreportsortkey are not good names
BE MEANINGFUL: The data name should be meaningful for documentation purpose. So avoid
using generic words like “has” , “is”, “person” , “it”
BE UNIQUE: The name used for every distinct object words should be included in the data
name. Ex: Home address, campus address
REPEATABLE:
Different people or the same person at different times should develop exactly or almost
the same name. This means there is a standard hierarchy for data names
Ex: Birth date of a student would be StudentBirthDate
Birth date of a employee would be EmployeeBirthDate
A customer may request a model of car from a rental branch on a particular date
Here model, rental, request associates the four underlined terms
THE ER MODEL
ENTITY RELATIONSHIP MODEL: (E-R MODEL): A logical representation of the data for an
organization or for a business area. The E-R model is expressed in terms of entities in the
business environment.
ORDER: The transaction associated with the sale of the one or more products to a customer and
identified by a transaction number from sales or accounting
ITEM: A type of component that goes into making one or more products and can be supplied by
one or more supplies
Ex: ball bearing
SUPPLIER: Another company that may provide items to pine valley furniture
SHIPMENT: The transaction associated with items received in the same package by pine valley
furniture from a supplier.
A SUPPLIER may supply many ITEMS (by “may supply “ we mean the supplier may not supply
any items). Each ITEM is supplied by any number of SUPPLIERS.
(BY is supplied we mean must be supplied by at least one supplier.
Each item must be used in the assembly of at least one PRODUCT and may be used in many
products. Conversely each product must use one or more items.
A SUPPLIER may send many SHIPMENTS. On the other hand each shipment must be sent by
exactly one SUPPLIER. A supplier may be able to supply an item, but may not yet have
sent any shipments of that item.
A shipment must include one or more ITEMS. An ITEM may be included on several
SHIPMENTS.
A CUSTMOER may submit any number of orders. However each order must be submitted by
exactly one customer.
An ORDER must request one (or more) PRODUCTS. A given PRODUCT may not be requested
on any order
ENTITIES: An entity is a person, place, object, event or concept in the user environment
about which the organization wishes to maintain data.
Ex:
Person : EMPLOYEE, STUDENT, PATIENT
Place : STORE,WAREHOUSE, STATE
Object : MACHINE, BUILDING, AUTOMOBILE
Event : SALE, RGISTRATION, RENEWAL
Concept : ACCOUNT, COURSE, WORK CENTER
Empno char(10)
Name char(25)
Address char(30)
City char(10)
STRONG ENTITY TYPE: An entity that exists independently of other entity types
Ex: employee, student, automobile and course
WEAK ENTITY TYPE: An entity type whose existence depends on some other entity type
Ex: Class
Class cannot be uniquely identified without a course number
Weak Entities are indicated by double lined rectangle
IDENTIFYING OWNER: The entity type on which the weak entity type depends
IDENTIFYING RELATION SHIP: The relation ship between a week entity type and its owner
1. SINGULAR NOUN: An entity type name is singular noun (Such as customer student or
automobile)
3. CONCISE: An entity type name should be concise using as few words as possible. For
example in a University database an entity type REGISTRATION for the event of a student
registering for a class. Sufficient name for this entity type is student registration for class
5. Event entity types should be named for the result of event. The event of a project manager
assigning an employee to work on a project is an ASSIGNMENT.
In naming attributes we use initial capital letter followed by lower case letters. If an attribute
name consists of two words we use an underscore character to connect the words and we start
each word with a capital letter for ex: employee_Name
In ER diagram we represent an attribute by placing its name in an ellipse with a line connecting
to its associative entity
Entity I
Student_Id = 455
Student_Name = Smith
Home_Adress = 452 walnut street
Phone =303-839
Major =DBMS
Entity 2
Student_Id = 555
Student_Name = Thoms
Home_Adress = 944 mapel street
Phone =631-391
Major =VCPP
COMPOSITE ATTRIBUTE: An attribute that can be broken down into component parts
Ex: Address
It can be broken down into Street_adress, city, State and Postal_code
SIMPLE ATTRIBUTE: An attribute that can not be broken down into smaller components
Ex: Color, Weight
MULTI-VALUED ATTRIBUTE: An attribute that may take on more than one value
for a given entity instance.
DERIVED ATTRIBUTE: An attribute whose values can be calculated from related attribute values
For ex: The employee entity has a Date_Employed attribute. If users need to know how many years a person has been employed that
value can be calculated using date_employed and today’s date
Student Student_id
Automobile Vehicle_Id
Student_Id is not a identifier because many students have same name or may change their
namee. It is underlined in the E-R diagram
Student_Nam Other_Attribute
Student_Id e s
STUDENT
Entity flight
Composite identifier: Flight _Id
Flight_Id in turn has component attributes Flight_number and Date. This combination is required
to uniquely identify individual occurrences of Flight.
2. Unique: An attribute name should be unique. No two attributes of the same entity type may
have the same name. For clarity purpose no tow attributes across all entity types have the
same name
Ex: cust_id
CLASS: class is a phrase from list of phrases defined by the organization that are permissible
characteristics of entities
Ex: Name Nm
Identifier ID
Date Dt /*entities*/
Amount Amt
A relationship type: A relationship type is a meaningful association between (or among) entity
types.
Relationship instances: An association between (or) among entity instances where each
relationship instance includes exactly one entity from each participating entity type
Attributes on relationships: Attributes may be associated with a many to many (or one to one)
relationship as well as with an entity. Suppose an organization wishes to record the date (month
and year) when an employee completes each course.
Relationship instances:
Employee Course
Chen C++
Melton Java
Ritche COBOL
Celko Basic
Gosling SQL
Associative entity: An entity type that associates the instances of one or more entity types and
contains attributes that are peculiar to the relationship between those entity instances
The associative entity is represented with the diamond relationship symbol encloses within the
entity box. The purpose of this symbol is to preserve the information that the entity was initially
specified as a relationship on the E-R diagram
Employee
EMPLOYEE A CERTIFI B COURSE
CATE
Degree of a relationship: The degree of a relationship is the number of entity types that
participate in that relationship.
The three most common relationship degrees in E-R models are Unary (degree1), Binary
(degree2) and ternary (degree3)
Unary Relationship: A relationship between the instances of a single entity type. Unary relation
ships are also called recursive relationships). Is_married_to is shown as a one to one relation ship
between instances of the PERSON entity type.
Ex1: Indicates (one to one) that an employee is assigned one parking place and each parking
place is assigned to only one employee
Ex2: indicates (one to many) that a product line may contain several products and each product
belongs
Ex3: (Many to Many) that a student may register for more than one course and that each course
may have many student registrants.
Ternary relation ship: A ternary relation ship is a simultaneous relationship among the instances
of three entity types. In this example vendors can supply various parts to warehouses. The
relationship supplies is used to record the specific parts that are supplied by a given vendor to a
particular warehouse
Thus there are three entity types. VENDOR, PART and WAREHOUSE
There are two attributes on the relationship supplies Shipping mode and Unit_cost
For ex considers a VIDEOSTORE that rents videotapes of movies. Since the store may
stock more than one video tape for each movie. The store may not have any typess of a given
movie in stock at a particular time.
MINIMUM CARDINALITY: The minimum number of instances of one entity that may be
associated with each instance of another entity.
In our VIDEOTAPE the minimum number of videotapes for a movie is zero. When the
minimum number of participants is zero we say the entity type b is an optional participant in the
relationship.
In our VIDEOTAPE example the maximum cardinality for the VIDEOTAPE entity is
“Many”. That is an unspecified one greater than one. This is indicated by the “ crows foot”
symbol on the arrow next to the VIDEOTAPE
MANDATORY ONE: The minimum and maximum are both one. This is called mandatory one
cardinality. In other words each videotape of a movie must be a copy of exactly one movie. If the
minimum cardinality is zero participation is optional
A relationship name
2. Avoid vague names: you should avoid vague names such as Has or Is_related_to. Use
descriptive verb phases found in the definition of relationship
4. It may also be important to give examples to clarify the action. For ex for a relationship of
registered for between student and course. It may be useful to explain that this covers both on
site and on line registration and includes registration s made during the drop/add period.
5. Optional participation: The definition should explain any optional participation. You should
explain what condition lead to zero associated instances.
Subtype: a subgrouping of the entities in an entity type that is meaningful to the organization and
that shares common attributes or relationships distinct from other sub groupings
Ex: graduate student, undergraduate student,
Supertype: a generic entity type that has relationship with one or more subtypes
Ex: student
Attribute inheritance: attribute inheritance is the property by which subtype entities inherit values
of all attributes of the supertype. This property makes it unnecessary to include all supertype
attributes redundantly with the subtypes.
For ex Employee_name is an attribute of EMPLOYEE but not of the sub types of employee.
Whether to use supertype/subtype relations ships are not is a decision that the data modeler
must make in each situation. You should consider for the following conditions are present
1. There are attributes that apply to some of the instances of an entity type
2. The instances of a subtype participate in a relation ship unique to that subtype.
3. Fig 4.3
Page 131
The hospital entity type PATIENT has two sub types. OUT PATIENT and RESIDENT
PATIENT. The primary key is Patient_ID. All patients have an Admit Date attribute as well as
Patient_name also. Every patient is carded by a responsible physician who develops a treatment
plan for the patient.
Each sub type has an attribute that is unique to that sub type. Out patients have a check back
date. While resident patients have a Date_Discharged. Resident patients have a unique relation
that assigns each patient to a bed. Each bed may or may not assigned to a patient.
According to attribute inheritance each out patient and resident patient inherits the attributes of
the parent supetype PATIENT. Patient_Id, Patient_Name, and Admit_date
Generalization: The process of defining a more general entity type from a set of more specialized
entity types. Thus generalization is bottom up process.
In the above example three entity types have been defined. CAR, TRUCK and MOTOR
CYCLE. We have observed that three entity types have a number of attributes in common
Vehicle_Id(identifier), Vehicle_Name( with components make and model), price,
Engine_Displacement. This fact suggests that each of the three entity types is really a version of
generalization.
There is more general entity type named VEHICLE. The entity CAR has the specific attribute
NO_of_passengers, while TRUCK has two specific attributes Capacity and Cab._Type. Thus
generalization has allowed us to group entity types along with their common attributes
The entity MOTOR CYCLE is not included in the relation ship it does not satisfy the condition
for a subtype because attributes of MOTOR CYCLE are common to all vehicles. There are no
attributes specific to motor cycle. Further MOTOR CYCLE does not have a relation ship to
another entity type. Thus there is no need to create a MOTOR CYCLE subtype.
Specialization:
The process of defining one or more sub types of the super type and forming supertype/ subtype
relationships. Specialization is a top down process the direct reverse of generalization.
Fig 4.5 page 134
Fig 4.5a shows an entity type named PART together with several of its attributes. The identifier
is part_no and other attributes include Description, Unit_price, Location, Qty_on_Hand,
Routing_number and suppliers(The last attribute is multivalued since there may be more
than one supplier with associated unit price for a part.
Here some parts are manufactured internally while others are purchased from outside suppliers.
Thus routing numbers applies only to manufactured parts while supplier_ID and
Unit_price apply only to purchased parts.
In this example every time a new instance of Patient is inserted into the supertype a
corresponding instance is inserted into either OUTPATIENT or RESIDENT PATIENT. If the
instance is inserted into RESIDENT PATIENT an instance of the relation ship is assigned is
created to assign the patient to a hospital bed.
Motorcycle is a type of vehicle, but that it is not represented as a subtype in that model. Thus if a
vehicle is a car it must appear as an instance of CAR. If it is a truck it must appear as an instance
of TRUCK. How ever if the vehicle is a motorcycle it can not appear as an instance of any
subtype. This example of partial specialization and it is specified by the single line from the
VEHICLE supertype to the circle.
SPECIFYING DISJOINTENESS CONSTRAINTS:
The business rule in this case is the following at any given time a patient must be either an
outpatient or a resident patient but can not be both. This is the disjoint rule as specified by the
letter ‘d’ in the circle joining the supertype and its subtypes.
Note: The sub class of a PATIENT may change over time but at a given time a PATIENT is only
of one type.
Overlap Rule:
The overlap rule specifies that an entity instance can be simultaneously be a member of
two or more sub types.
Fig 4.7b pg 138
In this example an instance of PART is a particular part number. That is a type of part not an
individual part. This is indicated by the identifier which is Part_No. For ex consider part number
4000.
The overlap rule is specified by placing the letter ‘o’ in the circle . thus any part must be either
purchased or a manufactured part or it may be simultaneously be both of these.
Thus for example the condition “employee_type=”S” causes an entity instance to be inserted
into the SALARIED EMPLOYEE subtype.
OVERLAPPING SUBTYPES:
When subtypes overlap a slightly modified approach must be applied for the subtype
discriminator. The reason is that a given instance of the supertype may require that we create an
instance in more than one subtype.
Fig 4.9 page 140
A new attribute when named part_type has been added to PART. Part_Type is a composite
attribute with components manufactured and purchased. Each of these attributes is a boolean
variable. It takes only the values yes “Y” and no “N” when a new instance is added to part
ENTITY CLUSTRERING:
A entity cluster is a set of one or more entity types and associated relationships grouped
into a single abstract entitytype.
SINGLE UNIT: represents the SALESPERSON and SALES TERRIORITY entity types and the
serves relationship
CUSTOMER: represents the CUSTOMER entity supertype, its subtypes and the relationship
between supertype and subtypes
ITEMSALE: represents the ORDER entity type and ORDERLINE associative entity as well as
the relationship between them.
ITEM: represents the PRODUCT LINE and PRODUCT entity types and the includes relation
ship
MANUFACTURING : represents the WORK CENTER and EMPLOYEE supertype entity and
its subtypes as well as the works in and supervises relationships and the relationship between the
supertype and its subtypes.
MATERIAL: represents the RAW MATERIAL and VENDOR entity types, the SUPPLIER
subtype, the supplies relationship, and the supertype/subtype relationship between VENDOR and
SUPPLIER.