Escolar Documentos
Profissional Documentos
Cultura Documentos
1
geodatabase
A map portrays geographic information through a series
of thematic overlaysstreets, buildings, water, terrain,
imagery. So does a geographic information system (GIS).
Each geographic data model is a collection of thematic
overlays used to represent geographic location and shape
as well as descriptions of the geographic features (points,
lines, polygons, imagery, and so on). The geodatabase is a
collection of these datasets and properties.
The geodatabase extends tables in a relational
database with shape fields that contain point,
line, or polygon geometries. Imagery and continuous
data are stored with efficient raster data structures
scalable to large data volumes. From these simple tables,
feature classes, and raster datasets, many structures
and techniquesfeature datasets, subtypes, attribute
domains, networks, terrains, address locators, and
moreprovide a framework for sophisticated modeling of
natural and man-made systems.
4 How Maps work
6 Maps and thematic layers
7 Map layers and the geodatabase
8 What is the geodatabase?
9 Types of geodatabases
10 Why use the geodatabase?
12 Extending the relational database
14 Extending simple datasets
16 The geodatabase transaction model
18 steps for designing geodatabases
How maps work
Think of a map as a geographic Points, lines, polygons, and images may be red or black lines. City
information system that you can on a map are organized into the- blocks might be color coded by
fold and put in your pocket. matic layers. A collection of points zone. The map legend is a symbolic
Maps work because they follow a can be used to represent cities or inventory of all the types of features
well-evolved set of practices to pres- well locations. Lines show features in a map.
ent geographic information. such as road systems or river net- Thematic layers are the starting
works. Polygons describe areas and point for modeling our world. Maps
Maps present a view of a geographic are often used to depict buildings,
area. Representations of real-world combine many thematic layers over
states, or countries. Imagery shows a common geographic area. These
entities are drawn with point, line, a picture of the landscape captured
and polygon symbols and are also thematic layers not only contain col-
from airplanes or satellites. lections of similar entities, but also
shown with imagery.
Maps apply symbols thematically. model their attributes, relationships,
Rivers are typically blue lines, roads and behavior.
This USGS 7.5 minute quadrangle map of Point Reyes, California from 1916 demonstrates the use of point, line, and polygon symbols.
There are three types of geodatabases: personal, file, and ArcSDE. Select-
ing the appropriate geodatabase to work with will depend on the specific
requirements of your GIS project.
Personal geodatabases
The personal geodatabase is based on Microsoft Access software and was
the original geodatabase available at the launch of ArcGIS 8. It is designed
for single users working with small to moderate size GIS datasets. The per-
sonal geodatabase is implemented within a single Microsoft Access file with
a maximum size capacity of 2 gigabytes.
File geodatabases
The file geodatabase is ideal for GIS projects The file geodatabase was introduced at ArcGIS 9.2. The file geodatabase is
done by single users and small workgroups. implemented as a collection of binary files in a file system and because of
The file geodatabase improves on the
this has no practical size capacity limit. Each table in a file geodatabase can
personal geodatabase in terms of scalability,
performance, and cross-platform operating be configured to store up to 256 terabytes of information.
system support. File geodatabases scale up Vector data in file geodatabases can be compressed to a read-only format,
to handle very large datasets by using an
providing a smaller memory footprint and improved geodatabase perfor-
efficient data structure that is optimized for
performance and storage. mance. Uncompressing the vector data will make it editable.
The file geodatabase can support more than one editor at a time, provided
the editors work on different tables, stand-alone feature classes, or feature
datasets.
For new geodatabases, ESRI generally recommends file geodatabases
over personal geodatabases because of improved functionality and better
performance.
ArcSDE geodatabases
ArcSDE geodatabases are designed to be scalable and to support multiple
readers and writers concurrently. Each has the ability to manage a shared,
multiuser geodatabase as well as support for a number of critical version-
based GIS workflows such as access permission control for individual data-
sets, versioning, geodatabase replication, and historical archiving. ArcSDE
geodatabases can be implemented on many different relational database
platforms including Microsoft SQL Server, Oracle, IBM DB2, IBM
Informix and PostgreSQL. Because the enterprise level of geodatabases
use relational databases, they are only limited by the underlying relational
databases and can be scaled to any size and support any number of users,
running on computers of any size and configuration. The ability to leverage
your organizations enterprise relational databases is a key advantage of the
ArcSDE geodatabase.
The geodatabase has several key qualities that make it flexible enough to
manage any GIS.
1 Simplicity and efficiency
Since the advent of computer-based The geodatabase represents real-world entities in a simple and efficient way.
geographic information systems in The geodatabase is built on a base relational model in which geographic data
the 1960s, many different formats for is organized into tables which contain rows. All rows in a table have the
storing geographic data have been same columns, therefore maintaining relational integrity. Each column has
developed. These geographic data formats
a type such as integer, decimal, character, date, and so on.
include coverages and shapefiles, which
were developed by ESRI for previous The geodatabase extends this relational model by including a column
generations of GIS software. While with a geometry type. This allows the geodatabase to conform to a simple
coverages and shapefiles were very feature model where geographic elements in the real world are represented
successful in their time and to this day by simple features with geometries of point, line, and polygon. Using this
have widespread use, they now place
simple and efficient design allows the geodatabase to be fast and scalable.
technical limits on users for modeling
geographic systems and handling huge 2 Transactional model for editing workflows
amounts of data.
In the year 2000, ESRI introduced the
The geodatabase also has a comprehensive transactional model used to apply
geodatabase with the release of ArcGIS 8. edits to rows and columns in tables. This allows users to perform inserts,
The geodatabase was created in response to updates, and deletes on data. The geodatabase can extend this transactional
user requirements for increased scalability model to accommodate a wide range of workflows including multiuser edit-
for large datasets, open access to data, and ing and data distribution through geodatabase replication.
richer modeling of geographic systems such
as transportation networks, surfaces, and 3 Scalable framework
parcel cadastres. The geodatabase is a robust and customizable framework that can scale
The creation of the geodatabase became from small, single-user databases up to large enterprise-level systems with
possible at that time because relational many concurrent editors. Because of the simple design behind extending
database technology had matured to
the fundamental relational, simple feature, and transactional models, the
the point that extensions to relational
database technology could handle the geodatabase has the flexibility to represent geographic data models of any
specialized requirements of geographic size or design.
data. Since then, subsequent releases of
4 Effective display and analysis of geographic data
ArcGIS software have continually added
enhancements to the geodatabase and it Datasets stored within the geodatabase can be overlaid on a map for display
remains the state of the art. and analysis. Combining thematic datasets over a common geographic area
helps you visualize how the layers interact. Datasets can be used as input
into geoprocessing tools and models to perform analyses. Results can be
displayed on the map or even used to derive new datasets which can be used
for further analysis.
5 Extended capabilities and behavior
Simple features stored in tables are the foundation of the geodatabases
information model. Groups of points, lines, or polygons are combined into
feature classes. Tables, feature classes, and raster datasets form the basis for
representing real-world entities as geographic data. The geodatabase can
extend this information model with more advanced dataset types, capabili-
ties, and application logic used to maintain data integrity and model spatial
relationships. The geodatabase can also model more dynamic systems such
as topologies, networks, parcel fabrics, and surfaces.
d
el
el
el
el
Fi
Fi
Fi
Fi
Row value value value value
or text.
value value value value
value value value value
Storing attributes
All three primary datasets in the geodatabase (feature classes, tables, and
A table is the primary unit of storage in a
raster datasets) as well as other geodatabase elements are stored using tables
database. Conceptually, a table is a matrix
of values organized into rows and columns. in a database. These tables are related to one another using key values in the
Each row represents an entity. Each column attributes of the tables. This is commonly referred to as the relational model
or field represents an attribute type shared and is the backbone of the relational databases on which the geodatabase is
by all rows. built.
Tables also have integrity rules which help ensure the correctness of data
stored in a table. Other types of rules control how objects can be related to
Feature class each other, either logically or spatially.
Storing feature shapes
e
ap
d
d
el
el
el
Fi
Fi
53696 60910
60900
61562 60972
The geodatabase is often thought of as an object-relational database because
it employs a multitier application architecture. The advanced logic and
67810
behavior is implemented in the application tier which is separated from
55136
62932
59548
62866
62868 62306 the data storage tier controlled by the relational database. The geodatabase
53956
56460 application logic includes support for a series of generic GIS data objects
57150
6 and behaviors such as feature classes, raster datasets, topologies, networks,
63106
57152
63760
69178
address locators, and much more.
Feature class Parcel
60768 58632
62822
62854
62074
lu
va
62364 59424
58232
se
55242
lI
ct
es
44670
rc
bj
ss
Sh
Pa
O
14352 Polygon 56460 $58,000.00 types and associative query processing, are delegated to the data storage tier
17234 Polygon 53956 $56,000.00
in the relational database. The GIS application tier retains responsibility for
19923 Polygon 62866 $45,000.00
23049 Polygon 67810 $52,000.00
defining the specific schema used to represent various geographic datasets
26965 Polygon 62932 $47,000.00 and for domain-specific logic, which maintains the integrity and utility of
the underlying records.
Each row in this parcel feature class has a
shape field for the parcel boundary and the
attributes of that parcel.
Application tier
in simple tables managed by a database ArcGIS products
management system. In the application tier,
data is accessed through ArcGIS applications ArcGIS Desktop
as tables, feature classes, and rasters. ArcGIS Engine
ArcGIS Explorer
Tables Feature classes Rasters ArcGIS Server
Relational databases
Storage tier
Oracle
Microsoft SqlServer
Tables IBM DB2
IBM Informix
PostgresSQL
GIS users have some specialized transactional Geographic data is used to model many types of real-world entities.
requirements. Often, transactions must span Although the characteristics of these entities can vary, they have one quality
long periods of time (sometimes days and
in commonthey are dynamic.
months, not just seconds or minutes).
In a multiuser database, the GIS Consider a land parcel feature class as an example. Over time, land parcels
transactions must be managed on the undergo changes such as splitting or merging boundaries, reassignment of
relational databases short transaction zoning codes, and transfers of landownership. Because geographic informa-
framework. ArcSDE technology plays a key
tion changes over time, the geodatabase provides a transactional framework
role during these operations by managing
the high-level, complex GIS transactions for users to easily edit their datasets while preserving data integrity.
on the simple transactional framework of a When you edit geographic data, you follow a workflow similar to editing
relational database.
text within a Microsoft Word document. As you write a word document,
you add, update, and remove text and pictures. At any time, these modifica-
tions can be undone, which restores the Word document to a previous state.
The edits you perform are only committed when the Word document is
saved.
Likewise, when you edit in ArcGIS, you add, update, and delete features in
your geographic data. Changes to datasets in your geodatabase are commit-
ted only when you save your edits.
ArcGIS extends this simple transactional model to accommodate more
advanced GIS workflows as well. Organizations such as utilities have
complex transactions that span days or weeks. These are often called work
orders and comprise all of the changes within a defined job, such as adding
a new utility service or replacing part of a utility distribution network. The
challenge of working with work orders is ensuring that the editor can seam-
lessly perform a complex set of edits to multiple datasets over an extended
period of time without locking those datasets to other editors. ArcGIS
addresses these workflows through a technology called versioning, which is
available with ArcSDE geodatabases.
Versioning in the geodatabase
DEFAULT version Versioning allows multiple users to edit the same data simultaneously by
giving each editor their own version of the original data to work with.
Versioning works by recording any changes that are made to a dataset in an
Roads Sewers
Gas pipeline ArcSDE geodatabase. Only the changes made to the dataset are tracked and
Phase II
this avoids duplication of data.
Each editor works with their own version of the data until they decide to
Roads Roads Gas pipeline Gas pipeline merge their changes back in with the original data. Once an individual edi-
Phase II Phase II
design maintenance
East West
tors changes are incorporated into the original data, the edits are visible to
other connected users.
The goal of versioning is to provide a large number of users with high-per-
Roads Roads
maintenance maintenance formance access to the geographic datasets in their geodatabase. Geographic
East West information systems that employ versioning can efficiently support the use
of datasets containing millions of rows accessed simultaneously by hundreds
A conceptual view of versioning as applied to
a public works department of a municipality of users.
A GIS design is built around the set of thematic layers of information that
will address a particular set of requirements. A thematic layer is a collection
of common features, such as a road network, a collection of parcel boundar-
ies, soil types, an elevation surface, satellite imagery for a certain date, and
well locations.
!
17
1540
2.3
4
0
00 155 9.3
4 0. 7 0 34.30 2 98 5
R= 5
. 00
37
44
163.5
.50
6 7. 5 4
156 236
8
235
15
0
237
26. 0 239
3 40 0 0 13. 37. 00
.0
45 .
11
00
3 3. WA TE
0
9.
M
6
40
0
158 . 4 R
.0
40 38
10
AI
. 6. 0
234 79. 48
3 4. 0
30 4
0 .00
8.
R =10 0
40
7 5 . 00
NV
15
50. 0
8
4.
159 10 0
. 5
37.60 137.9 0 0 5 0. 0
IE
2.
0 207 0
135.5
5.
630
00
W
8.
.0
233
00
2
206 11
40
160
0
4 2. 4. 00
.9
11
137.7
6
.0
127.80
0. 19
69
5
0
40 7.
1
205 2. 00
.7
00
55 .
0
8
232
12
161 208 17
4 0. 0
143.50
42. 0
00
8
6. 5
0
.0 0
45
129.30
40
7.
PA RK
.0
First, you define these thematic layers for your particular applications and
40
0 4 5. 00 45
104.90
.0
99
0
0
0
0
LEOTA
60 . 00
231 209 0
40
104.9
.0
R=10 5
40
0
.0
0
137.40
210
7 3. 7 6
0
120.30
45
162 11 3.0
53
R =4 3 1 . 9 6
.0
.2
163
.0
230
0
7
90
38 211
60 . 0 0
OUTLO
.00
00
154.5 0 11 4.
70 204
50
40. 28 T D
.0
.0
0
25
8
1 2 0. 0 0 . 0 0
0
0
50
.0
212
0
38
0 4 9. .5 0
2
.0
45
7
122
0
167
.0
7.
0
information requirements. Then, you define each thematic layer in more
R =2 7 PA RK
40
5 00 213
38
6 5168
R= 5
. 00
6 0. 9 . 7 2 . 00
. 00
2 8 0.
138
1690 0
0 . 00
1 4 7.
2 7. 2
80 4
6 5. 214
0
.8 0 .
.80
10 40.0 0
170 0 0
50
190 14
229
4.
43
171 215
30 . 6 0 .
.60
.3
186
.1 0 .00
14.30
0 24
20. 4 0
2 9.
228 216
8
20
.30
19.
178
227
4
0. 0
.0 0
9. 70
0
16 226 0
0
2.0
225
16
3
- 0
tion of standard geodatabase data elements, such as feature classes, tables,
-1 7
relationships, raster datasets, subtypes, topologies, and domains.
When identifying thematic layers in your design, try to characterize each
theme in terms of its visual representations, its expected usage in the GIS,
its likely data sources, and its levels of resolution. These characteristics help
describe the high-level contents expected from each theme.
Once you have identified the key thematic layers in a design, the next step
is to develop specifications for representing the contents of each thematic
layer in the physical database. These include how the geographic features
are to be represented (for example, as points, lines, polygons, rasters, or
tabular attributes); how the data is organized into feature classes, tables, and
relationships; and how spatial and database integrity rules will be used to
implement GIS behavior.
17
15 4
2.3
4
0
00 15 5 9.30
0
40. 7 0 34.3 29 55
8
R= 0
0
3 7.
44
163.
. 50
67 .
15 6 236
50
85
235
15
237
9.5
23 8
0
15 7
4
. 00
2.
0 40 2300
26. 3 0 0 1 3. 37.
WAT 9
5 .
00 4
0
11
3 3.
0
46
ER
M
.4
158 0
0
. 4
.0
40 38
10
AI
. 36. 0
234 79. 48
3 4. 0
0 . 00
8.
40
75. 00
NV
8
15
104
. 50. 0
159 0
.
37.60 137.9 0 5 0. 0
IE
20 7
55
0 50 0
135.
6 30
0
W
.0
8.
.0
23 3
0 4 84. 0
20 6 11
0
16 0
0
4
Map use Parcels define landownership and are used for taxation.
.9
11 1
0.
137.70
6
2. 0 0
127.80
0. 97
5
69
0
0
40 .8
20 5 2. 0
.7
0
55
0
04
232
12
17
. 00
161 208
4 0. 0
143.50
6
.0 0
2. 00
. 50
45
40
7.
129.30
99 PA RK
.0
Data source
40
104.9
.0
0
0
LE OTA
60. 00
23 1 209 90
40
104.
.0
R=10
40
0
.0
0
137.40 210
73. 76
120.30
162 00
45
53
11 3.
R =4 3 1 . 9 6
5. 0
16 3
.0
230
0
7
90
211
60. 00
38
.
OUTL
.00
00
154.5 0 OT 11 4.
70 20 4
50
.0
40. 28
D
.0
25
8 0
0
1 2 0 . 0 0. 0 0
0
50
.0
21 2
38
0
0 4 9.
0
2
Spatial relationships Parcel polygons cannot overlap and are covered by boundary lines.
.0
12 2.5
45
7
16070
0
.0
7.
R =2 PA RK 21 3
40
5
3 8.
6 516
R= 5
. 00 79.
8 . 00
2 8 0.
60. 72
00
138
16 90 0
0. 8 0 0
147.
27. 2
65.
0 44
Map scale and accuracy Typical map scales are 1:1,200 and 1:2,400.
0 21 4
0
.8 0 .
10 40.0 0
170 0 0 190.8
50
14
22 9
.
43
17 1 0 215
3 0 . 60 .
186.6
.3
.1 0 0. 00
2 0. 4 0
2 9.
22 8
48
21 6
20
30
19.
178.
30
22 7
4
0. 0
.0 0
9. 70
0
16 22 6
00
2.22
16 5
Design steps
The eleven steps presented on the following page outline a general GIS
database design process. The initial conceptual design steps help you iden-
tify and characterize each thematic layer. In the logical design phase, you
begin to develop representation specifications, relationships, and, ultimately,
geodatabase elements and their properties. In the physical design stage, you
will test and refine your design through a series of initial implementations.
You will also document your design.
conceptual design
mission-based requirements for your organization. List the data sources you currently use in this work. Use these to drive your data
design needs.
3 Specify the scale ranges and the spatial representations of each data theme at each scale
Data is compiled for use at a specific range of map scales. Associate your geographic representation for each map scale. Geographic
representation will often change between map scales (for example, from polygon to line or point). In many cases, you may need to
generalize the feature representations for use at smaller scales. In other situations, you may need to collect alternative representa-
tions for different map scales.
5 Define the tabular database structure and behavior for descriptive attributes
Identify attribute fields and column types. Tables also might include attribute domains, relationships, and subtypes. Define any valid
values, attribute ranges, and classifications (for use as domains). Use subtypes to control behaviors. Identify tabular relationships and
associations for relationship classes.
6 Define the spatial behavior, spatial relationships, and integrity rules for your datasets
logical design
For features, you can add spatial behavior and capabilities and also characterize the spatial relationships inherent in your related
features for a number of purposes using topologies, address locators, networks, terrains, and so on. For example, use topologies to
model the spatial relationships of shared geometry and to enforce integrity rules. Use address locators to support geocoding. Use
networks for tracing and pathfinding. For rasters, you can decide if you need a raster dataset or a raster catalog.
Test your prototype design. Build a sample geodatabase copy of your proposed design using a file, personal, or ArcSDE Personal
geodatabase. Build maps, run key applications, and perform editing operations to test the design's utility. Based on your prototype
test results, revise and refine your design. Once you have a working schema, load a larger set of data (such as loading it into an
ArcSDE geodatabase) to check out production, performance, scalability, and data management workflows.