Escolar Documentos
Profissional Documentos
Cultura Documentos
Geographic Data:
Concepts, File Formats, Topology
The same entity can be represented at different scales by different object types: the
multi-representation problem
Behavior can be associated with objects thus they can change over time
Field View
Real world properties vary continuously over space;
every place has a value
represent as raster data or as vector data in a TIN (triangulated irregular network
Shapefiles are the simplest and most commonly used format. Used them in
GIS Fund. Will use Geodatabases in Applied GIS (and some coverages).
Database
Environments
Old Model:
Geo-relational Database
GIS
User
SDE
db
GIS
GIS Data
Data Models
Models
File-based
File-basedand
andDatabased
Databased
Geodatabase
Features
Workspace
Coverages
Images
Tins
Shapes
Grids
Tables
Grids
Images
Relationships
Rules
Tables
One Repository
Source: ESRI, Inc.
Concept of Topology
Topology distinguishes GIS data models from non-topological
data models supported by many CAD, mapping and graphics
systems
Topology refers to knowledge about relative spatial
positioning of features.
knowledge about how features are connected and which features are
adjacent to each other.
10
2
3
6
7
11
Shapefiles
share some similarities with coverages but are not fully topological
May need to covert to coverages for some analyses.
Discuss topology for coverages later today and for geodatabases later in the course.
12
Feature ID
(key field)
Feature
type
Secondary or
Foreign key
Raster
Tracts
shapefile
Trans
shapefile
15
Shapefiles
openly published structure for spatial data (Coverages &
Geodatabases are proprietary)
Partially an attempt (successfully!) by ESRI to make their format the
industry standard
road.shx
road.dbf
17
Geodatabase (gdb)
Feature (vector) datasets
Spatial Reference
Object classes and subtypes
Feature Classes and subtypes
Relationship classes
Network Topology
Planar topology
Domains
Validation Rules
Raster Datasets
rasters
TIN (3-D) datasets
nodes, edges, faces
Locators
addresses x,y locations
Zip codes place names
route locations
Anatomy of a Geodatabase
Geodatabases may contain: feature datasets,
raster datasets, TIN datasets, locators
Feature datasets contain vector data
All data in a single feature dataset share a
common spatial reference system
Similar Objects (e.g. Jane Blow, land owner) are
instances of object classes (e.g. land owners)
and have no spatial form.
Features and feature classes are spatial objects
(e.g. land parcels) which are similar and have
same spatial form (e.g. polygon)
Object (or feature) classes are the tables, and
objects (or features) are the rows of the table
Attributes are in the columns of the table
Subtypes are an alternative to multiple object (or
feature) classes (e.g. concrete, asphalt,
gravel road subtypes): think of subtype as
the most significant classification variable
(attribute) in the class table
Domains define permitted data values.
Topology is saved as a relationship between the
feature classes in the feature dataset.
address
201 N. Hi
207 N Main
20 Elm
40 Oak
Attribute = column
dob
45
55
75
80
ssn
274-54-8910
234-81-7890
890-75-9876
x04-23-7890
19
feature classes are spatial data sets containing geographic features (e.g. land parcels): a table with spatial
data
Data in FC must have same topology type (all points, all lines, all polygons)
Water feature class with lakes (polygon) and streams (line) not permitted
Subtypes are subclasses within a feature class that allow you to further distinguish objects without
creating new feature classes
based on a single columns values (must be integer or long integer)
Same subtype has similar attribute values and behaviors
Use where attributes are the same across all subtypes
Coded value domain: specifies a valid set of values for an attributes. Can apply
to any type of attributes
Parcels can only have RES or VAC land use values
Spatial domain
The allowable coordinate range for the geographic coordinates
X/Y Domain: MinX, MaxX, MinY, MaxY (horizontal extent)
Z Domain: Min, Max (vertical extent)
M Domain: Min, Max (other parameter, e.g. distance from river mouth ) (can differ within feature data set)
Once created, the spatial domain for feature dataset/class cannot be changed.
Data outside extent will require a new feature dataset or standalone feature class.
Precision
Number of system storage units (SU) per one map measurement unit (MU)
If precision is 1 and mu= 1 meter ( 1 SU per MU), cannot record values less than 1 meter
If precision is 100 and mu= 1 meter (100 SUs per MU), can record values
to 1/100 = .01 = 1 cm
25
The Coverage
Digital version of a single map sheet layer and generally contains one type of
map feature such as streets, parcels, soils,
Can contain both the coordinate/spatial data and the descriptive data for features
in a given geographic area.
Additional attribute data about features (entities) can be stored in data base tables
using proprietary INFO relational data base system
Allowed user to customize, organize and store substantial amounts of attribute data
and relate to spatial data
26
Workspace
Coverages must be stored in workspaces
A workspace is the work area used during an ARC/INFO
session.
Within the computer file system, the workspace is a directory
(folder) containing one or more geographic data sets (e.g.,
coverage, tin, grid), a local INFO database, and other
supporting data.
at a minimum it is a folder containing an INFO subfolder
(subdirectory)
More than one user can read data from the same workspace,
however, it is strongly recommend that only one user access a
workspace for creating or updating data.
27
28
The INFO folder contains the feature attribute tables and related tables for all coverage in that
workspace.
Unfortunately, file names do not correspond to the names of files we work with!
29
Soil
POLYGON
ARC/INFO Spatial
Database Structure
(coverage)
INFO
ARC
Soil
AAT
TIC
BND
ETC.
PAT
Exceptions:
Can drag and drop, cut, copy, paste, and delete the entire workspace
Can drag and drop, cut, copy, paste, and delete the interchange file
(e00) created by exporting the coverage
Naming Coverages
Names for coverages are maximum 13 characters in length and cannot
include blanks or special characters (-,#, etc) other than under_score
31
32
33
You can add items (columns) to the PAT after the <cover>-id item.
34
35
You can add items (attributes) to the PAT after the <cover>-id item.
The first polygon is always the universal polygon which represents the
coverage boundary.
36
37
38
TYPE
ABBREV.
WIDTH
Main Street
Character
1 to 320
10/15/1990
Date
23675
Integer
1-16
347.22
Numeric
1-16
1344719822
Binary number
2 or 4
99378164.788
Binary floating
point
4 or 8
39
In ArcToolbox:
The greatest number of conversion options are available here.
40
Coordinate Systems
41
Coordinate Systems
All spatial data is in a coordinate system
You must know what it is!
Projection
The transformation by which 3D lat/long is converted to 2D X/Y Cartesian values
parameters normally required to describe the exact nature of the projection
measurement units: usually feet or meters, also must always be specified
Thus, for any a spatial data set, knowing simply the name of the
projection is not sufficient. Must also know:
Datum
Parameter(s)
Measurement units
Project
Actually projects the data. Think of this as reproject.
The data does change.
The current projection (input) must already be known by the ArcGIS
system,
That is, you have to do a Define first, if somebody has not already done it
43
In ArcMap
Providing that it is already defined (projection system known to ArcGIS), data brought
into a data frame (whose coordinate system is also known) will be reprojected in
memory to the coordinate system of the frame for display.
It can be exported in this (potentially) new projection, if desired.
In effect, this projects the data.
Note double proviso: known coordinate system for data inputted and for frame.
44
Warning!
Failure to correctly deal with datums and
projection is the single major source of
problems in GIS!
Assuming that the software will take care
of it is an invitation for eventual disaster!
45
Appendix
46
Vertice
Node
47
from-node/to-node
fromnode
1
3
right
polygon
tonode
Sections
2
3
1
Arcs
4
2
48
Main Street
49