Escolar Documentos
Profissional Documentos
Cultura Documentos
GIS
Lecture Name :-
INTRODUCTION
Geographic data in digital form are numerical
representations of the real world.
It describes real-world features and phenomena
coded in specific ways in support of GIS and
mapping applications using the computer.
The digital geographic data must be organized
as a geographic database.
Roughly two-thirds of the total cost of
implementing a GIS involves building the GIS
database which should be accurate and has a
significant impact on the usefulness of the GIS.
spatial data
non-spatial data.
SPATIAL DATA
It gives information about the geometrical
orientation, shape and size of a feature, and its
relative position with respect to the position of
other features.
Spatial data is described by its x and y
coordinates.
The spatial data is normally available in analog
form as maps but now the maps are also
available directly in digital format.
In GIS, both types of the spatial data are handled
differently.
Normally the spatial and non-spatial data are
stored separately in a GIS, and links are
established between the two at the time of
processing and analysis.
NON-SPATIAL DATA
The non-spatial data, also known as attribute data,
are information about various attributes like length,
area, population, acreage, etc.
The non-spatial data describe the attributes of a
point, along a line, or in a polygon.
In other words they describe what is at a point
(e.g., a hospital), along a line (e.g., a canal), or in a
polygon (e.g., a forest).
The attributes of a soil category may be depth of
soil, texture, type of erosion, or permeability.
The non-spatial data, mostly available in tabular
form, are also converted into digital format for use
in GIS.
DATA REPRESENTATION
Nominal Variables
Nominal Variables are used when the data are
principally classified into mutually exclusive sets
or levels based on relevant characteristics.
The nominal variable is the commonly used as a
measure for spatial data.
It can be of two types as below.
a) Dichotomous
b) Categorical
Categorical data
These are used when it is required to classify the
data into one of several categories by name with
no specific order.
Categories of urban area such as residential area,
recreational area, business areas, or
trees such as Quercus agrifolia, Pinus Coulteri,
Eucalyptas calophylla,
are different kinds of categorical variables.
ORDINAL VARIABLES
Ordinal Variables are lists of discrete classes but with
an inherent order or sequence.
This representation of data is more sophisticated and
orderly as the classes are placed into some form of
rank order based on a logical property of magnitude.
The ranking of data may be natural such as grades of
agricultural land, or according to some criteria, such as
population density.
class of streams may be first order, second order, and so
forth,
levels of education may be primary, secondary, college,
post-graduate,
are ordinal variables since the discrete classes have a natural
sequence.
Internal Variables
Internal variables also have a natural sequence,
but in addition, the differences between the
values are quantified.
For example, the elevation of points is an internal
variable since the difference in elevation between
two points having elevations 55 m and 65 m is
the same as for other points having elevations 80
m and 90 m.
The representation of population in same order is
an example of internal variables.
RATIO VARIABLES
o Ratio variables have the same characteristic as internal
variables, but in addition, they have natural zero or real
origin (i.e., starting point).
o Per capital income,
o the fraction of the weight of a soil sample that passes through
sieve,
o Marks in percentage.
are common ratio variables.
land use
crop type
cropping area
water bodies and drainage
soil types
forest types
groundwater potentials
mineral resources.
Demographic data
population,
age structure,
sex ratio,
urban and rural population,
reserved caste population,
occupational structure, and
migration patterns.
Agro-economic data
pricing information.
Socio-economic data
industrial,
fishing,
tourism development ,
beneficiaries of various schemes and
programmes of development.
Infrastructure data
Source of Data
a)
b)
c)
d)
e)
f)
Remote Sensing
Existing Maps
Photogrammetry
Field Surveying Methods
GPS
Internet
REMOTE SENSING
The terrain data acquired through sensors onboard
satellite platforms being in digital format can be
directly used after preprocessing for preparing a
GIS database.
These data are coded in picture elements called
pixels, and stored in the form of a two-dimensional
matrix that contains merely a number representing
the amount of the reflected electromagnetic
radiation received in a given band.
The digital images must be located properly with
respect to a geodetic grid, otherwise the data they
contain, cannot be related to their true ground
position.
EXISTING MAPS
The acquisition of digital data by digitizing
existing maps is comparatively cheaper, and
requires less time compared to other methods.
The elevation data extracted using contours have
poor accuracy as compared to spot heights.
The digitized contours cannot be used for any
useful GIS application other than regenerating the
original contours themselves.
By Photogrammetry
When the area of interest is too extensive or
too rugged, the photogrammetric method is
employed to collect the digital terrain data
using
appropriate
photogrammetric
instruments such as analog stereoplotter
equipped with encoders, analytical plotter, or
using digital photogrammetry methods.
Analog stereoplotter provide 3D data from
aerial photographs.
The digitization of 3D models of the terrain
formed by analog stereoplotter is done by
equipping the stereoplotter with linear and
rotary encoders.
GPS
GPS is a satellite-based surveying system to
obtain highly accurate digital terrain data
electronically in the form of x, y, and z coordinates.
There are two basic field methods of GPS
measurements: static and differential.
Static GPS surveying is used to determine
positions of survey control points in areas where
geodetic control is lacking or unreliable.
It is also used for accurate measurement of
distance between two points using two or more
dual-frequency GPS receivers to record the
observations
made
on
GPS
satellites
simultaneously for about six hours.
INTERNET
The Internet is a vast network of digital
computers.
They are linked by an array of different data
transmission media such a satellite and radio
links, and fiber optic, unshielded-twisted pair, coaxial, and telephone communication lines.
Connecting these media involves an array of
different devices ranging from complicated
routers and data switches, through simple signal
amplifying hubs to modems.
DATA VERIFICATION
AND EDITING
Incomplete or double
When the data are entered manually, incompleteness in
the spatial data may be due to omissions in the input of
points, lines, or cells.
In case of scanned data, this error of omission is usually in
the form of gaps between lines where the raster-vector
conversion process fails to join up all parts of a line.
The raster-vector conversion of scanned data can lead to
the generation of unwanted spikes.
Sometimes one line may be digitized twice, and lines and
nodes may be disjointed at the intersections.
Data is distorted
T
T
Original line
Rubber sheeting
Greenwich
Meridian
Meridian through
P
P
O
0
Parallel of latitude
passing through P
P'
Equator
RECTANGULAR COORDINATE
SYSTEM
Since most of spatial data available for use in
GIS exist in 2D form, a referencing system that
uses rectangular coordinates is most suited.
This requires a map graticule or grid, placed on
top of the map.
The graticule is obtained by projecting the lines
of latitude or longitude from our representation of
the world as a globe onto a flat surface using a
map projection.
Non-coordinate System
Non-Coordinate System
Delivery post office
No. (267667)
(DL3 6KT)
a)
b)
c)
d)
Bias
Bias in GIS data indicates the systematic
variation of data from reality, and can be referred
to as a consistent error through out a data set.
For example, a badly calibrated digitizer
introduces a consistent overshoot in digitized
data.
Resolution
Resolution being an important concept when
dealing with spatial data.
It is dependent on the scale of the original map,
the point size, and line width of the features
represented thereon, and the precision of
digitizing.
Completeness, Compatibility,
Consistency and Applicability
A GIS data set should be complete spatially and
temporally, including with respect to attribute
information.
Different data sets in the GIS database should
be compatible to produce a sensible result.
Compatibility can be ensured
using similar methods of data capture,
storage manipulation, and
editing in producing ideal data sets.
Source of Errors
Errors in GIS data may be categorized as:
(i) Conceptual errors,
(ii) Source data errors,
(iii) Data encoding errors,
(iv) Data editing and conversion errors,
(v) Data processing and analysis errors, and
(vi) Data output errors.
CONCEPTUAL ERRORS
Conceptual errors arise from our understanding of the
real world and how it is modeled.
The perception of reality varies from person to
person, and this affects the data.
Whatever GIS model is adopted, it is a simplification
of the reality, and thus any simplification will introduce
errors of generalization, completeness, and
consistency.
Node
Original line
Overshoot
(dangling line)
Undershoot
Digitized line
Digitizing a line
Spike
Small polygon
(intentional)
Unclosed polygon
(unintentional)
Dangling line
remains
Dangling line
(intentional)
Gap
(intentional)
Arc or
line
Dangling line
(unintentional)
Node
Intentional small
polygon removed
Intentional gap
removed
TOPOLOGICAL ERRORS IN
CONVERSATION
Original vector map
Forest polygon
digitized at time
t1
Forest polygon
digitized at time
t2
Overlay
operation
Sliver polygons
THANK
YOU