Escolar Documentos
Profissional Documentos
Cultura Documentos
https://doi.org/10.1007/s12524-019-01077-4 (0123456789().,-volV)(0123456789().
,- volV)
RESEARCH ARTICLE
Abstract
Road network delineation from remote sensing imagery is useful in many applications related to humans in present days as
it provides meaningful information which is very helpful in the field of transportation and other support systems of human
civilization. However, the road network extraction from remote sensing images is a challenging problem because of
various properties of road network altered by different factors. In this paper, a supervised multistage framework based on
least square support vector machine (LS-SVM), mathematical morphology and road shape features is proposed to extract
road networks from remote sensing images. In the first stage, after denoising the image using the nonlinear filter, the image
is segmented using LS-SVM into two classes’ road or non-road but some non-road components like building structures,
shadows of trees and other roads like objects are also classified as the road network. In the second stage, the morphology
and primary shape features are used to remove the non-road objects. Finally, road centerlines are extracted using a method
based on Euclidean distance transformation. The experimental results reflect the accuracy of the proposed work as
compared to other road extraction methods.
123
Journal of the Indian Society of Remote Sensing
environment generates unmatched breaks for multi-tem- The KB method uses geometric features to detect specific
poral and spatial analysis alike observing variations in shaped objects. However, they suffer from over extraction
built-up areas and investing how these variations interact and occlusion. Gao et al. (2018) proposed a KB method
with other geographical features like vegetation. Apart based on segmentation and shape features. Hu et al. (2007)
from these, we can compare the current GIS data with proposed an automated method by tracking road footprints
previous data to obtain better results. So, it is necessary to using a toe-finding algorithm and using shape features for
develop an automatic or semiautomatic method for the pruning road network. Extended Kalman filter is
extracting road network. applied to demarcate road network by Movaghati et al.
In this present study, an attempt has been made to (2010).
increase the accuracy of road network extraction in a Mathematical morphology in combination with other
multifaceted environment. For this purpose, a multistage segmentation and classification methods are used to
framework based on region-based classification and shape delineate the road network from RSI and gained popularity
features has been proposed. The general approach in due to their speed, less computation and accuracy but their
region-based methods includes the classification or seg- performance depends on numerous factors such as choice
mentation of road or non-road regions in RSI. Further, the of structuring element. Géraud and Mouret (2004) use
additional operations are applied to delineate the road morphology with Markov random field to extract road
network. network, and TPA (texture progressive and morphology)
However, the present study introduces a framework, are used by Mena and Malpica (2005) for vectorization of
wherein the denoised image is classified using LS-SVM the road network of the semi-urban area. Singh and Garg
into the road and non-road classes. The framework further (2013) used segmentation based on adaptive thresholding
moves ahead with the elimination of non-road components and mathematical morphology to obtain the road network.
and delineation of the road network from the classified Sujatha and Selvathi (2015) present an automatic method
image. This entire processing is executed using morpho- using connected component analysis in conjunction with
logical operations and shaped feature algorithm. The cen- morphological operations for removing non-road elements.
terline is extracted using the euclidean distance-based Chaudhuri et al. (2012) used directional morphological
method. filters and segmentation using supervised homogeneity for
In literature, various road network extraction methods road extraction from the urban area RSI. Active contour-
from high-resolution RSI have been proposed by various based methods use snake and level set to represent the
researchers. An exhaustive survey of various road extrac- contour model to delineate the road network. Level set and
tion techniques is presented by Mena (2003) and Wang SVM are merged to delineate road network by Abdollahi
et al. (2016). Prevailing road network extraction methods et al. (2018). All the methods of road extraction from high-
can be categorized as follows: (1) classification-based resolution RSI work on features extracted either at the pixel
methods, (2) knowledge-based (KB) methods, (3) mor- level, region wise or shape features. The pixel-level
phology-based methods and (4) active contour-based methods rely on information extracted by pixels such as
methods. However, the performance depends on the line, edges, magnitude and orientation which are used at
training data used for classification. In some recent works, higher-level processing to extract road, whereas in region-
the road network is extracted by a cascaded end-to-end based methods first the whole region is segmented into
convolutional neural network (CasNet). This method can regions of road or others and in the second step further
detect the road network and extract their centerlines all refinement is made to extract road network accurately. The
together (Cheng et al. 2017). Wei et al. (2017) conceptu- KB method uses advanced information for extraction of the
alize a road network extraction using refined CNN. An road network.
object-based automatic road extraction method is proposed The methods of the road can be automated or semiau-
by Miao et al. (2015) where road features are segmented tomated; a fully automated method does not require an
and then filtered using object’s features and SVM is used operator but this is not realistic for getting accurate and
for classification. Wang et al. (2015) proposed a semiau- precise road networks due to various restrictive factors.
tomated method using a neural dynamic framework based Furthermore, various factors, such as sensors and weather,
on FSM and deep learning. Sghaier and Lepage (2016) use alter the various features of the road network in RSI, which
texture analysis, beamlet transform and SVM to extract in turn increases the complexity. In contrast, in semiauto-
road network. Bakhtiari et al. (2017) proposed a semiau- matic methods, a human operator is used, who plays a key
tomated method for road extraction using edge-based role in identification or road network and non-road objects
segmentation, SVM and morphological operation. Song in the initial phase of the method and reduces the com-
and Civco (2004) propose a road extraction method using plexity resulting in improved accuracy.
SVM and region-based segmentation using shape features.
123
Journal of the Indian Society of Remote Sensing
Data Set
The original remote sensing image suffers from the noise x/1 x
YðxÞ ¼ ea; ð2Þ
that occurs during image procurement from sensors, poor ða 1Þ!aa
illumination, thermal effects and electronic circuits, which where a2 is the variance and x is the intensity value. In the
may limit the interpretability of information. The nature of literature, it is observed that denoising in the RSI is per-
noise in remotely sensed imagery is either additive or formed by many methods by various researchers (Al-Amri
multiplicative. The random variation impulsive (Gaussian) et al. 2010) and every method has its own pros and cons.
noise and salt and pepper noise are additive in nature. The For fruitful extraction of the road network from RSI, edges
electronic circuits used in the imaging system of satellites preservation is the main criteria and the linear filter is not
are the main source of random variation impulsive noise, suitable for this task (Anil and Natarajan 2010). In pro-
and its probability distribution function (PDF) is as posed work to reduce the noise from RSI, median filter is
follows. efficiently applied along with the removal of outliers
1 ðxaÞ2 without conceding on the sharpness of edges.
PðxÞ ¼ pffiffiffiffiffiffiffiffiffiffi e 2r2 ; ð1Þ
2pr2
Classification Stage
where x represents the intensity level, a is the mean of
function, and r is the standard deviation of the noise.
After eliminating the noise from the remotely sensed
Faulty transmission during image capturing corrupts the
image, the road network can be extracted either by seg-
intensity of pixels and set alternatively minimum or max-
mentation or by any classification method. The main
imum value, generate ‘‘salt and pepper’’ noise in the image.
objective of this phase is to divide the whole image into
Speckle noise is generated due to the coherent recon-
two classes of road and non-road. Although researchers
struction of RSI that bounds the understanding of the
123
Journal of the Indian Society of Remote Sensing
employed various methods of classification that may be higher-dimensional space. The function estimation of LS-
supervised or unsupervised (Wang et al. 2016), in this SVM model takes the following form
work, LS-SVM which an efficient variant of SVM is used X
N
for classification due to better classification performance FðxÞ ¼ ak Kðx; xk Þ þ B: ð5Þ
and easy to deal with complex scenes in remote sensing k¼1
images.
The choice of kernel affects severely the performance of
SVM, so when the radial basis function (RBF) kernel is
LS-SVM
used, two tuning parameters (c) for regularization constant
and (r) for the width of the kernel are added. The results of
SVM is a supervised classification method, which is used
LS-SVM can be described by Eq. 6 and
for classification of an image into different classes or
regions (Cortes and Vapnik 1995) and has shown superior 1; if x is classified as road
RðxÞ ¼ ð6Þ
results as compared to conventional methods like neural 0; otherwise:
networks. The LS-SVM discussed by Hanbay et al. (2009) The classification algorithm used in this work is defined
is a variant of SVM due to the reformation of the objective in Algorithm 1.
1. For each road and non-road class select region of interest as training data.
2. Choose RBF kernel, penalty parameter C other kernel arguments are chosen using tenfold cross-validation
3. Uses the optimized variables based on cross-validation defined in Step 2 for the training purpose of LS-SVM.
4. Perform classification of the whole image using LS-SVM model obtained in Step 3
function. It uses equality constraints in place of inequalities Road Object Analysis Stage
in orthodox SVM and a sum squared error cost function;
these changes result in solving linear equations instead of a After the classification stage, the classified image still
convex quadratic program. The changes in objective encompasses some non-road components that are classified
function significantly simplify optimization problem and as road area and few holes caused by occlusion of vehicles
can be solved in less time. However, this reformulation or shadow of trees. To remove such regions and for filling,
results in an increase in the size of the model of LS-SVM holes morphological opening and hole filling operations are
as compared to SVM, to reduce the complexity and to performed. The non-road components from the classified
achieve sparseness; pruning methods are used (Hanbay image are eliminated to some extent by morphological
et al. 2009). The size of the LS-SVM model is larger than operations, and additional processing is essential to
SVM because it does not support vector selection and this increase the quality and accuracy of road network extrac-
can be pruned using pruning techniques. The solution tion using geometric features analysis. The road network
process of LS-SVM is preferable to SVM due to less has distinctive geometric features, different from other land
computational encumbrance and high efficiency. For linear covers present in RSI. The previous work on shape analysis
LS-SVM linear classifier is described as: shows that road is complex and irregular (Lu et al. 2009)
FðxÞ ¼ signðwT x þ BÞ; ð3Þ and roads are always elongated and there is a small vari-
ation in their curve. Therefore, road shape features can be
where B is real constant. In the case of nonlinear classifi- used to refine the extracted road network.
cation, the classifier in dual space takes the following form:
" # Area
XN
FðxÞ ¼ sign ak Fk Kðx; xk Þ þ B ð4Þ
k¼1 The no. of pixels in a road component cannot be too less or
too more, so there must be a range in which the no of pixels
where ai are positive real constants and B is real constant
which constitute the area of an object lies and can set the
and Kðx; xi Þ ¼ h/ðxÞ /ðxi Þi; \ [ is an inner product,
upper (AU) and lower (AL) limit of the area of the object.
and /ðxÞ is the nonlinear map from original space to
123
Journal of the Indian Society of Remote Sensing
Any object that has the area between [AU, AL] is included Road Centerline Extraction
in the road network otherwise rejected.
Centerline extraction from road network is an essential
Complexity Rate ðCRÞ challenging task. Various methods have been proposed by
many researchers for extracting centerlines of 2D images
CR defines the shape complexity of the object and is such as; thinning-based or onion peeling (Chaudhuri et al.
describes as 2012), geodesic-based methods (Miao et al. 2014a, b),
Ar level set marching methods (Ebert et al. 2002; Hassouna
CR ¼ ; ð7Þ and Farag 2005) and regression-based methods (Liu et al.
PM2
2016a). The results of different methods are far away from
where PM is the perimeter of the object and Ar is an area of satisfaction due to the spectral heterogeneity in land cover
the component. The value CR is decided by its complexity of RSI and may generate large spurs. Thinning-based
and it is directly related to the more complex the shape, methods are widely used as they generate the correct
complex the shape more the rate. component but require large computation and more itera-
tions. Geodesic-based methods use the shortest path for
Aspect Ratio of Bounding Rectangle ðABRÞ computation, have lower computation cost and high speed
but the centerline extracted is near to the inner boundary.
The ABR of an object can be calculated as Level set marching methods are robust in nature but it is
LARB difficult to obtain energy function. Regression-based
ABR ¼ ; ð8Þ
WARB methods use weighted least square to extract centerline but
results are inaccurate at road junctions. In this study, road
where LARB is the length of the bounding rectangle and
centerlines are extracted by using Euclidean distance
WARB is the width of the bounding rectangle.
property. The Euclidean distance transformation is applied
to the skeleton of the binary image to get the distance from
Extent ðEXÞ
the centerline to the edge and finally multiplying them to
the get centerline where every point on the centerline is the
Extent represents how rectangular shape is, how far it fills
distance to the edge. The unwanted small sections gener-
the minimum bounding rectangle and defined as:
ated in the road centerline are removed by morphological
Ar operation. The proposed method is computationally faster
EX ¼ ; ð9Þ
SMBR and inexpensive along with generated centerline is cen-
where Ar and SMBR denote the area of the object and the tered, connected and one pixel thick.
area of the minimum bounding rectangle of the object. For
performing efficient refinement of road network from the
classified image, shape features algorithm is used. To label, Experiment Setting
each disjoint component and geometric features of the road
network are extracted with the help of connected compo- The experiment for extraction and analysis of road network
nent analysis. The road object refinement algorithm is from RSI are performed in the Matlab 2018a environment
defined in Algorithm 2 on various images of the urban and suburban area. To
extract the road network and non-road regions binary LS-
SVM is used. For this, a certain region of interest of each
class (road and non-road class) is selected for the training
set, in such a way that there is no overlapping between two
classes and all other pixels that do not belong to any of the
class would be placed into a non-road class. The LS-SVM
is trained using training samples, and the function obtained
after training is used for classification of the image into the
road and non-road classes. The experimental result of the
classification of road and non-road classes using LS-SVM
on test images is shown in Fig. 2c. In the road object
refinement stage, the road network is extracted from the
classified image by filtering using morphological opera-
tions and fundamental shape features. The various
123
Journal of the Indian Society of Remote Sensing
Fig. 2 a Original image, b median filtered image, c LS-SVM classified image, d result of the closing operation, e image road object refinement,
f the centerline of the extracted road network, g ground truth road network and h overlapped image of the road map
threshold limits of different fundamental shape features are not removed by ½AL AU and CRT values of shape fea-
shown in Table 1. Finally, the centerline of the road net- tures algorithm. For the removal of such an object, we use
work is extracted by using euclidean distance transform the elongation feature of the road that can be detected by
and spurs produced are removed by morphological ABR parameter of road object refinement phase and results
operations. are clearly reflected in Fig. 2e. The centerline extracted by
The experiments are accomplished to test the efficacy of proposed method and the referenced road network are
the introduced framework, the experiments are categorized shown in Fig. 2f, g. A broken road is clearly observed in
into images of semi-urban (test case 1), images belong to a road network in Fig. 2d–f due to occlusion caused by
developed urban area (test case 2), images belong to a shadow of the tree.
developed urban area with grassland (test case 3) and aerial
image (test case 4). Test Case 2
123
Journal of the Indian Society of Remote Sensing
123
Journal of the Indian Society of Remote Sensing
In test case 3, images of the developed sub-urban area are To estimate the proficiency of the designed framework for
used to test the effectiveness of the proposed method, and extracting road networks from high-resolution RSI, the
the road is situated amid a residential area’s barren land. A delineated road network is compared with the reference
small path leading to the houses of the residential area also road network obtained by using GIMP and evaluation
considered as a road. The delineation of road networks method described in Fig. 6. The comparison is performed
from such type images is a problematic task as it is nec- on distinct quantitative measures proposed by Heipke et al.
essary to remove the effect of shading of vegetation and (1997) such as completeness ðEcomp Þ, correctness ðEcorr Þ
buildings which can generate a broken road network. The and quality ðEQual Þ described as follows:
framework proposed in this manuscript attempts to delin- TP
eate road network from such type of terrain, and experi- Ecomp ¼ : ð10Þ
TP þ FN
mental results are shown in Fig. 4.
The desired value of Ecomp is equal to 1, signifying that
Test Case 4 100% roads are extracted.
TP
Ecorr ¼ : ð11Þ
In this case, all images are of aerial images that contain a TP þ FP
semi-urban area of the city and the road network area is
The desired value of Ecorr is equal to 1, indicating that
surrounded by shadows of trees that are creating obstacles
extracted roads are 100% actual roads.
in road extraction. In these types of images, a path to
nearby houses along the road also exists that can be clas- TP
EQual ¼ : ð12Þ
sified as the road network. The results of the proposed TP þ FP þ FN
method on aerial images are shown in Fig. 5, the reference The desired value of EQual equal to 1 means extracted
road network is shown in red in Fig. 5a1–c1, Fig. 5a3–c3 roads are 100% correct and complete, where true positive
shows the delineated road network by proposed frame- (TP) is the road pixels present in both reference road net-
work, and centerline of road network is shown in Fig. 5a4– work and road network extracted by the proposed method,
c4. true negative (TN) are the pixels absent in both reference
and extracted road network. False positive (FP) is the
pixels that are absent in reference road data but present in
the extracted road network, and false negative (FN) is in
123
Journal of the Indian Society of Remote Sensing
123
Journal of the Indian Society of Remote Sensing
Table 2 Quantitative analysis developed urban images (test case 2), the proposed method
Test cases Ecomp Ecorr EQual
attains higher Ecorr values while the method of Liu et al.
(2016b) attains the higher EQual values. Images in test case
Test case 1 3 have a huge area of thick forestry and grassland along
(Semi-urban) with building like structures that makes the task of road
Figure 2 delineation more tedious. The proposed method attains
Image I 96.05 96.5 92.82 higher values as compared to other methods in all three
Test case 2 parameters as shown in Table 3. In the case of aerial
(Developed urban) images of Data 2#, quantitative experimental results of
Figure 3 three images are shown in Table 2 and the results of two
Image II 76.23 91.99 71.48 images are compared with Huang and Zhang (2009), Miao
Image III 87.16 92.55 81.45 et al. (2013) and Miao et al. (2014a). These images are of a
Image IV 85.97 88.99 77.07 developed urban area with a large vegetative area along
Test case 3 with a road network. The comparative analysis of the
(Developed urban with grassland) proposed method with the above-mentioned methods is
Figure 4 summarized in Table 4. In the case of image 1, the pro-
Image V 87.39 74.9 67.6 posed method obtained the second best performance in
Image VI 94.17 63.83 61.41 Ecorr and EQual values and in the case of image 2, the
Test case 4 proposed method achieves the second best performance in
Aerial image EQual values. The results of proposed framework are best in
(Developed urban) the case of images semi-urban area but in the case of
Figure 5 developed urban area images, its performance is degraded
Image I 79.96 89.94 73.4 but still better than its counterparts. This is due to fact that
Image II 76.25 95.6 73.67 fully developed urban area has objects like buildings,
Image III 80.15 85.15 70.59
parking area, houses and some fragmented roads that have
the same spectral features like road along with the presence
of vegetative area. During the road extraction process,
these objects have been erroneously classified as road
networks. This results in an increase in FP pixels that
Comparative Analysis further degrade Ecomp and EQual parameters.
Table 3 Analysis of
Authors Image I (test case 1 Image II (test case 2) Image V (test case 3)
experimental results with the
existing methods on Data 1# Ecomp Ecorr EQual Ecomp Ecorr EQual Ecomp Ecorr EQual
each image of size 512 9 512
Liu et al. (2016b) 89.4 73.1 67.3 92.1 76.2 71.6 62.3 64.5 46.4
Miao et al. (2014a) 90.4 72.8 67.6 88.3 79.8 72.2 55.4 68.3 44.1
Chaudhuri et al. (2012) 88.7 70.5 64.6 91.6 63.0 66.2 61.7 62.4 31.0
Singh and Garg (2013) 95.32 96.52 92.15
Proposed method 96.4 96.50 92.82 76.23 91.99 71.48 87.39 74.9 67.6
123
Journal of the Indian Society of Remote Sensing
Table 4 Analysis of
Authors Image 1 (test case 4) (861 9 700) Image III (test case 4) (855 9 747)
experimental results with the
existing methods on Data 2# Ecomp Ecorr EQual Ecomp Ecorr EQual
Huang and Zhang (2009) 80.4 77.3 65.1 87.2 78.5 70.4
Miao et al. (2013) 68.1 85.7 61.2 68.1 90.2 63.4
Miao et al. (2014a) 92.2 97.7 90.2 85.7 97.2 83.6
Proposed method 79.96 89.94 73.4 80.15 85.15 70.59
123
Journal of the Indian Society of Remote Sensing
Jeyapalan, K., & Bhagawati, D. (2000). As built surveys of road side Movaghati, S., Moghaddamjoo, A., & Tavakoli, A. (2010). Road
features for gis, visualization, and virtual reality. International extraction from satellite images using particle filtering and
Archives of Photogrammetry and Remote Sensing, XXXIII(B5/1; extended Kalman filtering. IEEE Transactions on Geoscience
PART 5), 406–413. and Remote Sensing, 48(7), 2807–2817.
Lehtomäki, M., et al. (2010). Detection of vertical pole-like objects in Sghaier, M. O., & Lepage, R. (2016). Road extraction from very high
a road environment using vehicle-based laser scanning data. resolution remote sensing optical images based on texture
Remote Sensing, 2(3), 641–664. analysis and beamlet transform. IEEE Journal of Selected Topics
Liu, R., Miao, Q., et al. (2016a). ‘Improved road centerlines in Applied Earth Observations and Remote Sensing, 9(5),
extraction in high-resolution remote sensing images using shear 1946–1958.
transform, directional morphological filtering and enhanced Singh, P. P., & Garg, R. D. (2013). ‘Automatic road extraction from
broken lines connection. Journal of Visual Communication and high resolution satellite image using adaptive global threshold-
Image Representation, 40, 300–311. ing and morphological operations. Journal of the Indian Society
Liu, R., Song, J., et al. (2016b). Road centerlines extraction from high of Remote Sensing, 41(3), 631–640.
resolution images based on an improved directional segmenta- Song, M., & Civco, D. (2004). ‘Road extraction using SVM and
tion and road probability. Neurocomputing, 212, 88–95. image segmentation. Photogrammetric Engineering & Remote
Lu, B., Ku, Y., & Wang, H. (2009). Automatic road extraction Sensing, 70(12), 1365–1371.
method based on level set and shape analysis. In 2009 2nd Sujatha, C., & Selvathi, D. (2015). ‘Connected component-based
international conference on intelligent computation technology technique for automatic extraction of road centerline in high
and automation (pp. 511–514). resolution satellite images. EURASIP Journal on Image and
Mena, J. B. (2003). State of the art on automatic road extraction for Video Processing. https://doi.org/10.1186/s13640-015-0062-9.
GIS update: a novel classification. Pattern Recognition Letters, Türetken, E., Benmansour, F., & Fua, P. (2012). Automated
24(16), 3037–3058. reconstruction of tree structures using path classifiers and mixed
Mena, J. B., & Malpica, J. A. (2005). An automatic method for road integer programming. In 2012 IEEE conference on computer
extraction in rural and semi-urban areas starting from high vision and pattern recognition (pp. 566–573).
resolution satellite imagery. Pattern Recognition Letters, 26(9), Vplab. Available at: http://www.cse.iitm.ac.in/*vplab/satellite.html.
1201–1220. Accessed Mar 2019.
Miao, Z., et al. (2013). Road centerline extraction from high- Wang, J., et al. (2015). Road network extraction: A neural-dynamic
resolution imagery based on shape features and multivariate framework based on deep learning and a finite state machine.
adaptive regression splines. IEEE Geoscience and Remote International Journal of Remote Sensing, 36(12), 3144–3169.
Sensing Letters, 10(3), 583–587. https://doi.org/10.1109/LGRS. Wang, W., et al. (2016). ScienceDirect A review of road extraction
2012.2214761. from remote sensing images. Journal of Traffic and Transporta-
Miao, Z., et al. (2014a). An integrated method for urban main-road tion Engineering (English Edition), 3(3), 271–282. https://doi.
centerline extraction from optical remotely sensed imagery. org/10.1016/j.jtte.2016.05.005.
IEEE Transactions on Geoscience and Remote Sensing, 52(6), Wei, Y., Wang, Z., & Xu, M. (2017). Road structure refined CNN for
1–14. road extraction in aerial image. IEEE Geoscience and Remote
Miao, Z., et al. (2014b). A semi-automatic method for road centerline Sensing Letters, 14(5), 709–713.
extraction from VHR images. IEEE Geoscience and Remote
Sensing Letters, 11(11), 1856–1860.
Miao, Z., et al. (2015). An object-based method for road network Publisher’s Note Springer Nature remains neutral with regard to
extraction in VHR satellite images. IEEE Journal of Selected jurisdictional claims in published maps and institutional affiliations.
Topics in Applied Earth Observations and Remote Sensing,
8(10), 4853–4862. https://doi.org/10.1109/JSTARS.2015.
2443552.
123