Você está na página 1de 4

2010 Second IITA International Conference on Geoscience and Remote Sensing

Studies on Application of Cloud Computing Techniques in GIS

YANG Jinnan, WU Sheng*


Spatial Information Research Center of Fujian Province,
Fuzhou University, Key Lab of Spatial Data Mining and Information Sharing, Ministry of Education,
Fuzhou, China
*correspond to e-mail: ws0110@163.com

Abstract—Cloud computing is a new internet-based computing) are not new. However, the core feature which
supercomputing principle and practice, which supplies differentiates cloud computing from the other computing
dynamic, scalable, and pay-per-use services and has huge models is its nature of ‘service driven’ [9]. Currently
computing and storage capacity with high reliability and Google, Amazon, Yahoo, Microsoft, Oracle, IBM, Dell,
security. This paper first summarized the concept, features SUN and other world famous IT companies are actively
and the development of the cloud computing, and then researching and deploying their cloud computing and have
discussed the application of cloud computing techniques in started to provide some commercial services.
spatial analysis, decision support and the storage of spatial
data, and the impact on software development and product III. STUDYS ON APPLICATION OF CLOUD COMPUTING
patterns of GIS. Finally, we proposed several issues about IN GIS
cloud security, how to monitor cloud services and to develop
cloud specifications in using cloud computing techniques. Geographic Information System includes a variety of
advanced features in information processing, and its basic
Keywords-GIS; cloud computing; spatial data storage; functionality is data collection, management, processing,
cloud security; analysis and output. Relying on these basic functions, GIS
implements a variety of applications through the use of
I. INTRODUCTION spatial analysis, modeling, network technology, database
and data integration technology, and further development
Cloud computing is associated with a new paradigm environment to meet the broad needs of users[10]. The
for the provision of computing infrastructure. This emergence of cloud computing, no doubt brings new
paradigm shifts the location of infrastructure from desktop opportunities for GIS.
to the network to reduce the costs in management of
hardware and software resources [1]. Cloud computing is A. Enhance the Storage Capacity and Security of Spatial
evolution of grid computing, and this evolution is the Data, Provide Strong Support of Mass Data Storage
development of grid computing, web 2.0 and particularly for GIS
the virtualization technology. There are many different types of spatial data: attribute
Geographic Information System (GIS) aims at helping data, geometric data, relational data that expresses the
people to present nature changes, discover the law of social mutual relationship of map elements, auxiliary data which
and economic development through analyzing massive facilitates the map processing. While the data changes
spatial and attributed data of the earth surface. The respectively with time, and the amount of data is very large,
emergence of cloud computing brings a new solution to so the management and maintenance of spatial data storage
massive data storage, data processing, spatial analysis. costs a lot of hardware, human and financial resources.
Thanks to cloud computing, the massive data that Cloud storage, which assembles different kinds of storage
knowledge acquisition and decision support information devices to work together by software with functionality of
needs can be scheduled and parallel processed entirely clustering, grid or distributed file systems, is a complex
within the cloud instead of being transferred on the system composed by various parts such as network device,
network. Data security has been well guaranteed for the storage equipments, servers, applications, public access
centralized storage and backup. Enterprises can not only interface, the access network, and client programs. The
rent data and hardware resources, but also can deploy their storage equipments are core, and other part provides
services in the cloud, providing unique data processing services of data storage and business access through the
services. application software.
For massive spatial data stored in the cloud, the cloud
II. CLOUD COMPUTING TECHNIQUES
platform provides services for data storage, management,
Cloud computing is a completely new Internet-based backup, maintenance by using its cluster and distributed
principle and practice of supercomputing, which provides file systems technology and builds the user interface of
scalable and on-demand services on computing, storage, application layer for authorized users to access through a
software and hardware with high reliability and safety to unified application program interface (API). Authorized
Internet users. In this section most of the available cloud users can not only rent cloud server to store their own data
definitions are gathered together as shown in Table 1[2], on demand, but also can paid use the data of other
especially those proposed by cloud computing experts, providers stored in the cloud, which greatly reduces the
which is marked in reference [5]. costs of upgrades and maintenance of hardware and
Though cloud computing is a new form of web service, software.
some of the existing technologies the cloud concept draws
on (such as virtualization, utility computing or distributed

978-1-4244-8515-4/10/$26.00 ©2010 IEEE GRS2010


492
Take Hadoop technology of Apache open source for intense applications. And it stores data distributed in each
example, with the support of Hadoop Distributed File node, when computing, each node reads and processes
System (HDFS), data is stored in each node, and each node their own storing data, showing its advantage in dealing
reads its own data to process, thus avoiding the large with massive data. Therefore, once applying high
amount of data transmission on net, which realized shifting computing capacity of cloud computing in overlay analysis,
calculation to storage. HDFS has a master/slave buffer analysis, network analysis, statistical analysis,
architecture showed in Fig. 1. An HDFS cluster consists of terrain analysis and other analysis algorithm, the data
a single Namenode, a master server that manages the file processing capacity will be significantly enhanced, and the
system namespace and regulates access to files by clients. knowledge mining and information acquisition for making
In addition, there are a number of Datanodes, usually one decision will be more quickly and efficiently.
per node in the cluster, which manage storage attached to
the nodes that they run on. Inside the Datanode, a file is C. Have a Great Impact on Software Development and
divided into multiple blocks [11]. Product Models of GIS
The rise of cloud computing, changes the traditional
business model of IT enterprises from selling the entity
software and hardware products into the appropriate
software and hardware services. And through cloud
computing, more applications can work in the form of
internet service. Google, one of the cloud computing
pioneers, claimed that in future almost all software can be
loaded onto the internet and be replaced by services.
Using model of cloud computing, the advantage of
sharing their services will reduce project costs, reduce
duplication and waste of resources invested, thus GIS
applications can be quickly build and deploy on it.
GIS enterprises only need hire on-demand
infrastructure, services, software and hardware resources
Figure 1. HDFS architecture offered by cloud service providers to develop GIS software
products, to deploy its services layer on the cloud and to
HDFS is designed to reliably store very large files supply services by means of the unique API of application
across machines in a large cluster. It stores each file as a layer, which makes the functions of geographic
sequence of blocks; all blocks in a file except the last block information browsing, querying, analyzing and interacting
are the same size. The blocks of a file are replicated for a reality, and at the same time meets the high concurrent
fault tolerance. The block size and replication factor are user requests. Thus enterprises do not need to build their
configurable per file. The Namenode makes all decisions own data centers, large computer room and sever centers
regarding replication of blocks. It periodically receives a to save cost. The traditional project-centered or product-
Heartbeat and a Blockreport from each of the Datanodes in centered development approach will transit to the
the cluster. Receipt of a Heartbeat implies that the platform-centric way of cloud computing.
Datanode is functioning properly. A Blockreport contains April 27, 2010, SuperMap software Co. Ltd
a list of all blocks on a Datanode. Fig. 2 shows block cooperating Red Hat released GIS platform solution-
replication. SuperMap SGS based on the open source architecture of
cloud computing. The platform serves for digital city in the
mapping-related GIS applications. The full name for the
SuperMap SGS is SuperMap digital city geographic
information platform software, it is a public software
platform based on the SuperMap iServer Java 2008 that
specially targets to our country’s mapping units and the
information service centers where the regional geo-spatial
framework is being built and needs the platform to provide
services. The platform includes the “Data Management
Subsystem”, “Information Services Subsystem”,
“Auxiliary Application Subsystem” and “Operation
Management Subsystem” to provide two-dimensional and
Figure 2. Block replication three-dimensional map services and to support geographic
information browsing, querying, analyzing and interacting
functions.
B. Enhance the Efficient Processing Capabilities of
Through KVM virtualization technologies and
Spatial Analysis and Facilitate Decision-Making
visualization management techniques, Red Hat RHEV
Support by Combining Cloud Computing and Spatial virtualization platform has improved the ability of
Analysis SuperMap SGS in five aspects[12]: support rapid
As mentioned in the previous section, cloud computing deployment of SuperMap SGS product and applications
avoids the defects caused by bad bandwidth to transfer and support business agility; make the dynamic monitoring
data from storage nodes to processing nodes in data- and management of software and hardware a reality; meet

493
the high concurrent requests of applications and provide B. Supervision Issue
excellent service performance; support the separation and Supervision issue is also one big problem that cloud
automatic backup and restoring of applications; computing should face. If a private cloud platform is built
intelligently release server resources to save energy. for geographic information system industry, then who
SuperMap SGS system architecture is shown in Fig. 3. would be responsible for monitoring the cloud providers in
order to prevent leakage of data and services? Whether or
not to introduce the third-party organizations? What role
should government departments act as and how to use the
two methods—law and moral—in the process of
monitoring of cloud provider? These fundamental
problems need to be worked out.
C. Regulations Establishing
Now cloud computing is still in its early stage of
development, there is still a long way for it to get widely
accepted and applied. Establishing regulations about
technology, service customization, price and other aspects
of cloud computing will be also of great concern. At the
Figure 3. SuperMap SGS architecture same time people also look forward to applying these
norms and policies in the private cloud of GIS industry.
IV. ISSUES IN THE APPLICATION OF CLOUD V. CONCLUSION AND FURTURE RESEARCH
COMPUTING TECHNOLOGIES IN GIS This paper discussed cloud computing applications in
As an emerging technology with broad prospects, the GIS field, first introduced the concept, characteristics
cloud computing receives great regard, but at the same and research of the basic profiles of cloud computing;
time it also needs to face and solve the problems in actual Then the paper focused on how to apply the cloud
operation, such as not transparent to users, regulatory computing techniques in the GIS spatial data storage,
measures. However, with the further development of cloud spatial analysis and decision support, as well as its impact
computing technology and the deepening of cloud concept, on GIS software development and product patterns; At last
this application platform will achieve its large-scale we proposed several issues in the process of using cloud:
popularity in the future and turn the information life upside data security and confidentiality, supervision of cloud
down. While the new technique offering great prospects service providers and the development of norms. There are
for development of GIS, we should also consider carefully still problems that needed to be improved in this article: (1)
any problem that may come along during combining Lack of the examples about cloud techniques applying in
technologies of cloud computing with GIS industry. GIS; (2) Because of the limited knowledge of author,
problems that emerging in applying the cloud technologies
A. Spatial Data Security and Cconfidentiality
to GIS may be considered incompletely. It is very
In cloud computing, cloud providers will be necessary to make sure that as mature as the techniques of
responsible for the security of data transmission. Once the cloud computing develop, the research on the application
spatial data is stored in the cloud, GIS users will have no of cloud computing techniques in the field of GIS will
idea in what place or even in which country the data is become a hot spot.
stored, its exact location is transparent to users. For
including the location attribute, the spatial data must be ACKNOWLEDGMENT
kept strictly confidential in several areas such as military, This work is supported by the National Key
thus if the data is delivered to cloud service providers, the Technology R&D Program of CHINA (Grant No.
third party, besides the GIS users and developers, data 2007BAH16B00).
security and confidentiality will be an extreme test. So,
how to ensure the GIS industry have trust in the cloud
providers is one of the most important issues before using REFERENCES
cloud techniques in GIS industry. [1] Greg Boss, Padma Malladi, and Dennis Quan, Cloud
Given the sensitivity of geospatial data and the current Computing, 2007, 4-5.
cloud providers are foreign companies, the author [2] Luis M.Vaquero, Luis Rodero-Merino, and Juan Caceres,
recommends that in our country a private cloud center “A Break in the Clouds: Towards a Cloud Definition,”
especially for geographic information industry should be ACM SIGCOMM Computer Communication Review.
established so all of the GIS applications could be based Volume 39, Number 1, January 2009.
on this platform, while spatial data and the access to the [3] Roy Bragg, “Cloud computing: When computers really
storage resources being controlled entirely by the GIS rule,” Tech News World, July 2008. Electronic Magazine,
available at http://www.technewsworld.com/story/63954.h-
industry itself, thus avoiding various risks. In this way, the tml.
whole operating costs of the entire industry will be [4] Rajkumar Buyya, Chee Shin Yeo, and Srikumar Venugopa,
reducing effectively and at the same time a new industry “Market-oriented cloud computing: Vision, hype, and
chain will form. reality for delivering it services as computing utilities,”
CoRR, (abs/0808.3558), 2008.
[5] Jeremy Geelan, “Twenty one experts define cloud
computing,” Virtualization, August 2008. Electronic

494
Magazine, article available at http://virtualization.sys- [9] Ji Hu, Andreas Klein, “A Benchmark of Transparent Data
con.com/node/612375. Encryption for Migration of Web Applications in the
[6] Galen Gruman, Eric Knorr, “What cloud computing really Clouds,” In the 2009 Eighth IEEE International
means,” InfoWorld, April 2008. Electronic Magazine, Conferrence on Dependable, Autonomic and Secure
available at http://www.infoworld.com/article/08/04/07/15- Computing, 2009, pp. 735-740.
FE-cloud-computing-reality_1.html. [10] Hu Peng, Geographic Information System Course. Wuhan:
[7] E.Hand, “ Head in the clouds,” Nature, (449): 963, Oct. Wuhan University publication. 2002.11.
2007. [11] HDFS Architecture [EB/OL]. [2008-12-10], available at
[8] Paul McFedries, “The cloud is the computer,” IEEE http://hadoop.apache.org/core/docs/current/hdfs_design.htm
Spectrum Online, August 2008. Electronic Magazine, l.
available at http://www.spectrum.ieee.org/aug08/6490. [12] http://sixth.blog.51cto.com/426044/310395.

TABLE I. CLOUD DEFINITIONS

Author/Reference Year Definition/Excerpt


you can scale your infrastructure on demand within minutes or even seconds, instead of days or weeks, thereby
M.klems[5] 2008
avoiding under utilization (idle servers)and over-utilization(blue screen) of in-house resources...
using the internet to allow people to access technology-enabled services. Those services must be massively
P.Gaw[5] 2008
scalable ...
A Cloud is a type of parallel and distributed system consisting of a collection of interconnected and virtualized
R.Buyya[4] 2008 computers that are dynamically provisioned and presented as one or more unified computing resources based on
service-level agreements established through negotiation between the service provider and consumers.
Cloud Computing is one of those catch all buzz words that tries to encompass a variety of aspects ranging from
deployment, load balancing, provisioning, business model and architecture (like Web2.0).It's the next logical step
R.Cohen[5] 2008
in software(software 10.0).For me the simplest explanation for Cloud Computing is describing it as, internet
centric software ...
a broad array of web-based services aimed at allowing users to obtain a wide range of functional capabilities on a
'pay-as-you-go' basis that previously required tremendous hardware/software investments and professional skills
J.Kaplan[5] 2008
to acquire. Cloud Computing is the realization of the earlier ideals of utility computing without the technical
complexities or complicated deployment worries...
D.Gourlay[5] 2008 ...the next hype-term...building off of the software models that virtualization enabled
D.Edwards[5] 2008 ...what is possible when you leverage web-scale infrastructure (application and physical) in an on-demand way
...There really are only three types of services that are Cloud based: SaaS ,PaaS, and Cloud Computing Platforms.
B.de Haff[5] 2008
I am not sure being massively scalable is a requirement to fit into any one category.
…Put simply Cloud Computing is the infrastructural paradigm shift that enables the ascension of SaaS. …It is a
broad array of web-based services aimed at allowing users to obtain a wide range of functional capabilities on a
B.Kepes[5] 2008
pay-as-you -go basis that previously required tremendous hardware/software investments and professional skills
to acquire.
Clouds focused on making the hardware layer consumable as on-demand compute and storage capacity. This is
an important first step, but for companies to harness the power of the Cloud, complete application infrastructure
K.Sheynkman[5] 2008
needs to be easily configured, deployed, dynamically-scaled and managed in these virtualized hardware
environments.
…In a fully implemented Data Center 3.0 environment, you can decide if an app is run locally (cook at home), in
someone else’s data center (take-out) and you can change your mind on the fly in case you are short on data
O.Sultan[5] 2008
center resources (pantry is empty) or you having environmental/facilities issues (too hot to cook).In fact, with
automation, a lot of this can be done with policy and real-time triggers...
…really is accessing resources and services needed to perform functions with dynamically changing needs… is a
K.Hartig[5] 2008
virtualization of resources that maintains and manages itself…
J.Pritzker[5] 2008 Clouds are vast resource pools with on-demand resource allocation… virtualized…and priced like utilities
T.Doerksen[5] 2008 Cloud computing is…the user-friendly version of Grid computing
T.von Eicken[5] 2008 outsourced, pay-as-you-go, on-demand, somewhere in the internet, etc
…'Cloud Pyramid' to help differentiate the various Cloud offerings out there… Top: Saas; Middle: PaaS; Bottom:
M.Sheedan[5] 2008
IaaS
…Cloud Computing projects are more powerful and crash-proof than Grid systems developed even in recent
A.Ricadela[5] 2008
years
…the key thing we want to virtualize or hide from the user is complexity…all that software will be virtualized of
I.Wladawshy
2008 hidden from us and taken care of by systems and/or professionals that are somewhere else - out there in The
Berger[5]
Cloud
Cloud computing encompasses any subscription-based or pay-per-use services that, in real time over the Internet,
B.Martin[5] 2008
extends IT's existing capabilities
The key concept behind the Cloud is Web application…a more developed and reliable Cloud. Many find it's now
R.Bragg[3] 2008 cheaper to migrate to the Web Cloud than invest in their own server farm… it is a desktop for people without a
computer
G.Gruman and Cloud is all about: SaaS… utility computing… Web Services… PaaS… Internet integration… commerce
2008
E.Knorr[6] platforms…
Cloud Computing, in which not just our data but even our software resides within the Cloud, and we access
everything not only through our PCs but also Cloud- friendly devices, such as smart phones, PDAs… the mega-
P.McFedries[8] [7] 2008
computer enabled by virtualization and software as a service...This is utility computing powered by massive
utility data centers.

495

Você também pode gostar