Cloud Introduction
Cloud Computing
Application Example
What is a Cloud?
Cloud Models
Public Cloud
Makes systems and services accessible to the general public
Open architecture (e.g., e-mail)
May be less secure because of its openness
Private Cloud
Makes systems and services accessible within an organization
More secure because of its private nature
Community Cloud
Cloud accessible by a group of organizations
Hybrid Cloud
Hybrid Cloud = Public Cloud + Private Cloud
Private cloud supports critical activities
Public cloud supports non-critical activities
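The hybrid split above can be sketched as a simple placement rule. This is a minimal illustration only; the `Workload` type, the `place` function, and the example workload names are hypothetical and do not come from any cloud provider's API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Workload:
    """Hypothetical description of an activity to be hosted."""
    name: str
    critical: bool


def place(workload: Workload) -> str:
    """Critical activities go to the private cloud, the rest to the public cloud."""
    return "private" if workload.critical else "public"


# Example workloads (names are made up for illustration).
workloads = [
    Workload("payroll", critical=True),
    Workload("marketing-site", critical=False),
]
placement = {w.name: place(w) for w in workloads}
print(placement)
```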
Cloud Benefits
Characteristics
References
K. Kumar and Y. H. Lu, "Cloud Computing for Mobile Users: Can Offloading Computation Save Energy?," Computer, vol. 43, no. 4, pp. 51-56, Apr. 2010.
Wikipedia, http://www.wikipedia.org
Apple, iCloud, https://www.icloud.com
Google, Google Cloud, https://cloud.google.com/products [Accessed June 1, 2015]
Virtualization, Cisco's IaaS cloud, http://www.virtualization.co.kr/data/file/01_2/1889266503_6f489654_1.jpg [Accessed June 1, 2015]
Tutorialspoint, Cloud computing, http://www.tutorialspoint.com/cloud_computing/cloud_computing_tutorial.pdf [Accessed June 1, 2015]
Image sources
AWS Simple Icons Storage Amazon S3 Bucket with Objects, By Amazon Web Services LLC [CC BY-SA 3.0 (http://creativecommons.org/licenses/by-sa/3.0)], via Wikimedia Commons
iCloud Logo, By EEIM (Own work) [Public domain], via Wikimedia Commons
MobileMe Logo, By Apple Inc. [Public domain], via Wikimedia Commons
IaaS
IaaS Procedures
IaaS Benefits
Flexible and Efficient Renting of Computer & Server Hardware
Rentable Resources
VM, Storage, Bandwidth, IP Addresses, Monitoring Services, Firewalls, etc.
Rent Payment Basis
Resource type
Usage time
Service packages
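A rental bill based on resource type and usage time can be sketched as follows. The rates in `HOURLY_RATES` and the `rental_cost` helper are invented for illustration and do not reflect any provider's pricing; service packages are not modeled.

```python
# Hypothetical price per unit per hour for each rentable resource type.
HOURLY_RATES = {
    "vm": 0.05,
    "storage_gb": 0.0001,
    "ip_address": 0.005,
}


def rental_cost(usage):
    """Sum rate * quantity * hours over (resource_type, quantity, hours) items."""
    return sum(HOURLY_RATES[rtype] * quantity * hours
               for rtype, quantity, hours in usage)


# Two VMs and one IP address rented for 100 hours each.
bill = rental_cost([("vm", 2, 100), ("ip_address", 1, 100)])
print(f"total: {bill:.2f}")
```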
Portability & Interoperability with Legacy Applications
Enables portability based on infrastructure resources that are used through Internet connections
Enables a method to maintain interoperability with legacy applications and workloads between IaaS clouds
PaaS (Platform as a Service)
Provides development & deployment tools for application development
Provides runtime environment for apps
PaaS Types
Application Delivery-Only Environment
Provides on-demand scaling & application security
Stand-Alone Development Environment
Provides an independent platform for a specific function
Open Platform as a Service
Provides open source software to run applications for
PaaS providers
Add-On Development Facilities
Enables customization to the existing SaaS platforms
PaaS Benefits
Scalable Solutions
Resources are scaled automatically based on the application's resource demand
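Demand-based scaling can be sketched as a rule that converts the current load into an instance count. This is a minimal sketch, not how any particular PaaS decides; `desired_instances`, its parameters, and the bounds are assumptions chosen for illustration.

```python
import math


def desired_instances(total_load: float, capacity_per_instance: float,
                      min_instances: int = 1, max_instances: int = 10) -> int:
    """Return how many instances are needed to serve total_load.

    The result is clamped to [min_instances, max_instances] so the
    platform never scales to zero or beyond a configured ceiling.
    """
    needed = math.ceil(total_load / capacity_per_instance)
    return max(min_instances, min(max_instances, needed))


# 250 requests/s with 100 requests/s per instance needs 3 instances.
print(desired_instances(250, 100))
```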
SaaS
Characteristics
On Demand Availability
Cloud software is available anywhere that the
cloud is reachable via Internet
Easy Maintenance
No user software upgrade or maintenance needed
All supported by the cloud
Flexible Scale Up or Scale Down
Centralized Management & Data
Enables a Shared Data Model
Multiple users can share a single
data model and database
Cost Effectiveness
Pay based on usage
No risk in buying the wrong software
Multitenant Programming Solutions
All programmers are guaranteed to use the same software version
No version mismatch problems
Open SaaS Applications
Cloud Services
Google Cloud
Google App Engine
Released as a preview in April 2008
PaaS (Platform as a Service) for web applications
Provides automatic scaling based on resource
demands and server load
Google BigQuery
Released in April 2012
Data analysis tool that uses SQL-like queries to
process big datasets in seconds
Google Cloud Endpoints
Released in November 2013
Tool to create services inside App Engine
Easily connects from Android, iOS, and JavaScript
clients
Unlimited Photos
Unlimited storage for photos & raw data files
5 gigabytes of video storage
Unlimited Everything
Unlimited storage for photos, videos, documents, and various file types
iCloud
Operating system
OS X (10.7 Lion or later)
Microsoft Windows 7 or later
iOS 5 or later
iCloud Features
Email, Contacts, and Calendars
Find My Friends
Backup & Restore
Backup feature for device settings & data
iOS 5 or later required
Find My iPhone
Enables a user to track the location of an iOS device or
Mac
Formerly a feature of MobileMe
Photo Stream
Can store most recent 1,000 photos
Free storage for up to 30 days
iCloud Drive
Can save photos, videos, documents, and apps
iCloud Keychain
Secure database for website and Wi-Fi passwords
Secure credit card & debit card management for quick access and auto-fill
iTunes Match
Scans the iTunes music library and matches tracks
Serves tracks copied from CDs or other sources
Big Data
Wal-Mart
Amazon.com
Amazon.com's Recommendation System
Item-to-Item Collaborative Filtering Algorithm
Personalization of the Online Store
Customized to each customer
Each customer's store is based on the customer's personal interests
Example: For a new mother, the store will display baby supplies and toys
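The idea of item-to-item collaborative filtering can be sketched on a toy purchase history. This is a simplified illustration, not Amazon.com's production system: the customers, items, and the helper names `item_similarities` and `recommend` are made up, and similarity is taken as the cosine between binary "who bought this item" vectors.

```python
from collections import defaultdict
from math import sqrt

# Toy purchase history: customer -> set of purchased items (invented data).
purchases = {
    "alice": {"diapers", "baby_toy"},
    "bob": {"diapers", "baby_toy", "stroller"},
    "carol": {"laptop", "mouse"},
    "dave": {"diapers", "stroller"},
}


def item_similarities(purchases):
    """Cosine similarity between items, based on the customers who bought them."""
    buyers = defaultdict(set)
    for customer, items in purchases.items():
        for item in items:
            buyers[item].add(customer)
    sims = defaultdict(dict)
    for a in buyers:
        for b in buyers:
            if a != b:
                common = len(buyers[a] & buyers[b])
                if common:
                    sims[a][b] = common / sqrt(len(buyers[a]) * len(buyers[b]))
    return sims


def recommend(customer, purchases, sims, top_n=3):
    """Rank items similar to what the customer already owns."""
    owned = purchases[customer]
    scores = defaultdict(float)
    for item in owned:
        for other, similarity in sims.get(item, {}).items():
            if other not in owned:
                scores[other] += similarity
    # Highest score first; ties broken alphabetically for determinism.
    return sorted(scores, key=lambda i: (-scores[i], i))[:top_n]


sims = item_similarities(purchases)
print(recommend("alice", purchases, sims))
```

Because similarities are computed between items rather than between customers, the item table can be built offline and a customer's store page only needs a lookup over the items that customer already has.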
Citibank
Bank operations in 100 countries
Big Data analysis on the database of basic financial transactions can enable global insight on investments, market changes, trade patterns, and economic conditions
Many companies (e.g., Zara, H&M, etc.) work with
Citibank to locate new stores and factories
References
V. Mayer-Schönberger and K. Cukier, Big Data: A Revolution That Will Transform How We Live, Work, and Think. Houghton Mifflin Harcourt, 2013.
T. White, Hadoop: The Definitive Guide. O'Reilly Media, 2012.
J. Venner, Pro Hadoop. Apress, 2009.
S. LaValle, E. Lesser, R. Shockley, M. S. Hopkins, and N. Kruschwitz, "Big Data, Analytics and the Path From Insights to Value," MIT Sloan Management Review, vol. 52, no. 2, Winter 2011.
R. E. Bryant, R. H. Katz, and E. D. Lazowska, "Big-data Computing: Creating revolutionary breakthroughs in commerce, science and society," Computing Community Consortium, pp. 1-15, Dec. 2008.
G. Linden, B. Smith, and J. York, "Amazon.com Recommendations: Item-to-Item Collaborative Filtering," IEEE Internet Computing, vol. 7, no. 1, pp. 76-80, Jan/Feb. 2003.
J. R. Galbraith, "Organizational Design Challenges Resulting From Big Data," Journal of Organization Design, vol. 3, no. 1, pp. 2-13, Apr. 2014.
S. Sagiroglu and D. Sinanc, "Big data: A review," Proc. IEEE International Conference on Collaboration Technologies and Systems, pp. 42-47, May 2013.
M. Chen, S. Mao, and Y. Liu, "Big Data: A Survey," Mobile Networks and Applications, vol. 19, no. 2, pp. 171-209, Jan. 2014.
X. Wu, X. Zhu, G. Q. Wu, and W. Ding, "Data Mining with Big Data," IEEE Transactions on Knowledge and Data Engineering, vol. 26, no. 1, pp. 97-107, Jan. 2014.
Z. Zheng, J. Zhu, and M. R. Lyu, "Service-Generated Big Data and Big Data-as-a-Service: An Overview," Proc. IEEE International Congress on Big Data, pp. 403-410, Jun/Jul. 2013.
I. Palit and C. K. Reddy, "Scalable and Parallel Boosting with MapReduce," IEEE Transactions on Knowledge and Data Engineering, vol. 24, no. 10, pp. 1904-1916, 2012.
M.-Y. Choi, E.-A. Cho, D.-H. Park, C.-J. Moon, and D.-K. Baik, "A Database Synchronization Algorithm for Mobile Devices," IEEE Transactions on Consumer Electronics, vol. 56, no. 2, pp. 392-398, May 2010.
IBM, What is big data?, http://www.ibm.com/software/data/bigdata/what-is-big-data.html [Accessed June 1, 2015]
Hadoop Apache, http://hadoop.apache.org
Wikipedia, http://www.wikipedia.org
Image sources
Walmart Logo, By Walmart [Public domain], via Wikimedia Commons
Amazon Logo, By Balajimuthazhagan (Own work) [CC BY-SA 3.0 (http://creativecommons.org/licenses/by-sa/3.0)], via Wikimedia Commons
Big Data's 4 Vs
HADOOP
Challenge 2: Cost
Avoiding the loss of data or computed analysis results requires backup computers and memory, which improves reliability but is very expensive
HDFS
HDFS: Namenode
Maintains the filesystem tree and the metadata for all the
files and directories in the tree
HDFS: Datanodes
MapReduce
MapReduce Characteristics
MapReduce
MapReduce improves performance through optimal balancing of Seek and Transfer operations
Reduce Seek operations
Effectively use Transfer operations
MapReduce vs. RDBMS
Hadoop Release Series
Release 2.6.0 became available in Nov. 2014
MapReduce
Scaling Out
Jobs
A MapReduce job is a unit of work that needs to be executed
Data flow
Rack-Local Map Task
A node hosting the HDFS block replicas for a map task's input split could be running other map tasks
Job Scheduler will look for a free map slot on a node in the same rack as one of the blocks
[Figure: data center with racks, showing a map task and an HDFS block]
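The scheduler's fallback to a rack-local slot can be sketched as a preference order over free nodes. This is a toy model, not Hadoop's scheduler code: `pick_node`, the node and rack names, and the "data-local" and "off-rack" tiers around the rack-local case are assumptions added to make the example complete.

```python
# Hypothetical cluster layout: node -> rack.
RACK_OF = {"n1": "r1", "n2": "r1", "n3": "r2", "n4": "r2"}


def pick_node(block_replicas, free_nodes, rack_of):
    """Choose a node for a map task, preferring nodes close to the input block.

    block_replicas: nodes holding a replica of the task's input split.
    free_nodes: set of nodes with a free map slot.
    Returns (node, locality) where locality names the tier that matched.
    """
    # 1. A node that already stores the block (assumed best case).
    for node in block_replicas:
        if node in free_nodes:
            return node, "data-local"
    # 2. A free node in the same rack as one of the replicas.
    replica_racks = {rack_of[n] for n in block_replicas}
    for node in sorted(free_nodes):
        if rack_of[node] in replica_racks:
            return node, "rack-local"
    # 3. Any free node elsewhere (assumed last resort).
    if free_nodes:
        return min(free_nodes), "off-rack"
    return None, "none"


# n1 holds the block but is busy; n2 shares its rack and is free.
print(pick_node(["n1"], {"n2", "n3"}, RACK_OF))
```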
Map
Map task will write its output to the local disk
Map task output is not the final output; it is only the intermediate output
Reduce
Map task output is processed by Reduce Tasks to produce
the final output
Reduce Task output is stored in HDFS
For a completed job, the Map Task output can be
discarded
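The Map-to-Reduce data flow can be sketched in plain Python with a word count. This is an in-memory illustration only and does not use the Hadoop API; `map_task`, `shuffle`, and `reduce_task` are hypothetical names, and the lists stand in for local disk and HDFS.

```python
from collections import defaultdict


def map_task(split):
    """Emit (word, 1) for every word in the input split (intermediate output)."""
    return [(word, 1) for line in split for word in line.split()]


def shuffle(map_outputs):
    """Group intermediate values by key before they reach the reduce tasks."""
    grouped = defaultdict(list)
    for output in map_outputs:
        for key, value in output:
            grouped[key].append(value)
    return grouped


def reduce_task(key, values):
    """Produce the final (word, count) pair for one key."""
    return key, sum(values)


# Two input splits, each handled by its own map task.
splits = [["big data", "big cluster"], ["data flow"]]
intermediate = [map_task(s) for s in splits]
final = dict(reduce_task(k, v) for k, v in shuffle(intermediate).items())
print(final)
# Once `final` is stored, `intermediate` is no longer needed.
```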
Combiner Function
User-specified function that runs on the Map output
Its output forms the input to the Reduce function
Specifically designed to minimize the data transferred
between Map Tasks and Reduce Tasks
Solves the problem of limited network speed on the
cluster and helps to reduce the time in completing
MapReduce jobs
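The effect of a combiner on the amount of transferred data can be sketched with the same word-count idea. This is an in-memory illustration, not the Hadoop API; `wc_map` and `combine` are hypothetical names. Summation is used because it gives the same final result whether or not partial sums are taken first.

```python
from collections import Counter


def wc_map(split):
    """Emit (word, 1) for every word in the input split."""
    return [(word, 1) for line in split for word in line.split()]


def combine(pairs):
    """Pre-aggregate one map task's output before it crosses the network."""
    totals = Counter()
    for key, value in pairs:
        totals[key] += value
    return sorted(totals.items())


split = ["big data big cluster", "big data"]
raw = wc_map(split)          # what would be sent without a combiner
combined = combine(raw)      # what is sent with a combiner
print(len(raw), "records without combiner,", len(combined), "with combiner")
```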
Multiple Reducers
Shuffle
Zero Reducer
HDFS
HDFS Characteristics
Blocks
Files in HDFS are divided into block-size chunks
64 megabyte default block size
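Dividing a file into blocks can be sketched with simple arithmetic on byte offsets. This only illustrates the chunking; `split_into_blocks` is a hypothetical helper, not an HDFS call, and it uses the 64 megabyte block size stated above.

```python
MB = 1024 * 1024
BLOCK_SIZE = 64 * MB  # default block size stated in the slide


def split_into_blocks(file_size, block_size=BLOCK_SIZE):
    """Return (offset, length) pairs covering a file of file_size bytes.

    Every block is block_size long except possibly the last one,
    which only holds the remaining bytes.
    """
    blocks = []
    offset = 0
    while offset < file_size:
        length = min(block_size, file_size - offset)
        blocks.append((offset, length))
        offset += length
    return blocks


# A 200 MB file needs three full blocks plus one 8 MB block.
blocks = split_into_blocks(200 * MB)
print(len(blocks), "blocks, last block", blocks[-1][1] // MB, "MB")
```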
Namenode
Manages the filesystem namespace
Namenode keeps track of the datanodes that have
blocks of a distributed file assigned
Maintains the filesystem tree and the metadata for all
the files and directories in the tree
Stores this information on the local disk in 2 files
Namespace Image
Edit Log
Metadata
Metadata Types
Structural Metadata
Focuses on the data structure's design and
specification
Descriptive Metadata
Focuses on the individual instances of application
data or the data content
Datanodes
Client Access
Namenode Failure
Namenode keeps track of the datanodes that have blocks of a distributed file assigned
Without the namenode, the filesystem cannot be used
2. Secondary Namenode
Secondary namenode does not act the same way as the
namenode
Secondary namenode periodically merges the
namespace image with the edit log to prevent the edit log
from becoming too large
Secondary namenode usually runs on a separate
computer to perform the merge process because this
requires significant processing capability and memory
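The periodic merge can be sketched as replaying the edit log onto a copy of the namespace image. This is a heavily simplified model: the real image and edit log are on-disk formats, and here the namespace is just a set of paths, with `apply_edit`, `checkpoint`, and the `create` and `delete` operations chosen for illustration.

```python
def apply_edit(namespace, edit):
    """Apply one logged operation to the in-memory namespace (a set of paths)."""
    operation, path = edit
    if operation == "create":
        namespace.add(path)
    elif operation == "delete":
        namespace.discard(path)
    else:
        raise ValueError(f"unknown operation: {operation}")


def checkpoint(image, edit_log):
    """Merge the edit log into the image; return the new image and an empty log."""
    merged = set(image)          # work on a copy, as a separate machine would
    for edit in edit_log:
        apply_edit(merged, edit)
    return merged, []


image = {"/a"}
edit_log = [("create", "/b"), ("delete", "/a")]
new_image, new_log = checkpoint(image, edit_log)
print(sorted(new_image), new_log)
```

After the merge the log is empty again, which is what keeps it from growing without bound.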
HDFS Federation
HDFS HA (High-Availability)
HDFS Federation
Allows a cluster to scale by adding namenodes
Namespace volumes are all independent
Namenodes do not communicate with each other
Failure of a namenode is also independent of other namenodes
A namenode failure does not influence the availability of another namenode's namespace
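The independence of namespace volumes can be sketched with a toy cluster in which each namenode owns one part of the namespace. This is not the HDFS API: `Namenode`, `FederatedCluster`, and the mapping of path prefixes such as `/user` and `/share` to namenodes are assumptions made for illustration.

```python
class Namenode:
    """Toy namenode that is either alive or failed."""

    def __init__(self, name):
        self.name = name
        self.alive = True


class FederatedCluster:
    """Maps path prefixes (namespace volumes) to independent namenodes."""

    def __init__(self):
        self.mounts = {}

    def add_namenode(self, prefix, namenode):
        # Scaling out: a new namenode takes over a new part of the namespace.
        self.mounts[prefix] = namenode

    def resolve(self, path):
        matches = [p for p in self.mounts
                   if path == p or path.startswith(p.rstrip("/") + "/")]
        if not matches:
            raise KeyError(path)
        return self.mounts[max(matches, key=len)]

    def is_available(self, path):
        # Only the namenode owning this path matters; others are not consulted.
        return self.resolve(path).alive


cluster = FederatedCluster()
nn_user, nn_share = Namenode("nn1"), Namenode("nn2")
cluster.add_namenode("/user", nn_user)
cluster.add_namenode("/share", nn_share)

nn_user.alive = False  # simulate a failure of one namenode
print(cluster.is_available("/user/a"), cluster.is_available("/share/x"))
```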
HDFS High-Availability
Pair of namenodes (Primary & Standby) are set to be in
Active-Standby configuration
CDN (Content Delivery Network)
CDN Introduction
Table of Contents
CDN Motivation & Structure
CDN Procedures
CDN Motivation
CDN is a network constructed from a group of
strategically placed and geographically distributed
caching servers
Network traffic generated by mobile users (e.g., on smart devices) is rapidly increasing
CDN Structure
Using CDN, both content download time and network traffic are reduced
[Figure: content provider, caching server, and user. The caching server stores popular contents in advance. Routes shown: content request and delivery route with CDN; content request and delivery route without CDN]
When a user requests content from its nearest caching server, the server can deliver the content if the requested content is in its cache
CDN Procedures
When a user requests content from its nearest caching server, the server can deliver the content if the requested content is in its cache
If the requested content is not in the local server's cache, the content request is redirected to the remotely located content provider (CP)
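The two procedures above (cache hit and redirect to the CP) can be sketched with a small in-memory model. The classes `ContentProvider` and `CachingServer` are hypothetical, and storing the fetched content in the cache after a miss is an assumption of this sketch rather than a required behavior.

```python
class ContentProvider:
    """Remote origin that holds every content item."""

    def __init__(self, contents):
        self.contents = dict(contents)
        self.requests = 0  # counts how often the origin had to answer

    def fetch(self, name):
        self.requests += 1
        return self.contents[name]


class CachingServer:
    """Caching server located near the user."""

    def __init__(self, provider):
        self.provider = provider
        self.cache = {}

    def get(self, name):
        """Return (content, source) where source says who served the request."""
        if name in self.cache:
            return self.cache[name], "cache"
        # Cache miss: the request goes to the remotely located CP.
        data = self.provider.fetch(name)
        self.cache[name] = data  # assumption: keep a copy for later requests
        return data, "content provider"


cp = ContentProvider({"video.mp4": b"frames"})
edge = CachingServer(cp)
first = edge.get("video.mp4")
second = edge.get("video.mp4")
print(first[1], "then", second[1])
```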
References
"Content Delivery Functional Architecture in NGN," Telecommunication Standardization Sector of ITU, White Paper, Sep. 2010.
"Content delivery networks: Market dynamics and growth perspectives," Informa Telecoms & Media, White Paper, Oct. 2012.
Cisco, Cisco Visual Networking Index: Global Mobile Data Traffic Forecast Update, http://www.cisco.com/c/en/us/solutions/collateral/service-provider/visual-networking-index-vni/white_paper_c11-520862.pdf [Accessed June 1, 2015]
Akamai, http://www.akamai.com/index.html/
LimeLight, http://www.limelight.com/
Level 3, http://www.level3.com/
CDNetworks, http://www.us.cdnetworks.com/
CDN Hierarchical Content Delivery
Case 1
Requested content is in the local cluster
Content request message is delivered to the CCF
CCF sends a session request message to the
CDPF to deliver the content to the user
CDPF delivers the content to the user
Content Delivery Procedures
Case 1 Procedures
Case 3
When the requested content is not in the CDN
Content request message is sent from the
CD & LCF to the CP
CP delivers the content to the user through
the local cluster
The requested content can be stored in
the local cluster
Content Delivery Procedure
Case 3 Procedures
CDN Market
CDN R&D