Escolar Documentos
Profissional Documentos
Cultura Documentos
DATA SHEET
Cloudera Enterprise gives us an easy
way to manage multiple clusters.
PAUL PERRY, DIRECTOR OF SOFTWARE,
EXPERIAN MARKETING SERVICES
Big Data 27
OTFL SME One-On-One
Audie Hittle
CTO of the Federal Market Segment
EMC Isilon
Storage: Software Dened
Every agency may be unique, but they
all want to transform their data stor-
age and information management
operations to achieve levels previously
not considered possible.
However today these levels are
possible, EMC Isilons Audie Hittle told
OTFL in a recent interview, through
intelligent data storage technologies
recently made possible through the use of sophisticated soft-
ware dened storage capabilities.
Mr. Hittle invests a signicant percentage of his time and
energy helping agencies understand how they can transform
their operations to achieve the desired operational efciencies.
He noted that by some estimates, up to 80% of all new data
created is unstructured such as imagery, video, massive
home directories, or network log les.
Weve found that scale-out network attached storage
(NAS) solutions are extremely well suited to effectively and ef-
ciently address these needs.
Validated Solution
EMC provides the Big Data technology of OneFS and the
Isilon product portfolio of Scale-Out Network Attached Stor-
age (NAS) to immediately help government cut costs, increase
efciencies, and take advantage of their Big Data potential,
explained Mr. Hittle.
I have personally validated a Big Data example, with a senior
government executive who was briefed by his own operations
team. They reduced their IT data storage stafng requirements
by 90%, enabling reallocation of resources to other higher pri-
ority missions. This thanks to the intelligence and automation of
the EMC Isilon operating system, OneFS.
Isilon does not use the industry standard Random Array of
Independent Disks (RAID) for data storage and protection, but
rather uses advanced software algorithms to distribute and
protect the data across an entire cluster of disks.
By doing this, it can offer sophisticated capabilities such as:
Role-Based Access Control (RBAC) to help ensure separa-
tion of data access from administrative access, and to
differentiate appropriate levels of administrative access
Auto-balancing, SyncIQ for remote synchronization;
SmartPools for next-generation tiering
SmartLock for data protection and retention, fundamen-
tally changing the way data storage is managed.
The example cited above is just one of many and reinforc-
ing data from industry analysts like IDC and Gartner Group
substantiating dramatic capital expense and operating expense
efciencies on the order of 40% to 50% overall, Mr. Hittle said.
Three Questions To Ask
Certainly, no one will deny operational efciency is a
prominent part of the discussion with IT buyers today. But all
too often, the discussion is focused on minimizing the up front
or initial cost of the procurement of the product or solution, Mr.
Hittle explained.
However one of the questions that could or should be
asked, and frequently is not, deals with Total Cost of Ownership
or TCO. The real value is in understanding what the TCO is going
to be over the life of the product. This includes everything from
initial procurement costs to energy to full life-cycle operations
and management costs, he said.
Data storage efciency improvements have enabled
achievement of 40% operating expense (OPEX) savings and
the ability to automate so many of the traditionally human-
resource/staff intensive functions, which go a long way in re-
ducing a clients TCO.
A second question might deal with the timelines and costs of
migrating data, he continued.
Traditionally, data migration is one of the most staff-
intensive processes, and yet, advanced concepts such as never
migrate again enable organizations to completely avoid costs
associated with any future data storage migrations.
Mr. Hittle also urged buyers to ask prospective data storage
vendors: How do you plan to deal with data migrations, both
from the existing storage to the new, and for any future expan-
sions or enhancements?
On how to organize infrastructure to take advantage of Big
Data, Mr. Hittle follows the advice of that famous philosopher
and Hall of Famer, Yogi Berra, who is recognized for saying,
Predictions are always toughespecially when they are about
the future.
When it comes to organizing your infrastructure, therefore,
it seems it would be best to avoid making any big predictions
or bets, where possible. Buy what you need now, and scale-out
or pay as you grow into the future.
The goal is to keep it simple, both in terms of architecture
and operational interfaces. In other words, look for things that
offer ease of architectural planning and integration as well as
signicant automation to minimize or eliminate unnecessary
operator intervention, Mr. Hittle said. n
28 Big Data
OTFL SME One-On-One
Jay Mork
Strategic Technology and Chief Technology Ofcer
General Dynamics Advanced Information Systems
Big Data can be a big problem in todays
data-driven world.
For some agencies Big Data, and
how to effectively manage it, can be a big
problem, Jay Mork told OTFL recently.
At General Dynamics Advanced
Information Systems, we understand
that there is no one-size-ts-all solution
when it comes to Big Data, he explained.
Instead, each Big Data solution is the sum of its parts; how
we innovatively combine relevant tools and technologies, tailor-
ing and ne tuning them for each customer to help advance
their mission, quickly and effectively.
General Dynamics has a long history and rich heritage in the
management of critical mission systems for customers in the
Intelligence Community (IC). We have an in-depth understand-
ing of what it takes to help our customers derive intelligence
from the prolic amounts of data they are inundated with each
and every day.
No Silver Bullet
When we think about Big Data, there is not one silver bullet
that comes to mind, Mr. Mork noted. Instead, we think about
the right combination of relevant solutions and approaches.
That is where true innovation happens,
A key component of Big Data is how to effectively break
down the silos of data; and reorganizing it to be more accessible,
anytime, anywhere.
We have found that by creating a unied environment
where the raw data resides, we are then able to leverage our
proven open architecture (OA) approach, said Mr. Mork.
By placing the OA on top of the raw data, we are able to
harness the power of our Big Data analytics to help our custom-
ers extract strategic meaning from these large volumes of raw
data. Our OA also allows our customers to rapidly integrate and
plug-in commercial-off-the-shelf (COTS) products, saving valu-
able time and money.
Leverage Existing Infrastructures
An essential element of successfully putting Big Data to
work is the architecture, noting that some customers existing
infrastructure is not built upon a foundation of OA.
When this is the case, we help our customers make a smart
migration to OA, actively leveraging existing infrastructures,
he said.
At General Dynamics, its all about the scally-smart migra-
tion to an OA design that will provide increased efciencies,
reduce dependency on hardware and enhance capabilities for
more users; not simply about ipping a switch.
Mr. Mork added the beauty of our OA solutions is that
they are exible and enable us to design, develop and deploy
custom Big Data solutions based on our customers mission-
specic needs.
He also counseled government managers to keep in mind
that a successful Big Data solution should take a full lifecycle
approach.
We build the architecture that allows customers to buy,
design and implement what they need based on their dynamic
mission. Our OA approach allows customers to access their big
data, delving into the unknown and extracting value, manipu-
lating it with technologies and tools. This exible and scalable
approach means customers can do whatever they need to do
with their Big Data.
Activity Based Intelligence (ABI)
As the volume of data continues to increase, activity based
intelligence (ABI) will remain a top priority for many IC decision
makers, Mr. Mork said.
For instance, in the IC, our customers need to sift through
mounds of geospatial, signal, communications and human intel-
ligence data to derive actionable intelligence in support of a
variety of missions.
With intelligence pulled from a variety of sources across
these equally varied domains, ABI goes beyond simply collect-
ing data and storing it, he explained.
ABI enhances the dimensions and context of mission-critical
information for analysts, while highlighting areas where more
information is required. ABI also allows analysts to connect the
dots faster and more efciently and, most importantly, provide
decision makers with real-time intelligence. And with our ex-
ible OA serving as the foundation to allow for rapid insertion of
needed technology, customers can truly harness the power of
Big Data.
Additionally, Mr. Mork said General Dynamics will continue
to invest in internal research and development efforts including
GDNexus.
From innovative concept to operations, GDNexus helps our
customers solve their Big Data challenges and reduce acquisi-
tion risk, increase operational capability and leverage proven
technology to take their Big Data solutions to the next level. n
The Sum Of Its Parts
Big Data 29
30 Big Data
My Big Data Top Ten
After collecting volumes of data from a variety of formats and
sources and then analyzing them using my human processor, Ive
come up with my Big Data Top Ten.
1
Big Data needs a consensus denition.
Go to any tech event and Big Data talk is on the lips of at-
tendees. While most say they know Big Data when they
see it, when asked to give a specic denition, answers
vary widely.
But we are getting closer.
NIST denes Big Data as: Where the data volume, acquisition
velocity, or data representation limits the ability to perform effec-
tive analysis using traditional relational approaches or requires the
use of signicant horizontal scaling for efcient processing.
The MeriTalk research would amend the NIST denition to: Big
Data is the set of technical capabilities and management processes
for converting vast, fast, or varied data into useful knowledge.
2
Big Data is well, big!
In 1978, I sold Big Data storage to the Naval Research
Lab. My format was an 8 oppy disk storing a robust
8 kilobytes. Hard multiple-platter Disk Packs stored
maybe 1 megabyte. Today hard disk capacities of 1.5 terabyte are
commonplace.
According to the TechAmerica report, in 2009, the govern-
ment produced 848 petabytes of data and healthcare data alone
reached 150 exabytes. Five exabytes (10
18
gigabytes) would con-
tain all words ever spoken by human beings on earth.
3
Big Data is new Not true
While the term Big Data with initial caps is new, big
data itself is not new.
For example, NOAA/National Weather Service has
been processing it since the 1950s. Today NOAA manages over 30
petabytes of new data per year. (How many 8K oppys is that?)
4
Big Data characteristics: The 4Vs
Dealing with Big Data means dealing with:
(a) Volume The sheer amount of data generated or
data intensity that must be ingested, analyzed, and
managed.
(b) Velocity The speed data is being produced and changed;
and the speed with which data must be received, understood, and
processed.
(c) Variety Structured and unstructured data in a variety
of formats creates integration, management, governance, and
architectural pressures on IT.
(d) Veracity The quality and provenance of received data
must be veried.
5
Why Big Data now!
Technology nally has the power to handle the 4Vs.
We now have the tools to really ask what if and to
explore data sets that were available before or didnt
exist before. Now it is possible to really think about the art of the
possible. We are witnessing a true democratization of data.
6
Big Data is now affordable.
You dont need to start from scratch with new IT
investments. You can use your existing capabilities,
technologies and infrastructure to begin pilots. There
is no single technology is required; there are no must haves.
7
The Big Data path is a circle not
a straight line.
Dene. Assess. Plan. Execute. Review. TechAmerica
recommends a ve step cyclical approach that is itera-
tive versus serial in nature, with a constant closed feedback loop
that informs ongoing efforts.
8
The Help Wanted sign is up!
We have to grow the next generation of data scien-
tists. The McKinsey Global Institute Analysis report
predicts a shortfall of 200,000 data scientists over
the next ve or so years and shortfall of 1 million managers in
organizations where data will be critical for them to be successful
in their organization e.g. government.
9
Government is funding foundational
Big Data R&D.
Projects are moving forward via the $200 million in-
vestment announced by the administration in March
2012. In October 2012, NSF/NIH announced 8 awards covering
three areas: data management, data analytics and collaboration/
data sharing. New solicitations, competitions and prizes are in the
ofng with opportunities for anyone who has a good idea.
10
The Big Data Market is growing bigger.
Deltek forecasts demand for big data solutions
by the U.S. government will increase from $4.9
billion in FY 2012 to $7.2 billion in FY 2017 at a
compound annual growth rate (CAGR) of 8.2%.
Big Data means big opportunities. Thats truly a big deal. n
By Jeff Erlichman, Editor, On The FrontLines
Assured IT. Delivered by Cloud.
> Lower your data center resource costs by up to 35%
> Reduce inter-data center transaction bandwidth requirements by 50%
> Reduce cloud access and backbone network costs by 40%
Learn how Cienas trusted, reliable, and secure cloud networking solutions
enable the highest performance, most efcient and cost-effective cloud service
architectures for Government.
http://www.ciena.com/solutions/cloud
CREATE A DATA CENTER WITHOUT WALLS
Welcome To
The Cloud