
What is Big Data?

Data is made up of bits and bytes, and we are immersed in it in our everyday lives. Until
yesterday, the data on your company's servers was just data, sorted and filed. Then the
term Big Data became popular, and now the data in your company is Big Data. The term
covers every piece of data your organization has stored so far, including data stored in
the cloud and even the URLs you have bookmarked. Your company may not have digitized
or structured all of its data yet. Even so, all of the data your company holds, whether
digital or on paper, structured or unstructured, is now Big Data.
In short, all the data present on your servers, whether categorized or not, is collectively
called Big Data. This data can be analyzed in different ways to obtain different results.
Not every analysis needs all of the data: each analysis uses the parts of the Big Data it
needs to produce the required results and predictions.
Big Data is essentially the data that you analyze for results that you can use for
predictions and other purposes. Once you use the term Big Data, your company or
organization is applying advanced information technology to derive different kinds of
results from the same data that you have stored, intentionally or unintentionally, over
the years.

Why do we need Big Data?

Big Data is needed for the following reasons:

More than 90% of all data has been generated in the last two years alone.
Almost 80% of data is unstructured or exists in a wide range of formats, and is thus hard to analyze.
Structured data also has limitations when the data sets are very large.
It is difficult to integrate data coming from multiple systems.
Business users need to isolate the areas that will help them grow.
Potentially valuable data is either left dormant or discarded.
Combining large volumes of unstructured data becomes more costly.
Some types of data are useful only for a very short time.
Existing information becomes more meaningful when it is adjusted on the basis of context.

Data Explosion
More than 2 exabytes of data are generated every day.
The following are a few of the sources that generate a tremendous
amount of data every day.
Stock Market: generates more than 1 TB of data in a single day.
Sensor Data: a single engine on a cross-country flight generates 20 TB of data per hour.
YouTube: more than 48 hours of video are uploaded by users every minute.
Facebook: generates more than 10 TB of data in a single day.
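
To compare these figures, it helps to put them in one unit. The short Python sketch below does this for the stock market, engine, and Facebook figures. It is an illustration only: decimal units (1 TB = 10**12 bytes, 1 EB = 10**18 bytes) and the 5-hour flight duration are assumptions, and the names used (engine_flight_bytes, share_of_global) are hypothetical helpers, not part of any tool.

```python
# Decimal (SI) units are an assumption; the text does not say which convention it uses.
TB = 10**12  # bytes in one terabyte
EB = 10**18  # bytes in one exabyte

GLOBAL_DAILY_BYTES = 2 * EB  # "more than 2 exabytes ... every day"

# Hypothetical flight duration, chosen only for illustration.
ASSUMED_FLIGHT_HOURS = 5


def engine_flight_bytes(hours, rate_tb_per_hour=20):
    """Bytes produced by one engine at the quoted 20 TB/hour rate."""
    return rate_tb_per_hour * TB * hours


def share_of_global(source_bytes, total_bytes=GLOBAL_DAILY_BYTES):
    """Fraction of the quoted global daily volume that a source represents."""
    return source_bytes / total_bytes


# Lower bounds quoted in the text, in bytes.
volumes = {
    "Stock market (per day)": 1 * TB,
    "Facebook (per day)": 10 * TB,
    "One engine (per assumed 5-hour flight)": engine_flight_bytes(ASSUMED_FLIGHT_HOURS),
}

for name, volume in volumes.items():
    print(f"{name}: {volume // TB} TB, {share_of_global(volume):.6%} of 2 EB")
```

Under these assumptions, one engine on a single flight produces several times more data than the quoted daily figure for Facebook, yet each source is still a tiny fraction of the 2 exabytes generated per day.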

Big Data and Its Sources:


Big data is a broad term for data sets so large or complex that
traditional data processing applications are inadequate. Challenges
include analysis, capture, data curation, search, sharing, storage,
transfer, visualization, and information privacy.
Its Sources:
Atmospheric Science, Astronomy, Biochemical and Medical Records
Internet Pages
Internet Texts and Documents
Military Surveillance
Photographic Archives
Social Media
Scientific Research
Sensor Networks
Search Index Data
Web Logs

FOR MORE DETAILS:

HP SOFTWARE UNIVERSITY
123, MTH ROAD, AMBATTUR OT, CHENNAI-53
CALL US: 7338877900
