
How to Start a Big Data Project


BIG DATA is not just HADOOP


Understand and navigate federated big data sources: Federated Discovery and Navigation
Manage & store huge volume of any data: Hadoop File System, MapReduce
Structure and control data: Data Warehousing
Manage streaming data: Stream Computing
Analyze unstructured data: Text Analytics Engine
Integrate and govern all data sources: Integration, Data Quality, Security, Lifecycle Management, MDM



2012 IBM Corporation

Business-Centric Big Data Enables You to Start With a Critical Business Pain and Expand the Foundation for Future Requirements

Big data isn't just a technology; it's a business strategy for capitalizing on information resources.
Getting started is crucial.
Success at each entry point is accelerated by products within the Big Data platform.
Build the foundation for future requirements by expanding further into the big data platform.

IBM Big Data Strategy: Move the ANALYTICS Closer to the Data
Analytic Applications
BI / Reporting, Exploration / Visualization, Functional App, Industry App, Predictive Analytics, Content Analytics

New analytic applications drive the requirements for a big data platform
Integrate and manage the full variety, velocity and volume of data
Apply advanced analytics to information in its native form
Visualize all available data for ad hoc analysis (even in motion!)
Development environment for building new analytic applications
Workload optimization and scheduling
Security and Governance

IBM Big Data Platform


Visualization & Discovery, Application Development, Systems Management

Accelerators

Hadoop System
BigInsights
certified Apache Hadoop

Stream Computing

Data Warehouse

Information Integration & Governance

And grow and evolve on your current IT infrastructure


Four Entry Points of Big Data


The four entry points, shown on the IBM Big Data Platform diagram:
Unlock Big Data
Simplify Your Warehouse
Preprocess Raw Data
Analyze Streaming Data



Hadoop
Open-source software framework from Apache
Inspired by:
Google MapReduce
GFS (Google File System)

Core components: HDFS, Map/Reduce
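
To make the Map/Reduce idea concrete, the following is a minimal single-process sketch of the programming model in plain Python. It does not use Hadoop or any Hadoop API; the function names (map_phase, shuffle, reduce_phase, word_count) are hypothetical and chosen only for illustration. On a real cluster the framework would run the map and reduce steps in parallel across nodes and read the input from HDFS.

```python
from collections import defaultdict


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one input record."""
    for word in document.lower().split():
        yield word, 1


def shuffle(pairs):
    """Shuffle: group emitted values by key (done by the framework on a cluster)."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: collapse all values for one key into a single result."""
    return key, sum(values)


def word_count(documents):
    """Run map, shuffle and reduce in sequence over an in-memory list of documents."""
    pairs = (pair for doc in documents for pair in map_phase(doc))
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data is not just Hadoop", "Hadoop stores big data"]))
```

The split into three small functions mirrors the three stages of the model, so each stage can be read (and replaced) on its own.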


InfoSphere BigInsights
Platform for volume, variety, velocity: Enhanced Hadoop foundation
Analytics: Text analytics & tooling, Application accelerators
Usability: Web console, Spreadsheet-style tool, Ready-made apps
Enterprise class: Storage, security, cluster management
Integration: Connectivity to Netezza, DB2, JDBC databases, etc.

Can also run on top of

Editions, by breadth of capabilities (both build on Apache Hadoop):
Basic Edition (free download): Integrated install, Online InfoCenter, Big Data University
Enterprise Edition (licensed): Application accelerators, Pre-built applications, Text analytics, Spreadsheet-style tool, RDBMS and warehouse connectivity, Administrative tools, security, Eclipse development tools, Performance enhancements, ...

Spreadsheet-style Analysis
Web-based analysis and visualization

Spreadsheet-like interface
Define and manage long-running data collection jobs
Analyze the text content of the pages that have been retrieved


Build a Big Data Program: MapReduce example


Eclipse tools
For Jaql, Hive, Pig, Java MapReduce, BigSheets plug-ins, text analytics, etc.


BigInsights and the data warehouse


Big Data analytic applications

Traditional analytic tools

Data warehouse

BigInsights

Filter

Transform

Aggregate
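
The filter, transform and aggregate steps that sit between raw data and the warehouse can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the sample records, the field names (user, action, amount) and the function names are invented here, and nothing in it uses BigInsights or a warehouse API. It shows the shape of the preprocessing, where raw events are reduced to a small summary that would be cheap to load into a warehouse.

```python
from collections import defaultdict

# Hypothetical raw events; field names and values are invented for illustration.
RAW_EVENTS = [
    {"user": "a", "action": "purchase", "amount": "19.90"},
    {"user": "b", "action": "click", "amount": ""},
    {"user": "a", "action": "purchase", "amount": "5.10"},
    {"user": "c", "action": "purchase", "amount": "bad"},
]


def keep(event):
    """Filter: keep only purchase events whose amount parses as a number."""
    if event["action"] != "purchase":
        return False
    try:
        float(event["amount"])
    except ValueError:
        return False
    return True


def transform(event):
    """Transform: reduce an event to (user, amount in integer cents)."""
    return event["user"], round(float(event["amount"]) * 100)


def aggregate(pairs):
    """Aggregate: total the cents per user."""
    totals = defaultdict(int)
    for user, cents in pairs:
        totals[user] += cents
    return dict(totals)


def preprocess(events):
    """Chain the three steps; the result is the summary destined for the warehouse."""
    return aggregate(transform(event) for event in events if keep(event))


if __name__ == "__main__":
    print(preprocess(RAW_EVENTS))
```

Converting amounts to integer cents in the transform step is a design choice made here to keep the per-user totals exact when they are summed.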
