Escolar Documentos
Profissional Documentos
Cultura Documentos
Sponsor
Big Data: How to Convert the Big Hype into Big Value With Analytics
Optimized systems
Universal systems
Optimized systems
Represent next generation data management and analytic solutions that could not previously be supported because of:
Limited or incomplete information
Technology limitations
Cost
Big Analytics
Build models
Statistical Techniques Multiple linear regression Non-linear progression Factor analysis Structural equations model Cluster analysis Forecasting Logistic regression Non-Statistical Techniques Blog mining Neural networks Market basket analysis Operations Research Mixed integer programming Linear programming
Analysis services
Market Analytics Market volume forecasting Market share models Promotion effectiveness Market basket analysis Price elasticity modeling Product portfolio analysis Lifestyle segmentation Demand forecasting Customer Analytics Customer behavior analysis Profiling & segmentation Response modeling Cross-sell/up-sell modeling Loyalty & attrition modeling Profitability & lifetime modeling Purchase/usage behavior analysis Propensity scoring Campaign management
8
Source: www.dexterity.in
Conclude
3
What shall I do now?
Whats useful?
Refine
Distill information, apply algorithms, identify patterns
Act
2 4
Share with others, make a decision, initiate a process
Assemble V3+
Capture and query, monitor and stream, big data sources
Multimedia
Relational EDW with at rest data Dimensional cubes/marts with at rest data
One-size fits all data management Rigid data governance Reporting and OLAP Reports and dashboards
Non-relational stores with at rest data Streaming systems with in motion data
Flexible & optimized data management Flexible data governance Advanced analytic functions & predictive models Sophisticated visualization of large result sets
Key requirements are: The ability for organizations to easily analyze large volumes of structured and multi-structured data with good price/performance The need to make technologies for developing and running these analyses more usable by information workers such as data scientists Organizations will likely use multiple data management systems and analytic tools the challenge is deciding which to use when and interconnecting the systems
11
Use Cases - 1
Use Case
Real-Time Filtering, Monitoring & Analytics Near-Real-Time Analytics Data Integration Hub BI Accelerator New LOB BI Application Investigative Computing
Built for purpose (optimized) systems
Streaming System
Analytic RDBMS
NonRelational System
12
Use Cases - 2
Use Case
Real-Time Filtering, Monitoring & Analytics
Application Examples
In-line fraud detection Dynamic network/smart grid optimization Real-time equipment tracking, failure prediction & action Customer next best off/option Customer service center optimization Shipping/freight service-level tracking & re-optimization Detailed data integration & archiving Detailed data filtering & transformation Detailed data aggregation
Near-Real-Time Analytics
Reduce latency of existing LOB analyses Reduce costs of existing LOB analyses
Display advertising spot buying & effectiveness Web site traffic analysis & optimization Dynamic product/service pricing optimization Enhanced customer modeling & analytics Improved fraud detection New sensor-based analytic applications
Investigative Computing
13
Monitor fish movements and pollution levels via geo-spatial views, public transparency reporting, cross-institutional collaboration
Data scientist
Data mining, what-if analysis for impact analysis, correlating pollution levels with events & seasonal activity
Source IBM: Inspired by the River and Estuary Observatory Network (REON) project Monitoring pollution levels in rivers and estuaries in New York state using sensors
14
SMS
2 3
DM receives list of candidate marketing offers from EMM Optionally EMM calls out to SPSS to help determine candidate offers
1 5
Next Best Action delivered to the customer through the appropriate channel
Decision Management determines NBA from: Marketing offers (EMM) Service Problems Billing Information Location Service Issue Issue Resolution Dispute Satisfaction Account Management Advice Self Service Channel Match Agent Match etc.
Decision Services
Business Optimization Rules Predictive Analytics Text Analytics Entity Analytics
Source: IBM
Hadoop
Core Database
Demographic (DB, surveys) Interactions (Call center, Web)
15
Requirements:
Cost effectively manage increasing data volume Reduce the number of data warehouses and ETL jobs Reduce analytical processing times and provide intra-day analytics
Capture and store all detailed transaction data (POS data, web clicks, supply chain events, etc.) for analysis
Solution:
Hadoop data hub and BI accelerator Manages all detailed structured (and multi-structured) data Data hub is used to distribute required data to other analytics systems Hadoop system is also used to accelerate performance-critical analyses
Primary source: Presentation by Dr. Phillip Shelley (CTO Sears Holdings and CEO MetaScale) at the Hadoop Summit, June 2012
16
Primary source: Presentation by Dr. Phillip Shelley (CTO Sears Holdings and CEO MetaScale) at the Hadoop Summit, June 2012
17
18
37.3% 4.0%
12.1%
46.6%
Identify discussion volume across multiple social channels Measure consumer reaction Compare results with traditional research data
Neutral
Positive
Negative
Ambivalent
Drove market actions that prevented costly pricing discounts and package re-design
Source: IBM
19
Analytic Applications
BI / Exploration / Functional Industry Predictive Content BI / Reporting Visualization App App Analytics Analytics Reportin g
20
True value is gained from a hybrid of existing and new data systems
Integration with existing enterprise systems will be a key vendor differentiator The vendors that win will be those that can best tackle cost and/or advanced analytics requirements Big data is causing significant innovation in:
Data management Analytics and visualization Data-driven automation
Copyright BI Research, 2012 21
22
Questions?
23
Contacting Speakers
If you have further questions or comments: Colin White, BI Research cwhite@bi-research.com Harriet Fryman hfryman@us.ibm.com