Você está na página 1de 26

BI for Big Data

Beyond the Hype

1 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555


Pentaho Mission
The Future of Analytics: Big Data Exploration without Boundaries

Modern, unified data integration and business


analytics platform
• Native integration into big data ecosystem
• Embeddable, cloud-ready analytics

Fast and Broad Innovation


• Open source development model

Critical mass achieved


• Over 1,000 commercial customers
• Over 10,000 production deployments

2 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555


Ian Fyfe
Big Data Solutions Engineering, Pentaho

Ian brings over 20 years of experience in the business analytics software market
with roles spanning consulting services, pre-sales engineering, product
management and product marketing. Ian started his career by co-founding a
business intelligence startup and has worked at Business Objects, Informix,
Epiphany, PeopleSoft and Jaspersoft.

3 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555 3


Common Use Cases

4
4 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
The Value of Big Data for our Customers
Big opportunities

Drive incremental revenue


• Predict customer behavior across all channels
• Understand and monetize customer behavior

Improve operational effectiveness


• Machines/sensors: predict failures, network attacks
• Financial risk management: reduce fraud, increase security

Reduce data warehouse cost


• Integrate new data sources without increased database cost
• Provide online access to ‘dark data’

5 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555


Example Use Cases Today

Transactional Non-Transactional
•Fraud detection •Web pages, blogs etc
•Financial services / stock •Documents
markets •Physical events
•Application events
Sub-Transactional •Machine events
•Weblogs
•Social/online media
•Telecoms events

© 2010, Pentaho. All Rights Reserved. www.pentaho.com. US and Worldwide: +1 (866) 660-7555 | Slide
6 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
Click Stream Analytics
From buying patterns to revenue

Business Challenge
• Monetize buying patterns hidden in billions of
data points
• Quickly analyze multi-channel click stream data

Pentaho Benefits
• Reduced ETL time to analyze blended data
from Hadoop, Hbase & data warehouse
• Use of big data analytics to grow revenue from
targeted campaigns

7 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555


Device Data Analytics
Big Data for Fortune 100 Enterprise Storage provider

Business Challenge
• Affordably scale machine data from storage
devices for customer support app
• Predict device failure
• Enhance product performance

Pentaho Benefits
• Easy to use ETL & analysis for Hadoop, Hbase,
& Oracle data sources
• 15x cost improvement
• Stronger performance against customer SLA’s

8 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555


Innovative Organizations Use Pentaho
to Unlock Value from Big Data Stores

Online Retailer Mobile & Digital Media


Understanding the buying patterns Embedded Pentaho to measure
of 5 million users from click stream massive volumes of mobile and
data stored in Hadoop & HBase event data generated from mobile
devices stored in MongoDB

Gaming Travel & Entertainment


Better monetization of premium Helping thousands of travel
game features through analyzing partners like expedia.co.uk and
large volumes of player data - thomascook.fr improve promotional
stored in MongoDB & Infobright targeting using Hbase and Hadoop

Social Commerce Healthcare


Better campaign performance Embedded Pentaho to better
through monitoring social media, patient care & compliance through
page clicks and email marketing analysis of unstructured digital pen
data stored in HP Vertica data stored in CouchDB

9 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555


Pentaho Embedded Analytics
New Revenue Stream in Eight Weeks

Business Challenge
• Gain new revenue source from add-on
module with reporting, analysis & dashboards
• Get to market fast to differentiate

Pentaho Benefits
• Easy to embed & brand
• Broad capabilities result in new revenue stream
• Increased functionality & compelling
visualizations

10 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555


Embedded Analytics
Dashboard Designer
Pentaho Uniquely Positioned to Win

Why We Win in Embedded:


• Architectural ‘sweet spot’ for Pentaho
platform
• Flexible pricing, adaptable to fit partner
pricing
• Open source and innovation Dashboard Framework
• Fastest time-to-market for embedded
analytics

Continued Leadership:
• Cloud & multi-tenancy ease-of-use
• Simplified REST services for ISVs
• BI Platform SDK enhancements – deep
solution examples, tutorials and training
• Continued focus on standards and
extensibility

11 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555


Big Data Technologies
BI Strengths and Weaknesses

12
12 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
© 2012, Pentaho. All Rights Reserved.
The Current Solutions
10,000

Current Database Solutions are designed for


GIGABYTES OF DATA CREATED (IN BILLIONS)

structured data.

• Optimized to answer known questions quickly


• Schemas dictate form/context
5,000
• Difficult to adapt to new data types and new
questions
• Expensive at petabyte scale

0 10%
2005 2010 2015

STRUCTURED DATA UNSTRUCTURED DATA

13 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555


Main Big Data Technologies

Hadoop NoSQL Databases Analytic RDBMS


• Low cost, reliable • Huge horizontal scaling • Optimized for bulk-load
scale-out architecture and high availability and fast aggregate
• Distributed computing • Highly optimized for query workloads
Proven success in retrieval and appending • Types
Fortune 500 • Types • Column-oriented
companies • Document stores • MPP
• Exploding interest • Key Value stores • In-memory
• Graph databases

Hadoop NoSQL Databases Analytic Databases

14 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555


Hadoop Core Components

HADOOP DISTRIBUTED FILE SYSTEM (HDFS)


❯ Massive redundant storage across a commodity
cluster
MAPREDUCE
❯ Map: distribute a computational problem
across a cluster
❯ Reduce: Master node collects the answers to
all the sub-problems and combines them

MANY DISTROS AVAILABLE

US and Worldwide: +1 (866) 660-7555 | Slide

© 2010, Pentaho. All Rights Reserved. www.pentaho.com.


15 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
Major Hadoop Utilities

Apache Pig
High-level language
for expressing data
Apache Hive analysis programs
Apache HBase
SQL-like language and
metadata repository The Hadoop database.
Random, real -time
read/write access

Hue
Apache Zookeeper
Browser-based
desktop interface for Highly reliable
interacting with distributed
Hadoop coordination service

Oozie
Flume
Server-based
workflow engine for Distributed service for
Hadoop activities collecting and
aggregating log and
event data

Sqoop
Apache Whirr
Integrating Hadoop
with RDBMS Library for running
Hadoop in the cloud

16 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555


Hadoop & Databases

17 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555


Big Data Platform Challenges

“The working conditions can


be are shocking”

ETL Developer

18 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555


Challenges

1. Somewhat immature
2. Lack of tooling
3. Steep technical learning curve
4. Hiring qualified people
5. Availability of enterprise-ready products and tools
6. High latency (Hadoop)
7. Running inside the cluster

19 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555


Challenges
Ingestion / Manipulation /
Integration

Scheduling

Modeling

WOULD YOU RATHER DO THIS? … OR THIS?

20 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555


Investigating
BI & Big Data Solutions

21
21 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
Questions to Ask
Business Drivers
1. Mandate to reduce EDW costs?
2. Clear use case that you need to solve?
3. Do you have access to technical skill set?

Technical
1. Do you have more than one kind of big data store, for example Hadoop as well as HBase,
MongoDB or Cassandra?
2. Would you prefer to use the same tool for big data stores in addition to your traditional relational
data stores?
3. Are you ok waiting minutes or even hours to access your big data?
4. Are you ok using a spreadsheet-like interface to access and analyze your data?
5. Do you need complete BI capabilities, including reporting, interactive visualization, and predictive
analytics?
6. Do you need to enrich your big data with data from outside of the big data platform?
7. Is the big data you want to analyze bigger than the amount of memory you have available?

http://blog.pentaho.com/tag/ian-fyfe/

22 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555


Demo

23
23 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
© 2012, Pentaho. All Rights Reserved.
Complete Big Data Analytics &
Visual Data Management

Data Ingestion Enterprise & Data Discovery Predictive Analytics


Manipulation Ad Hoc Reporting Visualization
Integration

Pentaho Big Data Analytics

Analytic
Hadoop NoSQL Relational
Databases

24 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555


Open

Discussion

25 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555


Thank You
JOIN THE CONVERSATION. YOU CAN FIND US ON:

blog.pentaho.com Facebook.com/Pentaho

@Pentaho Pentaho Business Analytics

26 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555

Você também pode gostar