Week 1: Understanding Big Data; Introduction to HDFS
Week 2: Playing Around with the Cluster; Data Loading Techniques
Week 3: Map-Reduce Basics, Types and Formats; Use Cases for Map-Reduce
Week 4: Analytics Using Pig; Understanding Pig Latin
Week 5: Analytics Using Hive; Understanding HiveQL
Week 6: NoSQL Databases; Understanding HBase
Week 7: Real-World Datasets and Analysis; Hadoop Project Environment
Week 8: Project Reviews; Planning a Career in Big Data
How it works
- Live classes
- Class recordings
- Module-wise quizzes and coding assignments
- 24x7 on-demand technical support
- Project work on large datasets
- Online certification exam
- Lifetime access to the Learning Management System
Facebook Example
Facebook users spend 10.5 billion minutes (almost 20,000 years) online on the social network. On average, 3.2 billion likes and comments are posted on Facebook every day.
Twitter Example
Twitter has over 500 million registered users. The USA's 141.8 million accounts represent 27.4 percent of all Twitter users, well ahead of Brazil, Japan, the UK, and Indonesia. 79% of US Twitter users are more likely to recommend brands they follow, 67% are more likely to buy from brands they follow, and 57% of all companies that use social media for business use Twitter.
Hadoop Users
http://wiki.apache.org/hadoop/PoweredBy
2011: 1.8 ZB
2015: 7.9 ZB
The world's information doubles every two years. Over the next 10 years:
- The number of servers worldwide will grow by 10x
- The amount of information managed by enterprise data centers will grow by 50x
- The number of files enterprise data centers handle will grow by 75x
Source: http://www.emc.com/leadership/programs/digitaluniverse.htm, which was based on the 2011 IDC Digital Universe Study
Why DFS?
Read 1 TB of data:
- 1 machine, 4 I/O channels, each channel 100 MB/s: 45 minutes
- 10 machines, 4 I/O channels each, each channel 100 MB/s: 4.5 minutes
What is Hadoop?
Apache Hadoop is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models.

Hadoop is used in production by:
- Amazon
- AOL
- IBM
- And many more; see http://wiki.apache.org/hadoop/PoweredBy
Hadoop Eco-System
What is HDFS?
HDFS - Hadoop Distributed File System
- Highly fault-tolerant
- High throughput
- Suitable for applications with large data sets
- Streaming access to file system data
- Can be built out of commodity hardware
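One reason HDFS suits large files: each file is split into fixed-size blocks that are spread, and replicated, across DataNodes. A minimal sketch of that bookkeeping, assuming the Hadoop 1.x defaults of a 64 MB block size and a replication factor of 3 (both are configurable, and later Hadoop versions default to 128 MB blocks):

```python
import math

BLOCK_MB = 64        # assumed Hadoop 1.x default block size
REPLICATION = 3      # assumed default replication factor

def block_layout(file_mb):
    """Return (number of blocks, total block replicas stored cluster-wide)."""
    blocks = math.ceil(file_mb / BLOCK_MB)
    return blocks, blocks * REPLICATION

# A 1 TB (1,000,000 MB) file: 15,625 blocks, 46,875 replicas across the cluster
print(block_layout(1_000_000))  # → (15625, 46875)
```

Spreading replicas over many commodity machines is what gives HDFS both its fault tolerance (losing a node loses no data) and its throughput (many machines read blocks in parallel).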
DataNodes:
Slaves deployed on each machine in the cluster; they provide the actual storage and are responsible for serving read and write requests from clients.
Secondary NameNode:
- Not a hot standby for the NameNode
- Connects to the NameNode every hour*
- Does housekeeping and backs up the NameNode metadata
- The saved metadata can be used to rebuild a failed NameNode
The NameNode is a single point of failure; the Secondary NameNode receives its metadata every hour and keeps a secure copy.
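The hourly checkpoint interval marked with the asterisk above is configurable. A sketch of the relevant property, assuming the Hadoop 1.x name `fs.checkpoint.period` (renamed `dfs.namenode.checkpoint.period` in Hadoop 2.x); the value is in seconds:

```xml
<!-- hdfs-site.xml: how often the Secondary NameNode checkpoints the metadata -->
<property>
  <name>fs.checkpoint.period</name>
  <value>3600</value> <!-- default: one hour -->
</property>
```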
HDFS Architecture
Job Tracker
Rack Awareness
Thank You
See You in Class Next Week