Bem-vindo(a) ao Scribd!

C08 Big Data Spark

Enviado por

0% acharam este documento útil (0 voto)

855 visualizações1 página

1. Introduction. 1.1. Big data and data science projects. 1.2. Big data architectures. 1.3. Data processing and analytics patterns. 1.4. Reproducible data science.  Python: Anaconda and Jupyter.  Apache Zeppelin.

Direitos autorais

Formatos disponíveis

PDF, TXT ou leia online no Scribd

Compartilhar este documento

Compartilhar ou incorporar documento

Opções de compartilhamento

Você considera este documento útil?

Este conteúdo é inapropriado?

Denunciar este documento

Direitos autorais:

Formatos disponíveis

Baixe no formato PDF, TXT ou leia online no Scribd

Sinalizar o conteúdo como inadequado

0% acharam este documento útil (0 voto)

855 visualizações1 página

C08 Big Data Spark

Enviado por

angelvi

Direitos autorais:

Formatos disponíveis

Baixe no formato PDF, TXT ou leia online no Scribd

Sinalizar o conteúdo como inadequado

Pular para a página

Você está na página 1de 1

Pesquisar no documento

Course 8 Big Data with Apache Spark

Program 1. Introduction.
1.1. Big data and data science projects.
1.2. Big data architectures.
1.3. Data processing and analytics patterns.
1.4. Reproducible data science.
 Python: Anaconda and Jupyter.
 Apache Zeppelin.
2. Spark framework and APIs.
2.1. Evolution of the Apache Hadoop ecosystem.
2.2. Spark technology stack.
2.3. Spark APIs.
2.4. Programming languages: Scala, Python and R.
2.5. Logical and physical architecture.
3. Data processing with Spark.
3.1. The Spark programming model.
3.2. New features in Spark v2.
3.3. Spark applications.
 Local mode.
 Cluster: Standalone manager, YARN, Mesos.
3.4. Spark programming (I) RDDs.
3.5. Spark programming (II): DataFrames, Datasets and GraphFrames.

4. Spark Streaming.
4.1. Overall architecture.
4.2. Structured Streaming.
4.3. Programming with Spark Streaming.
 Stream data sources.
 Transformations.
 Output operations.
 Interaction with other components..
4.4. Case example with Spark Streaming.
5. Machine Learning with Spark MLlib.
5.1. Spark ML and MLlib.
5.2. Pipelines.
5.3. Case example: Classification and regression.
5.4. Case example: Recommender systems.
5.5. Case example: Dynamic clustering.
Prerequisites It is strongly recommended to use GNU/Linux (Ubuntu, Debian or any other
distribution), either as the native operating system or inside a virtual machine (e.g.
VirtualBox). Participants will use a self-contained environment with all software
requirements already pre-installed.
It is assumed some previous knowledge about programming in any language, but
preferably Python 3 or Java. Basic knowledge about data mining/machine learning
algorithms is also advisable.
Bibliography - Zecevic, P., Bonaci. M. (2016). Spark in Action. Manning Publications.
- Karau, H., Konwinski, A., Wendell, P., Zaharia, M. (2015). Learning Spark: Lightning-
Fast Big Data Analysis. O'Reilly Media.
- Guller, M. (2015). Big Data Analytics with Spark: A Practitioner's Guide to Using
Spark for Large Scale Data Analysis. Apress.
- Karau, H., Warren, R. (2017) High Performance Spark: Best Practices for Scaling and
Optimizing Apache Spark. O’Reilly Media.
- Ryza, S., Laserson, U., Owen, S., Wills, J. (2015). Advanced Analytics with Spark:
Patterns for Learning fom Data at Scale. O'Reilly Media.

Você também pode gostar

Settlers of Catan The Prosperity Rules
Documento1 página
Settlers of Catan The Prosperity Rules
angelvi
Ainda não há avaliações
Instituto Tecnológico de Galicia
Documento8 páginas
Instituto Tecnológico de Galicia
angelvi
Ainda não há avaliações
Cat An 2 Players
Documento1 página
Cat An 2 Players
Carmelo Vaccarello
Ainda não há avaliações
Optimization Publications
Documento4 páginas
Optimization Publications
angelvi
Ainda não há avaliações
An Introduction To Reinforcement Learning
Documento63 páginas
An Introduction To Reinforcement Learning
angelvi
Ainda não há avaliações
Cat An 2 Players
Documento1 página
Cat An 2 Players
Carmelo Vaccarello
Ainda não há avaliações
Mastering Chess and Shogi by Self-Play With A General Reinforcement Learning Algorithm
Documento19 páginas
Mastering Chess and Shogi by Self-Play With A General Reinforcement Learning Algorithm
eldelmoño luci
Ainda não há avaliações
Introduction To Linux
Documento22 páginas
Introduction To Linux
angelvi
Ainda não há avaliações
Linux Crash Course: Command Description Filesystem / PWD - ..
Documento5 páginas
Linux Crash Course: Command Description Filesystem / PWD - ..
angelvi
Ainda não há avaliações
Ethernet
Documento11 páginas
Ethernet
angelvi
Ainda não há avaliações
A Recursive Algorithm For Describing Evolution in Transition P Systems
Documento12 páginas
A Recursive Algorithm For Describing Evolution in Transition P Systems
angelvi
Ainda não há avaliações
Build Your Own 2D Game Engine and Create Great Web Games
Documento1 página
Build Your Own 2D Game Engine and Create Great Web Games
angelvi
Ainda não há avaliações
Duplex (Telecommunications)
Documento4 páginas
Duplex (Telecommunications)
angelvi
Ainda não há avaliações
C05 Neural Networks and Deep Learning
Documento2 páginas
C05 Neural Networks and Deep Learning
angelvi
Ainda não há avaliações
LS Vs AG - G2 - English
Documento35 páginas
LS Vs AG - G2 - English
angelvi
Ainda não há avaliações
LS Vs AG - G1 - English
Documento33 páginas
LS Vs AG - G1 - English
angelvi
Ainda não há avaliações
MCTS and AlphaGo
Documento58 páginas
MCTS and AlphaGo
angelvi
Ainda não há avaliações
Agricola
Documento8 páginas
Agricola
angelvi
Ainda não há avaliações
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
No Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
Nota: 4 de 5 estrelas
4/5 (5794)
The Little Book of Hygge: Danish Secrets to Happy Living
No Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
Nota: 3.5 de 5 estrelas
3.5/5 (400)
Shoe Dog: A Memoir by the Creator of Nike
No Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
Nota: 4.5 de 5 estrelas
4.5/5 (537)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
No Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
Nota: 4 de 5 estrelas
4/5 (895)
The Yellow House: A Memoir (2019 National Book Award Winner)
No Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
Nota: 4 de 5 estrelas
4/5 (98)
The Emperor of All Maladies: A Biography of Cancer
No Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
Nota: 4.5 de 5 estrelas
4.5/5 (271)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
No Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
Nota: 3.5 de 5 estrelas
3.5/5 (231)
Never Split the Difference: Negotiating As If Your Life Depended On It
No Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
Nota: 4.5 de 5 estrelas
4.5/5 (838)
Grit: The Power of Passion and Perseverance
No Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
Nota: 4 de 5 estrelas
4/5 (588)
On Fire: The (Burning) Case for a Green New Deal
No Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
Nota: 4 de 5 estrelas
4/5 (73)
Yes Please
No Everand
Yes Please
Amy Poehler
Nota: 4 de 5 estrelas
4/5 (1891)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
No Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
Nota: 4.5 de 5 estrelas
4.5/5 (474)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
No Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
Nota: 4.5 de 5 estrelas
4.5/5 (266)
The Unwinding: An Inner History of the New America
No Everand
The Unwinding: An Inner History of the New America
George Packer
Nota: 4 de 5 estrelas
4/5 (45)
Fear: Trump in the White House
No Everand
Fear: Trump in the White House
Bob Woodward
Nota: 3.5 de 5 estrelas
3.5/5 (738)
Team of Rivals: The Political Genius of Abraham Lincoln
No Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
Nota: 4.5 de 5 estrelas
4.5/5 (234)
Principles: Life and Work
No Everand
Principles: Life and Work
Ray Dalio
Nota: 4 de 5 estrelas
4/5 (599)
Angela's Ashes: A Memoir
No Everand
Angela's Ashes: A Memoir
Frank McCourt
Nota: 4.5 de 5 estrelas
4.5/5 (440)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
No Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
Nota: 3.5 de 5 estrelas
3.5/5 (2259)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
No Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
Nota: 4 de 5 estrelas
4/5 (1090)
Rise of ISIS: A Threat We Can't Ignore
No Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
Nota: 3.5 de 5 estrelas
3.5/5 (137)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
No Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
Nota: 4.5 de 5 estrelas
4.5/5 (344)
Bad Feminist: Essays
No Everand
Bad Feminist: Essays
Roxane Gay
Nota: 4 de 5 estrelas
4/5 (1015)
Steve Jobs
No Everand
Steve Jobs
Walter Isaacson
Nota: 4.5 de 5 estrelas
4.5/5 (806)
John Adams
No Everand
John Adams
David McCullough
Nota: 4.5 de 5 estrelas
4.5/5 (2409)
The Glass Castle: A Memoir
No Everand
The Glass Castle: A Memoir
Jeannette Walls
Nota: 4.5 de 5 estrelas
4.5/5 (1712)
The Outsider: A Novel
No Everand
The Outsider: A Novel
Stephen King
Nota: 4 de 5 estrelas
4/5 (1839)
The Light Between Oceans: A Novel
No Everand
The Light Between Oceans: A Novel
M.L. Stedman
Nota: 4.5 de 5 estrelas
4.5/5 (789)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
No Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
Nota: 4.5 de 5 estrelas
4.5/5 (121)
Brooklyn: A Novel
No Everand
Brooklyn: A Novel
Colm Toibin
Nota: 3.5 de 5 estrelas
3.5/5 (1937)
The Woman in Cabin 10
No Everand
The Woman in Cabin 10
Ruth Ware
Nota: 3.5 de 5 estrelas
3.5/5 (2322)
A Man Called Ove: A Novel
No Everand
A Man Called Ove: A Novel
Fredrik Backman
Nota: 4.5 de 5 estrelas
4.5/5 (4609)
Little Women
No Everand
Little Women
Louisa May Alcott
Nota: 4 de 5 estrelas
4/5 (104)
Manhattan Beach: A Novel
No Everand
Manhattan Beach: A Novel
Jennifer Egan
Nota: 3.5 de 5 estrelas
3.5/5 (792)
Wolf Hall: A Novel
No Everand
Wolf Hall: A Novel
Hilary Mantel
Nota: 4 de 5 estrelas
4/5 (3811)
The Perks of Being a Wallflower
No Everand
The Perks of Being a Wallflower
Stephen Chbosky
Nota: 4.5 de 5 estrelas
4.5/5 (2102)
The Art of Racing in the Rain: A Novel
No Everand
The Art of Racing in the Rain: A Novel
Garth Stein
Nota: 4 de 5 estrelas
4/5 (4200)
A Tree Grows in Brooklyn
No Everand
A Tree Grows in Brooklyn
Betty Smith
Nota: 4.5 de 5 estrelas
4.5/5 (1929)
Sing, Unburied, Sing: A Novel
No Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
Nota: 4 de 5 estrelas
4/5 (1103)
Her Body and Other Parties: Stories
No Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
Nota: 4 de 5 estrelas
4/5 (821)
The Constant Gardener: A Novel
No Everand
The Constant Gardener: A Novel
John le Carré
Nota: 3.5 de 5 estrelas
3.5/5 (104)
Portfolio Day EMCEE
Documento3 páginas
Portfolio Day EMCEE
Maitem Stephanie Galos
85% (27)
Envision Algebra 1 Answers
Documento64 páginas
Envision Algebra 1 Answers
Mahmoud Soliman
Ainda não há avaliações
VCP Profile
Documento897 páginas
VCP Profile
tanirocks
Ainda não há avaliações
The Shortest Path Problem On Chola Period Built Temples With Dijkstra's Algorithm in Intuitionistic Triangular Neutrosophic Fuzzy Graph
Documento21 páginas
The Shortest Path Problem On Chola Period Built Temples With Dijkstra's Algorithm in Intuitionistic Triangular Neutrosophic Fuzzy Graph
Science Direct
Ainda não há avaliações
Etnoantropoloski Problemi 2023-02 11 Bojic
Documento24 páginas
Etnoantropoloski Problemi 2023-02 11 Bojic
Lazarovo
Ainda não há avaliações
Intercultural Communication Amongst Native Language Speakers Using Uncertainty Reduction Theory
Documento3 páginas
Intercultural Communication Amongst Native Language Speakers Using Uncertainty Reduction Theory
Czarina Jane S. Vidal
Ainda não há avaliações
Supplement+to+Mounce+Chs.26 27
Documento6 páginas
Supplement+to+Mounce+Chs.26 27
טימותיוסבִּנדִּי
Ainda não há avaliações
Conjunctions With Explanation
Documento6 páginas
Conjunctions With Explanation
Sunil Darisipudi D
Ainda não há avaliações
Salesforce Unit 1, 2, 3
Documento116 páginas
Salesforce Unit 1, 2, 3
Gauravs
100% (2)
Latvians, Were They Turks, The Phenomenon of The Turkic Language
Documento52 páginas
Latvians, Were They Turks, The Phenomenon of The Turkic Language
Erden Sizgek
Ainda não há avaliações
What I Can Do: Overseas Heroes
Documento4 páginas
What I Can Do: Overseas Heroes
JA Riel
83% (6)
Lesson Plan and Commentary: Delta COURSE
Documento7 páginas
Lesson Plan and Commentary: Delta COURSE
Phairouse Abdul Salam
Ainda não há avaliações
Master Data Management (MDM) : Training Curriculum: SCCM Current Branch + Imaging
Documento6 páginas
Master Data Management (MDM) : Training Curriculum: SCCM Current Branch + Imaging
Morling Global
Ainda não há avaliações
The Noble Art of Misquoting Camus - Fro
Documento21 páginas
The Noble Art of Misquoting Camus - Fro
fragitsa k
Ainda não há avaliações
British Council Test
Documento26 páginas
British Council Test
lamiaa
Ainda não há avaliações
21st Century Lit Q2
Documento5 páginas
21st Century Lit Q2
lyra collado
Ainda não há avaliações
Intro To Creative Writing Syllabus-Revised
Documento4 páginas
Intro To Creative Writing Syllabus-Revised
Tracy
Ainda não há avaliações
Literature Terminology & Vocabulary
Documento7 páginas
Literature Terminology & Vocabulary
M M
Ainda não há avaliações
Microsoft Word FAQ: (Frequently Asked Questions) by Charles Kenyon
Documento52 páginas
Microsoft Word FAQ: (Frequently Asked Questions) by Charles Kenyon
Sahithi Katakam
Ainda não há avaliações
How To Write A Screenplay Powerpoint
Documento30 páginas
How To Write A Screenplay Powerpoint
alan
Ainda não há avaliações
Jean Mark Mutumbo 5 Fallen Angels The Progenitors of Hybrid Races and Promulgators of Dark Knowledge
Documento12 páginas
Jean Mark Mutumbo 5 Fallen Angels The Progenitors of Hybrid Races and Promulgators of Dark Knowledge
isaacdsapi
Ainda não há avaliações
STI College Baguio Course Manager Mobile Application Proposal
Documento12 páginas
STI College Baguio Course Manager Mobile Application Proposal
Aldwin Villanueva
Ainda não há avaliações
Administration Guide Open Bee Scan (En)
Documento45 páginas
Administration Guide Open Bee Scan (En)
peka76
Ainda não há avaliações
University of Cambridge International Examinations Cambridge International Primary Achievement Test
Documento12 páginas
University of Cambridge International Examinations Cambridge International Primary Achievement Test
Susanne Krishnan
Ainda não há avaliações
BÁO CÁO THỰC TẬP
Documento14 páginas
BÁO CÁO THỰC TẬP
Sunshine's Books
Ainda não há avaliações
Blocking Notation
Documento1 página
Blocking Notation
Rachel Damon
100% (1)
Debugging Update Task and Background Task
Documento11 páginas
Debugging Update Task and Background Task
EmilS
Ainda não há avaliações
Assignment: Bachelor'S of Arts Programme (B.A.G)
Documento4 páginas
Assignment: Bachelor'S of Arts Programme (B.A.G)
Linthoi Nganbi
Ainda não há avaliações
Realism in The Novel of Thomas Hardy
Documento21 páginas
Realism in The Novel of Thomas Hardy
Andrew Lucas
Ainda não há avaliações
Introduction To Ordinary Differential Equations: DR Faye-Jan 2014
Documento48 páginas
Introduction To Ordinary Differential Equations: DR Faye-Jan 2014
Ashwin Veera
Ainda não há avaliações