Escolar Documentos
Profissional Documentos
Cultura Documentos
Course Brochure
DATA SCIENCE foundation
DATA SCIENCE advanced
python
These programs are experien al applica on driven programs for highly mo vated working
professionals to become data science prac oners.
SESSION DURATION
2 HRS (2 Hr – Concept Learning and Concept Implementation)
program curriculum
Module 1 – Statistics and Probability
Define Sta s cal Inference.
List the terminologies of Sta s cs.
Probability Theory.
Condi onal Probability.
Bayes Theorem.
Module 2 – Introduction to R
R basics, understanding of data types, func ons, control structures, data
manipula ons, date and string manipula ons.
List the terminologies of Sta s cs.
Module 6 – Classification
Define Classifica on
Decision Trees.
Random Forest.
Naïve Bayes.
Support Vector Machines.
“ Pr me
Google has gone from two
deep learning projects to 1000 today.
~ Fortune C L A S S E S
data science advanced - course overview
SESSION DURATION
2 HRS 30 MINS (1 Hr 30 Mins – Concept Learning, 1 hr – Concept Implementation)
Collaborate with mentors on coding assignments and projects in the last 1 hr of
every session.
program curriculum
1.Introduction to Probability and Statistics for Data Science
This module aims at preparing you for the essen al skill of thinking like a sta s cian.
This module will enable you to change your analy cal thinking process, and you will begin to start
looking at data and numbers from a different perspec ve. This is a fundamental
module and strong concepts in this area will enable you to differen ate yourself as a
Data Scien st.
From a tools perspec ve, you will gain confidence with tools like R and Excel.
“ Pr me
The big technology trend is to make systems
intelligent and data is the raw material.
C L A S S E S
~ Amod Malviya, CTO, Flipkart
Fundamentals of Probability
Introduc on to random variables
Probability theory
Condi onal probability
Bayes Theorem
The Concept of a data set
Probability distribu on and differences between discrete and con nuous distribu ons
“ Pr me
Tech giants have acquired 140
AI companies since 2011.
~ Observer Magazine C L A S S E S
Time Series Analysis
Decomposi on of Time Series
Trend and Seasonality detec on and forecas ng
Smothering Techniques
Understanding ACF & PCF plots
ARIMA Modeling
Holt-Winter Method
Distance-based classifiers.
Neural networks
Perceptron and Single Layer Neural Network.
Back Propaga on algorithm and a typical Feed Forward Neural Net.
Hands-on with R with a Case.
“ Pr me
Without big data, companies are blind and deaf,
wandering out onto the web like deer on a freeway.
~ Geoffrey Moore, American organizational theorist C L A S S E S
Support vector machines (SVM).
Linear learning machines and kernel space, making kernels and working in feature space.
SVM algorithm and comparison with Neural Nets
Demonstrate the working of SVM classifica on problems using a business case in R.
Ensemble methods
Bagging and boos ng and its impact on bias and variance
C 5.0 boos ng
Random Forest
AdaBoost
Gradient boos ng machines
Unsupervised learning algorithm-Clustering
Different clustering methods; review of several distance measures
Itera ve distance-based clustering
Dealing with con nuous, categorical values in K-Means
Construc ng a hierarchical cluster, K-medoids, k-mode and density-based clustering to
handle different types in prac ce.
Test for stability check of clusters
Hands on implementa on of each of these methods will be conducted in R.
Bayesian belief nets, Naïve Bayes, popular techniques to handle Overfi ng and
Underfi ng
Introduc on to genera ve techniques
Bayesian belief nets (BBN)
Naïve Bayes- a special case of BBN
Hands-on Naïve Bayes in R
How to avoid Overfi ng and Underfi ng
Refresher on all the machine learning algorithms
SESSION DURATION
3 HRS (1 Hr 30 Mins – Concept Learning, 1hr 30 mins - Collaborate with mentors on
coding assignments and projects .
program curriculum
1. Introduction to Python
Overview of Python
The Companies using Python
Other applica ons in which Python is used
Discuss Python Scripts on UNIX/Windows
Values, Types, Variables
Operands and Expressions
Condi onal Statements
Loops
Command Line Arguments
Wri ng to the screen
“ Pr me
AI is the most important technology on the
planet today.
~ Dave Choplin C L A S S E S
2. Sequences and File operations
Python files I/O Func ons
Numbers
Strings and related opera ons
Tuples and related opera ons
Lists and related opera ons
Dic onaries and related opera ons
Sets and related opera ons
5. Data Visualization
matplotlib library
Grids, axes, plots
Markers, colours, fonts and styling
Types of plots - bar graphs, pie charts, histograms
Contour plots
CAPSTONE project
The course culminates in an enterprise project for a fic ous client that will expose you to every stage
of the data science process – from data acquisi on and prepara on to evalua on, interpreta on,
deployment, opera ons, and op miza on. The project is an opportunity for you test your skills and
demonstrate your ability to invent solu ons for real-world problems.
“ Pr me
In the next 10 years, data science and software will
do more for medicine than all of the biological sciences together
~ Vinod Khosla, American-Indian engineer C L A S S E S
lead trainers
sravan chaitanya Siddhartha
IIT - Madras, IIM - Ahmedabad IIT - Madras, University of Alberta
A Competent Professional with about Siddhartha is the Founder of Zessta.
15+ years of experience in Consul ng, A hardcore technologist Siddhartha’s
Data Science, successfully trained passion, lies in developing innova ve and
more than 4000+ working professionals futuris c products in machine learning
in becoming Data Scien sts. and AI.
about us
Prime Classes is a career accelerator for youth promoted by IIT and IIM Alumni. Our vision is to
create a na onal level talent pool by skilling millions of youth for growing industry needs in
new age technologies viz Data Science, AI and Machine Learning.
“
“ An investment in knowledge pays the best interest
Pr me C L A S S E S