CURRICULUM:

DATA SCIENCE PRODEGREE


INTRODUCTION - 24 HOURS

BATCH LAUNCH: Intro to Program | Curriculum Overview | Learning Methodology | Guest Lecture

ALL ABOUT DATA: Data | Variables | Data Types | Measures of Central Tendency in Data | Understanding Skewness in Data | Measures of Dispersion | Data Distribution

ANOVA/REGRESSION ANALYSIS: Analysis of Variance and Covariance | One way analysis of variance | Assumption of ANOVA | Statistics associated with one way analysis of variance | Interpreting the ANOVA Results | Two way analysis of variance | Interpreting the ANOVA Results | Analysis of Covariance | Examine Regression Results | What is Regression Analysis | Linear and Logistic Regression | Statistics Associated with Regression

PREDICTIVE MODELLING: Decision Trees and Neural Networks | Introduction to Predictive Modelling with Decision Trees | Assumptions | Formulate the Model | Estimate the Parameters | Check the Prediction Accuracy

TREE AND BAYESIAN NETWORK MODELS: Decision Trees, Bagging | Random Forests, Boosted Trees | Bayesian Classification Models

NEURAL NETWORKS: Perceptron, MLP, Back Propagation | Revision of Key Concepts


R - 66 HOURS

R BASICS: R Base Software | Understanding CRAN | RStudio The IDE | Basic Building Blocks in R | Sequence of Numbers in R | Understanding Vectors in R | Basic Operations | Operators and Types

R FUNCTIONS: Handling Missing Values in R | Subsetting Vectors in R | Matrices and Data Frames in R | Logical Statements in R | lapply, sapply, vapply and tapply Functions

LINEAR REGRESSION THEORY - R: Covariance and Correlation | Multivariate Analysis | Assumptions of Linearity | Hypothesis Testing | Limitations of Regression

BUSINESS CASE: MANAGING CREDIT RISK: Business Case : Managing Credit Risk | Meaning of Credit Risk | Impact of Credit Default | Sources of Data for Managing Risk | Understanding Loss Given Default | Understanding Default

LOSS GIVEN DEFAULT LINEAR REGRESSION R: Loss Given Default Linear Regression R | Extract Data in R | Univariate Analysis of Data | Apply Data Transformations | Bivariate Analysis of Data | Identify Multicollinearity in Data | Treatment on Data | Identify Heteroscedasticity | Discuss what could be the Reason for Heteroscedasticity | Modelling of Data | Variable Significance Identification | Model Significance Test | Predict using Testing Data Set | Validate the Model Performance

LOGISTIC REGRESSION THEORY - R: Reason for Logistic Regression | The Logistic Transform | Logistic Regression Modelling | Model Optimisation | Understanding ROC Curve

PROJECT 1: Project 1 - Default Modelling using Logistic Regression in R

SUPPORT VECTOR MACHINES (THEORY): Introduction to SVM | Classification as a Hyperplane Location Problem | Motivation for Linear Support Vectors | SVM as Quadratic Optimization Problem | Non Linear SVM | Introduction to Kernel Functions

PROJECT 2: Project 2 - Default Modelling using SVM in R

DECISION TREES: Introduction to Decision Trees | Theory of Entropy & Information Gain | Stopping Rules | Overfitting Problem | Cross Validations for Overfitting Problem | Pruning as a Solution for Overfitting | Ensemble Learning Notion | Concept of Bootstrap Aggregation | Concept of Random Forest

BUSINESS CASE: Business Case : Intrusion Detection in IT Network | Meaning of Intrusion in IT | Cost of Intrusion | Meaning of Intrusion Detection System

PROJECT 3: Project 3 - Network Intrusion Detection using Decision Tree & Ensemble Learning in R

GUEST LECTURE: Industry View from Expert | Refresher on R | Open House


PYTHON - 35 HOURS

PYTHON BASICS: What is Python? | Installing Anaconda | Understanding the Spyder Integrated Development Environment (IDE) | Lists, tuples, dictionaries, variables

DATA STRUCTURES IN PYTHON USED FOR DATA ANALYSIS: Intro to NumPy Arrays | Creating ndarrays | Indexing | Data Processing using Arrays | File Input and Output | Getting Started with Pandas

DATA FRAME MANIPULATION: Data Acquisition (Import & Export) | Indexing | Selection and Filtering | Sorting & Summarizing | Descriptive Statistics | Combining and Merging Data Frames | Removing Duplicates | Discretization and Binning | String Manipulation | PLUS: Project Work on Python

OTHER PREDICTIVE MODELLING TOOLS: Intro to Machine Learning | Random Forests | Sklearn Library & Statsmodels

PROJECT 4: Project 4 - Default Modelling using Logistic Regression in Python

PROJECT 5: Project 5 - Credit Risk Analytics using SVM in Python

PROJECT 6: Project 6 - Intrusion Detection using Decision Trees & Ensemble Learning in Python


SAS - 40 HOURS

INTRODUCTION TO SAS AND SAS PROGRAMS: What is SAS? | Key Features | Submitting a SAS Program | SAS Program Syntax | Examining SAS Datasets | Accessing SAS Libraries | Sorting and Grouping Reporting Data | Using SAS Formats

READING AND MANIPULATING DATA: Reading SAS Datasets | Reading Excel Data | Reading Raw Files | Reading Database Data | Creating Summary Reports | Combining Datasets

DATA TRANSFORMATIONS: Writing Observations | Writing to Multiple Datasets | Accumulating Total | Creating Accumulating Total for a Group of Data | Data Transformations

MACROS: Introduction to Macro Variables | Automatic Macro Variables | User Defined Macro Variables | Macro Variable Reference | Defining and Calling Macros | Macro Parameters | Global and Local Symbol Table | Creating Macro Variables in the Data Step

SQL: Introduction to SQL | How Does RDBMS Work? | SQL Procedures | Specifying Columns | Specifying Rows | Presenting Data | Summarizing Data | Writing Join Queries using SQL | Working with Subqueries, Indexes and Views | Set Operators | Creating Tables and Views using Proc SQL

PROJECT 7: Project 7 - Store Data Analytics in SAS


TABLEAU - 10 HOURS

TABLEAU BASIC: Introduction to Visualization | Working with Tableau | Visualization in Depth | Data Organisation | Advanced Visualization | Mapping | Enterprise Dashboards | Data Presentation

BEST PRACTICES FOR DASHBOARDING AND REPORTING AND CASE STUDY: Have a Methodology | Know Your Audience | Define Resulting Actions | Classify Your Dashboard | Profile Your Data | Use Visual Features Properly | Design Iteratively

INTRODUCTION TO THE GROUP PROJECT: Choice of three projects on various domains


JOB READINESS - 8 HOURS

RESUME BUILDING AND INTERVIEW PREP: Resume Building | Personal Branding | Tips and Resources | Interview Skills

1:1 MOCK INTERVIEWS: 1:1 Mock Interviews with Industry Veterans to Clear the Technical Round of Interviews to Give You Confidence to Face Real World Scenarios

GROUP PROJECT PRESENTATION: Groups Present their Project Presentation in Front of Their Peers and Industry Experts Evaluate the Solution (Refresher session for online batches)


HANDS-ON PROJECTS

DEFAULT MODELLING USING LOGISTIC REGRESSION IN R | DEFAULT MODELLING USING SVM IN R | NETWORK INTRUSION DETECTION USING DECISION TREE & ENSEMBLE LEARNING IN R

DEFAULT MODELLING USING LOGISTIC REGRESSION IN PYTHON | CREDIT RISK ANALYTICS USING SVM IN PYTHON | INTRUSION DETECTION USING DECISION TREES & ENSEMBLE LEARNING IN PYTHON

STORE DATA ANALYTICS IN SAS
