
Key ideas: neurons are interconnected; a network is arranged as input layer --- hidden layer --- output layer; forward propagation is how the network classifies inputs; cost = generated output - actual output.

Neural Networks are a set of algorithms designed to learn the way our brain works.
Biological neurons inspire the structure and function of neural networks.

Each artificial neuron is modeled as follows:

Each neuron receives inputs.
Each input is multiplied by a weight, and a bias is added.
The weighted inputs and the bias are summed.
This weighted sum triggers the activation function.
The activation function takes the weighted sum and fires the output, as in the sketch below.
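Below is a minimal Python (NumPy) sketch of one such neuron. The input values, weights, bias, and the choice of a sigmoid activation are illustrative assumptions, not values given in this course.

import numpy as np

def sigmoid(z):
    # Activation function: squashes the weighted sum into the range (0, 1).
    return 1.0 / (1.0 + np.exp(-z))

# Made-up inputs, weights, and bias for illustration.
x = np.array([0.5, 0.3, 0.2])      # inputs
w = np.array([0.4, -0.6, 0.9])     # one weight per input
b = 0.1                            # bias

z = np.dot(w, x) + b               # weighted sum of inputs plus bias
output = sigmoid(z)                # activation function fires the output
print(output)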

How do Neural Networks Learn?

Welcome back!

Hope you have understood the structure of a Neural Network.


Hope you also know how the weights and biases combine to produce an output.
How do you think an artificial neuron learns?

Supervised Learning

In Supervised Learning, you give a list of inputs and the corresponding outputs for a Neural
Network to learn from.
Based on the actual output, the neuron adjusts its weights and biases. This is
achieved by a process called Training.

Back Propagation

Neural Networks learn using Back Propagation.


When the input is combined with the weights and bias to trigger the activation, it
is called forward propagation.
When the error estimates are propagated backward to adjust the weights and biases,
it is called backpropagation.

Steps in Back Propagation

During training, the input combines with random weights and bias values.
They trigger the activation function.
The output of the activation function is the predicted output.
The predicted output is compared to the actual output.
The difference is propagated backward.
The bias values and weights are updated based on the error values.
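The training steps above can be sketched for a single sigmoid neuron trained with gradient descent. The toy OR dataset, mean-squared-error cost, and learning rate are assumptions made only for illustration.

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)
X = np.array([[0.0, 0.0], [0.0, 1.0], [1.0, 0.0], [1.0, 1.0]])  # inputs
y = np.array([0.0, 1.0, 1.0, 1.0])                               # actual outputs (logical OR)

w = rng.normal(size=2)   # training starts with random weights
b = rng.normal()         # and a random bias
lr = 0.5                 # learning rate

for epoch in range(2000):
    z = X @ w + b                      # weighted sum (forward propagation)
    pred = sigmoid(z)                  # predicted output of the activation function
    error = pred - y                   # compare predicted output with actual output
    # Propagate the difference backward: gradient of the error w.r.t. weights and bias.
    grad = error * pred * (1 - pred)
    w -= lr * (X.T @ grad) / len(y)    # update weights based on the error values
    b -= lr * grad.mean()              # update bias based on the error values

print(np.round(sigmoid(X @ w + b), 2))  # should approach [0, 1, 1, 1]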

Shallow Learning

You may have come across the term Shallow Learning. When does shallow learning
happen in Neural Networks?
When you have just one hidden layer between your inputs and outputs.
Another way of interpreting that idea is that you are performing just one task at a
time.

Shallow Learning - Tasks and Algorithms

Some of the tasks that can be grouped under shallow learning are:

Feature Extraction - Preprocessing


Mapping specific features into vector space - Kernel methods
Rule-based decision making - Decision Trees
Combining various estimates - Ensemble methods
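As a rough illustration of two of the approaches listed above, here is a small scikit-learn sketch with a rule-based decision tree and an ensemble of trees. The synthetic dataset and parameters are assumptions, not part of the course material.

from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Synthetic classification data made up for the example.
X, y = make_classification(n_samples=500, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

tree = DecisionTreeClassifier(max_depth=3).fit(X_train, y_train)          # rule-based decision making
forest = RandomForestClassifier(n_estimators=100).fit(X_train, y_train)   # combining various estimates

print("decision tree accuracy:", tree.score(X_test, y_test))
print("random forest accuracy:", forest.score(X_test, y_test))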

Deep Learning

In deep learning, you have two or more hidden layers. This allows the algorithm to
perform multiple tasks.

There are multiple stacked layers of neurons.

There are specialized layers based on data type - Image, Text, and Voice.

The Story So Far

Hope you got an idea of an Artificial Neural Network


Hope you understood how they learn with Back Propagation
Hope you learned the difference between shallow and deep learning.
Now you will learn how different Neural Network Architectures are used for
different data types.

Structured Data
Any data that is available in a database format is structured data.
When it comes to structured data, shallow learning models can be applied for
supervised learning.

Unstructured Data comprises:

Voice
Text
Image
Videos
Deep Learning Models work well for unstructured data.

Sequence Models work well for Voice Data.


Recurrent Neural Networks can be trained for Translation and many Conversational
Systems.

Text Data

Recurrent Neural Networks work well for Text Data.

However, even a general multilayer perceptron can be trained for text data.

Word2Vec is a popular Deep Learning Representation of Text Data.

Images and Videos

When it comes to Images, Convolutional Neural Networks (CNNs) are the best. CNNs
can be stacked to extract features and detect objects from Images.
Restricted Boltzmann Machine

Restricted Boltzmann Machine (RBM) was conceptualized by Geoff Hinton. It is used in:

Dimension Reduction
Regression
Classification
RBM has a very simple architecture. You will learn that in detail in the following
cards.

RBMs have two layers:

Input
Hidden
Each node in the input layer is connected to every node in the hidden layer. Nodes within
the input layer are not connected to one another.

In the above figure, v represents the input (visible) layer and h represents the hidden layer.

Learning in RBM

RBMs learn by reconstructing the inputs.


They learn in an unsupervised fashion.
They reconstruct the input in a backward pass and update the parameters
accordingly, as in the sketch below.
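Here is a rough NumPy sketch of one reconstruction-and-update step (one step of contrastive divergence, the usual way RBMs are trained). The data, layer sizes, and learning rate are assumptions made only for illustration.

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)
n_visible, n_hidden = 6, 3
W = rng.normal(scale=0.1, size=(n_visible, n_hidden))
b_v = np.zeros(n_visible)            # visible (input) layer bias
b_h = np.zeros(n_hidden)             # hidden layer bias
lr = 0.1

v0 = rng.integers(0, 2, size=(1, n_visible)).astype(float)   # one binary input vector

# Forward: infer hidden units from the visible (input) layer.
h0_prob = sigmoid(v0 @ W + b_h)
h0 = (h0_prob > rng.random(h0_prob.shape)).astype(float)

# Backward: reconstruct the input from the hidden units.
v1 = sigmoid(h0 @ W.T + b_v)
h1_prob = sigmoid(v1 @ W + b_h)

# Update parameters from the difference between the data and its reconstruction.
W += lr * (v0.T @ h0_prob - v1.T @ h1_prob)
b_v += lr * (v0 - v1).flatten()
b_h += lr * (h0_prob - h1_prob).flatten()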

Deep Belief Network (DBN) is an extension of RBM.


DBNs are stacked RBMs layer by layer.
The diagram above explains how they are stacked.
Learning in DBN

DBN uses pretraining before applying backpropagation.


This pretraining is similar to how RBMs learn by reconstruction.
Generative weights determine how parameters in one layer depend on the parameters
in the previous layer.
In the forward pass, the recognition weights are used.
In the reverse pass, the generative weights are used.
This helps prevent learning from getting stuck in local minima.

Applications of DBN

Sequence Recognition in Videos


Facial Recognition
Motion capture

Machines Interpreting Images

Machines understand images based on pixel intensity.


So any image in matrix representation is a grid of numbers from 0 to 255 per channel.
0 - the darkest intensity (black in a grayscale image)
255 - the brightest intensity (white in a grayscale image)

Feature Extractions
Every image has the same objects represented in a slightly different manner.

Each object has to be recognized.


Every feature should be extracted.

How can you identify objects and extract each feature and send it to a neural
network for learning?

Convolution Layer
When a filter is superimposed on the image and run across it once, the output
obtained is another image with reduced dimensions. This one layer of filtering is
called the convolution layer.

The above GIF explains a sample filter.
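A minimal NumPy sketch of this idea: a 3x3 filter slides over a 5x5 image and produces a smaller output image. The image values and the filter are made up for illustration.

import numpy as np

image = np.arange(25, dtype=float).reshape(5, 5)    # toy 5x5 "image"
kernel = np.array([[1, 0, -1],
                   [1, 0, -1],
                   [1, 0, -1]], dtype=float)        # a simple edge-detecting filter

out_h = image.shape[0] - kernel.shape[0] + 1
out_w = image.shape[1] - kernel.shape[1] + 1
output = np.zeros((out_h, out_w))

for i in range(out_h):
    for j in range(out_w):
        patch = image[i:i + 3, j:j + 3]              # region currently under the filter
        output[i, j] = np.sum(patch * kernel)        # element-wise multiply and sum

print(output.shape)   # (3, 3): the output image has reduced dimensions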

Pooling reduces the size of an image. Only the pixels that stand out (for example,
the maximum value in each region for max pooling) are carried forward by pooling.

The above-seen image is an image pooled after convolution.

You can also observe the dimensions reduced in the image after pooling.

Output Layer

The Output layer is the last layer, where the image representation is converted into a
1-dimensional vector.

This is how a typical Convolutional Neural Network architecture looks.
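As a hedged sketch, a typical architecture of this kind could be written in Keras as below (assuming TensorFlow is installed); the layer sizes, input shape, and 10-class output are arbitrary illustrative choices, not values from this course.

from tensorflow.keras import layers, models

model = models.Sequential([
    layers.Conv2D(32, (3, 3), activation="relu", input_shape=(28, 28, 1)),  # convolution layer
    layers.MaxPooling2D((2, 2)),                                            # pooling reduces the size
    layers.Conv2D(64, (3, 3), activation="relu"),
    layers.MaxPooling2D((2, 2)),
    layers.Flatten(),                                # convert to a 1-dimensional vector
    layers.Dense(10, activation="softmax"),          # output layer
])
model.summary()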

Imagenet Database

ImageNet is a database of millions of images scraped from the internet, built for
object recognition software research.

All the images have been annotated by hand.


Every year they conduct an ImageNet Large Scale Visual Recognition Challenge
(ILSVRC) to gauge how accurately different software can classify objects.

Pre-Trained Models

There are multiple pre-trained models available for Image Classification and object
detection.

Some of them include

AlexNet
ResNet
VGG19
InceptionResNet
GoogLeNet
DenseNet
NASNet
Some of these models have participated in the ImageNet Challenge in the past.

How do Pre-Trained Models Work?

You can load any of the available pre-trained models and pass a random image. The
models will output the probability of each object in that image.
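A minimal Keras sketch of this workflow, assuming TensorFlow is installed; the file name 'sample.jpg' is a hypothetical path, not a file from the course.

import numpy as np
from tensorflow.keras.applications.resnet50 import (ResNet50, preprocess_input,
                                                    decode_predictions)
from tensorflow.keras.preprocessing import image

model = ResNet50(weights="imagenet")                          # load a pre-trained model

img = image.load_img("sample.jpg", target_size=(224, 224))   # hypothetical image path
x = preprocess_input(np.expand_dims(image.img_to_array(img), axis=0))

preds = model.predict(x)
# Probability of each object the model believes is in the image (top 3 shown).
print(decode_predictions(preds, top=3)[0])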

RNN for Sequence Data

Sequence Data has a temporal or time component to it. The current component is
dependent on the previous component. When you go for a normal Deep Neural Network
Architecture, you cannot do justice to the temporal part.

Hence the model of choice is Recurrent Neural Network (RNN).

The image shows the architecture of a single RNN cell. During training and
implementation, the cell is stacked in a series across time steps so it can take a
sequence of data as input and produce a sequence as output. This is what lets these
neurons handle the temporal aspect and retain memory.

Forward Propagation

Input is provided to the network at each time step.


In each successive time step, the output of the (n-1)th neuron becomes the input of the
nth neuron in the sequence.
This is repeated until all the information is loaded.
Once the time steps are exhausted, the output is calculated.
Depending on the error, it is backpropagated to adjust the weights and biases, as sketched below.
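The forward pass above can be sketched in NumPy as follows. The weights, layer sizes, and toy input sequence are assumptions made only for illustration.

import numpy as np

rng = np.random.default_rng(0)
input_size, hidden_size, output_size = 3, 4, 2

W_xh = rng.normal(scale=0.1, size=(input_size, hidden_size))    # input-to-hidden weights
W_hh = rng.normal(scale=0.1, size=(hidden_size, hidden_size))   # hidden-to-hidden (recurrent) weights
W_hy = rng.normal(scale=0.1, size=(hidden_size, output_size))   # hidden-to-output weights
b_h = np.zeros(hidden_size)
b_y = np.zeros(output_size)

sequence = rng.normal(size=(5, input_size))   # 5 time steps of toy input
h = np.zeros(hidden_size)                     # initial memory (hidden state)

for x_t in sequence:                          # repeat until the time steps are exhausted
    h = np.tanh(x_t @ W_xh + h @ W_hh + b_h)  # previous step's output feeds the next step

y = h @ W_hy + b_y                            # output calculated once time steps are exhausted
print(y)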

Back Propagation
The image shows how the RNNs are stacked in a series. It also shows how the weights
and biases are calculated in each time step.

Recursive Neural Tensor Networks

RNTNs are networks that are exclusively used for Natural Language Processing.

NLP is an application of computer science for understanding and synthesizing natural
human language. Since RNTNs have a recursive tree structure, they are used in
NLP.

NLP

NLP has many components to understand Natural Language, some of them include:

Lemmatization
Named Entity Recognition
Part of Speech Tagging
Segmentation
A combination of the components mentioned above is applied to Natural Language to
make it machine understandable.

RNTNs are leveraged for a lot of these transformations.


Word2vec

Word2vec is a popular deep learning model in which words are represented as vectors in
an n-dimensional space.

When words are interpreted as vectors, understanding intent and context becomes
easier in NLP.

Word2vec models are available with pre-trained corpora and vectors. Each curated word corpus is
transformed into the word2vec format so that the relationships between words can be analyzed.
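A small sketch of training a word2vec model with the gensim library (assuming gensim 4.x is installed); the toy corpus and parameters are illustrative assumptions.

from gensim.models import Word2Vec

# Tiny made-up corpus: each sentence is a list of tokens.
corpus = [
    ["neural", "networks", "learn", "from", "data"],
    ["words", "are", "represented", "as", "vectors"],
    ["vectors", "capture", "context", "and", "intent"],
]

model = Word2Vec(sentences=corpus, vector_size=50, window=2, min_count=1, epochs=50)

print(model.wv["vectors"][:5])                    # each word becomes an n-dimensional vector
print(model.wv.most_similar("vectors", topn=2))   # nearby words in the vector space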

Autoencoders
An Autoencoder's architecture comprises two deep belief networks stacked against each
other symmetrically.
The number of nodes in the input and output layers is the same in Autoencoders.
They are predominantly used for Dimension Reduction as a substitute for Principal
Component Analysis.

Learning in Autoencoders
Since Autoencoders are an extension of Deep Belief Networks, they learn in an
unsupervised manner. They learn by reconstructing the input.

In de-noising autoencoders, if the input is fed in with added noise, the output is a
de-noised version of the input, as in the sketch below.
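A minimal Keras sketch of a de-noising autoencoder, assuming TensorFlow is installed; the toy data, noise level, and layer sizes are illustrative assumptions. Note that the input and output layers have the same number of nodes.

import numpy as np
from tensorflow.keras import layers, models

x = np.random.rand(1000, 64)                        # toy clean data
x_noisy = x + 0.1 * np.random.randn(1000, 64)       # the same data with added noise

autoencoder = models.Sequential([
    layers.Dense(16, activation="relu", input_shape=(64,)),   # encoder compresses the input
    layers.Dense(64, activation="sigmoid"),                    # decoder reconstructs it (same size as input)
])
autoencoder.compile(optimizer="adam", loss="mse")

# Unsupervised: the target is the clean input itself, so the network learns
# to output a de-noised version of what it is fed.
autoencoder.fit(x_noisy, x, epochs=5, batch_size=32, verbose=0)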

Applications

Autoencoders are applied in the following areas:

Data Compression
Image Search
Information Retrieval
Topic Modeling
-----------------------------------------------------------------------------------
Neural Networks Algorithms are inspired from the structure and functioning of the
Human Biological Neuron.
true

Recurrent Networks work best for Speech Recognition.


true

A Shallow Neural Network has only one hidden layer between Input and Output layers.
true

GPU stands for __________.


Graphics Processing Unit

Gradient at a given layer is the product of all gradients at the previous layers
true

Recurrent Neural Networks are best suited for Text Processing.


true

Name the component of a Neural Network where the true value of the input is not
observed.
hidden layer

The rate at which cost changes with respect to weight or bias is called
gradient

_____________________ is a Neural Net's way of classifying inputs.


Forward propagation

In a Neural Network, all the edges and nodes have the same Weight and Bias values.
false

Process of improving the accuracy of a Neural Network is called _______________.


training

Support Vector Machines, Naive Bayes and Logistic Regression are used for solving
___________________ problems.
classification
What is the difference between the actual output and generated output known as?
cost

________________ works best for Image Data.


convolution

Data Collected from Survey results is an example of


structured data

_______________ is a recommended Model for Pattern Recognition in Unlabeled Data.

Autoencoders

Prediction Accuracy of a Neural Network depends on _______________ and


______________.
Weight and Bias

What is the method to overcome the Decay of Information through time in RNN known
as?
Gating

All the Visible Layers in a Restricted Boltzmann Machine are connected to each
other.
false

A _________________ matches or surpasses the output of an individual neuron to a visual stimulus.
convolution

What is the best Neural Network Model for Temporal Data?


RNN

The measure of Difference between two probability distributions is known as


________________________.
KL Divergence

Recurrent Network can input Sequence of Data Points and Produce a Sequence of
Output.
true

Restricted Boltzmann Machine expects the data to be labeled for Training.


false

All the neurons in a convolution layer have different Weights and Biases.
false

What are the two layers of a Restricted Boltzmann Machine called?


visible & hidden

RELU stands for


Rectified Linear Unit

What does LSTM stand for?


Long Short Term Memory

A Deep Belief Network is a stack of Restricted Boltzmann Machines.


true
