Escolar Documentos
Profissional Documentos
Cultura Documentos
An assignment ON :
BIVARIATE DATA ,SCATTER DIAGRAM, SIMPLE
LINEAR CORRELATION, MEASURE AND
PROPERTIES
SUBMITTED BY
SUBMITTED TO
DEBIPRASAD KANTAL
FS-10/14
B.F.SC. 2ND YR. 2ND SEM
DR. S. SAHU
DEPT. OF FES
If the scatter f points indicates that a curve car better fit the data, then the relationship
between the variables is said to be non-linear or curvilinear. Some curvilinear
relationships are shown in Fig.3 and Fig.4
Directions
1. Gather the data; determine the high and low values for each factor.
2. Decide which factor will be plotted on which axis.
a. Theorizing a cause and effect relationship, put the suspected cause on the
horizontal
axis, and the suspected effect on the vertical axis.
3. Draw and label the axes clearly.
a. Make the axes roughly the same length, creating a square plotting area.
b. Label each axis with increasing values from left to right, and from bottom to
top.
c. Label each axis to match the full range of values for that factor. In other
words, make the lowest numerical label slightly less than the lowest data
value, and the highest label slightly greater than the highest value. The data
should fill the whole plotting area.
4. Plot the paired data.
a. Use concentric circles (or offset dots) to indicate identical paired-data
points.
b. Differentiate distinct strata by using filled vs. unfilled symbols, or different
colors.
5. Title the chart and provide necessary annotations to describe what it shows.
6. Identify and classify the pattern of correlation shown by the plotting of the data.
7. Identify what you have learned; decide on your next steps.
Introduction to Correlation Analysis
The statistical methods discussed so far are primarily intended to describe a single
variable i.e., univariate populations. In this chapter the techniques that are useful in
studying the relationships that exist when the data on two or more variables is
available, are discussed.
If on the same individual, data on two variables say X and Y are listed, it is called a
bivariate population. In this bivariate population, for every value of X, there is a
corresponding value of Y. By treating these variables X and Y separately, measures
of central tendency, dispersion etc., can be worked out. In addition to these
measures it may be of interest to study the strength of relationship existing between
the variables and the nature of their relationship. The study of the former aspect is
referred to as correlation and the latter as regression analysis.
Correlation Classifications
Correlation can be classified into three basic categories:
Linear- Variables that are correlated through a linear relationship can display either
positive
or negative correlation
Non-linear- Two variables may be correlated but not through a linear model. This
type of model is called non-linear. The model might be one of a curve.
No correlation- Two quantitative variables may not be correlated at all
2:non-linear correlation
1:linear correlation
3: no correlation
Definition of simple linear correlation
Simple linear correlation is a measure of the degree to which two variables vary
together, or a measure of the intensity of the association between two variables.
The parameter being measure is (rho) and is estimated by the statistic r, the
correlation coefficient.
r can range from -1 to 1, and is independent of units of measurement.
The strength of the association increases as r approaches the absolute value of 1.0
A value of 0 indicates there is no association between the two variables tested
Strength of Correlation
In the above expression, X and Y denote the measurements on variables X and Y and n is
the number of pairs of observations i.e. the sample size.
Examples:2
Length and weight of juveniles of fish, Income and expenditure are some of the
examples for positive correlation
Rate of infection and yield, demand and supply are the some of the examples for
negative correlation
Growth & demand for fish, size of the shoe and number of intelligent boys / girls are
some of the example for no correlation.
When r = +1 there exists a strict linear relationship and the correlation between the variables
is said to be perfectly positive.
When r = -1 the relationship is linear and correlation between the variables is perfectly
negative.
The correlation coefficient equal to one (either positive or negative) indicates perfect
6
correlation between the variables. Perfect correlation rarely occurs in biological data though
values as high as 0.99 have been obtained in some cases. The closer the value of the
coefficient to one, the greater is the intensity or the degree of association between the
variables. Values of r near zero may arise when there is no relationship or when there is a
real relationship but it is not linear.
Properties of the correlation coefficient
The variables are said to be positively correlated if r is positive and negatively correlated if
r is negative. Positive correlation indicates that two variables are moving in the same
direction, i.e., as one increases the other increases or if one decreases the other decreases.
Negative correlation indicates that the two variables are moving in opposite direction i.e., as
one increases the other decreases
EXERCISE
1. What is data ?
2. What is bivariate data?
3. define univariate data ?
4. What are the types of variable ?
5. what is independent variable and dependent variable ?
6. What is a scattered diagram?
7. what are the basic categories of correlation ?
8. define linear correlation?
9. define non-linear correlation?
10. When does no correlation occur?
11. how to measure simple linear correlation?
12. What is the range of r for moderate correlation?
13. What do you mean by strength of correlation?
14. Classify correlation ?
15. What are the properties of the correlation coefficient ?
16. Give an example for positive correlation ?
17. Give an example for no correlation ?
18. Give an example for negative correlation ?
19. What are the type of strength of correlation ?
20. what is the range of r for strong correlation?
1. When the data is distributed quite close to the line the correlation is said to be ------------------(ans-strong)
2. correlation coefficient is denoted by--------------(ans-r)
3. The strength of a linear relationship is measured by---------(ans-correlation
coefficient)
4. Range of correlation coefficient is -------------------(-1 r 1.)
5. ---------------------is the range for moderate linear association .(0.65 < | r | < 0.90)
6. When one measurement is made on each observation-------------- analysis is applied.(
univariate)
7. ----------------------------variable is a condition or piece of data in an experiment that can
be controlled or changed.( independent)
8. Two quantitative variables may not be correlated at all is called--------------- (no
correlation)
Column-II
A
-1 to 1
B strong correlation
C a strict linear relationship
D r
E weak linear association
F
univariate analysis