
Scatter Plots

Objective: Determine the correlation of a scatter plot
| Jul 2012| © 2012 UPES
Scatter Plot

▪ A scatter plot is a graph of a collection of ordered pairs (x, y).

▪ The graph looks like a bunch of dots, but some of the graphs have a general shape or move in a general direction.



Positive Correlation

▪ If the x-coordinates and the y-coordinates both increase, then it is POSITIVE CORRELATION.
▪ This means that both are going up, and they are related.



Positive Correlation

If you look at the age of a child and the child’s height, you will find that as the child gets older, the child gets taller. Because both are going up, it is positive correlation.

Age 1 2 3 4 5 6 7 8
Height 25 31 34 36 40 41 47 55



Negative Correlation

▪ If one of the x-coordinates and y-coordinates is increasing while the other is decreasing, then it is NEGATIVE CORRELATION.
▪ This means that one is going up and one is going down, making a downhill graph. The two are related as opposites.



Negative Correlation

▪ If you look at the age of your family’s car and its value, you will find that as the car gets older, the car is worth less. This is negative correlation.

Age of car  1        2        3        4        5
Value       $30,000  $27,000  $23,500  $18,700  $15,350
No Correlation

▪ If there seems to be no pattern, and the points look scattered, then there is no correlation.

▪ This means the two are not related.



No Correlation

▪ If you look at the shoe size a baseball player wears and their batting average, you will find that shoe size does not make the player better or worse; the two are not related.



Plot the data on the graph such that homework time
is on the y-axis and TV time is on the x-axis.

TV         Homework
30 min.    180 min.
45 min.    150 min.
120 min.   90 min.
240 min.   30 min.
90 min.    120 min.
150 min.   120 min.
180 min.   90 min.

[Graph: blank grid; x-axis: Time Watching TV, y-axis: Time on Homework; both axes marked from 30 to 240 in steps of 30]
Describe the relationship between time spent on
homework and time spent watching TV.
Trend appears linear.
Trend is decreasing.
Negative correlation.

[Graph: scatter plot of the data; x-axis: Time Watching TV, y-axis: Time on Homework]
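
The direction of the trend can also be checked with a little arithmetic. The sketch below, in Python, takes the seven (TV, homework) pairs from the table and adds up the products of each pair's deviations from the two means; a negative total points to a decreasing trend. The function name `sum_of_deviation_products` is illustrative only and is not part of the original slides.

```python
# Seven (TV minutes, homework minutes) pairs from the table.
tv = [30, 45, 120, 240, 90, 150, 180]
homework = [180, 150, 90, 30, 120, 120, 90]


def sum_of_deviation_products(xs, ys):
    """Return the sum of (x - mean_x) * (y - mean_y) over all pairs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))


s = sum_of_deviation_products(tv, homework)
# The sign of the total gives the direction of the trend.
trend = "decreasing" if s < 0 else "increasing" if s > 0 else "flat"
print(trend)
```

A negative total here agrees with the downhill pattern seen on the graph.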
Scatter Diagram Method

A scatter diagram is a graph of observed plotted points, where each point represents the values of X and Y as a coordinate pair. It portrays the relationship between these two variables graphically.

A perfect positive correlation

[Graph: a straight line, labelled "A linear relationship", through the points (Height of A, Weight of A) and (Height of B, Weight of B); x-axis: Height, y-axis: Weight]
Karl Pearson’s Coefficient of Correlation

Pearson’s ‘r’ is the most common correlation coefficient.
Karl Pearson’s coefficient of correlation is denoted by ‘r’.

The coefficient of correlation ‘r’ measures the degree of linear relationship between two variables, say x and y.

-1 ≤ r ≤ +1

Degree of correlation is expressed by the value of the coefficient.
Direction of change is indicated by the sign (-ve) or (+ve).

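When deviations are taken from the actual means (x and y denote the deviations):

\[ r(x, y) = \frac{\sum xy}{\sqrt{\sum x^2 \, \sum y^2}} \]

When deviations are taken from assumed means (dx and dy denote the deviations, N the number of pairs):

\[ r = \frac{N \sum d_x d_y - \sum d_x \sum d_y}{\sqrt{N \sum d_x^2 - \left(\sum d_x\right)^2} \; \sqrt{N \sum d_y^2 - \left(\sum d_y\right)^2}} \]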
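
As a rough check that the two formulas agree, the Python sketch below applies both to the car age and value data from the earlier slide. The values are as read from that table, and the assumed means (2 and 20000) are arbitrary choices made for this illustration; the function names are hypothetical.

```python
from math import isclose, sqrt

# Age of car and its value, as read from the car table earlier in the slides.
age = [1, 2, 3, 4, 5]
value = [30000, 27000, 23500, 18700, 15350]


def r_actual_mean(xs, ys):
    """Pearson's r with deviations taken from the actual means."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    dx = [x - mean_x for x in xs]
    dy = [y - mean_y for y in ys]
    sum_xy = sum(a * b for a, b in zip(dx, dy))
    sum_x2 = sum(a * a for a in dx)
    sum_y2 = sum(b * b for b in dy)
    return sum_xy / sqrt(sum_x2 * sum_y2)


def r_assumed_mean(xs, ys, assumed_x, assumed_y):
    """Pearson's r with deviations taken from assumed means."""
    n = len(xs)
    dx = [x - assumed_x for x in xs]
    dy = [y - assumed_y for y in ys]
    numerator = n * sum(a * b for a, b in zip(dx, dy)) - sum(dx) * sum(dy)
    denominator = (sqrt(n * sum(a * a for a in dx) - sum(dx) ** 2)
                   * sqrt(n * sum(b * b for b in dy) - sum(dy) ** 2))
    return numerator / denominator


r1 = r_actual_mean(age, value)
# Assumed means chosen arbitrarily for illustration.
r2 = r_assumed_mean(age, value, 2, 20000)
print(round(r1, 3), round(r2, 3))
```

Both calls give the same coefficient, which is the point of the assumed-mean form: it avoids fractional deviations when the actual means are awkward numbers.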

Procedure for computing the correlation coefficient
▪ Calculate the means of the two series ‘x’ and ‘y’.
▪ Calculate the deviations of ‘x’ and ‘y’ in the two series from their respective means.
▪ Square each deviation of ‘x’ and ‘y’, then obtain the sums of the squared deviations, i.e. Σx² and Σy².
▪ Multiply each deviation under x by the corresponding deviation under y to obtain the products ‘xy’, then obtain the sum of these products, i.e. Σxy.
▪ Substitute the values in the formula.
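
The steps above can be followed one by one in a short Python sketch. It uses the age and height table from the positive correlation slide; the variable names are chosen for this illustration only.

```python
from math import sqrt

# Age and height of a child, from the positive correlation example.
age = [1, 2, 3, 4, 5, 6, 7, 8]
height = [25, 31, 34, 36, 40, 41, 47, 55]

# Step 1: means of the two series.
mean_x = sum(age) / len(age)
mean_y = sum(height) / len(height)

# Step 2: deviations from the respective means.
x = [a - mean_x for a in age]
y = [h - mean_y for h in height]

# Step 3: sums of the squared deviations.
sum_x2 = sum(d * d for d in x)
sum_y2 = sum(d * d for d in y)

# Step 4: sum of the products of paired deviations.
sum_xy = sum(dx * dy for dx, dy in zip(x, y))

# Step 5: substitute into r = (sum of xy) / sqrt(sum of x^2 * sum of y^2).
r = sum_xy / sqrt(sum_x2 * sum_y2)
print(mean_x, mean_y, sum_x2, sum_y2, sum_xy, round(r, 3))
```

For this table the coefficient comes out close to +1, in line with the strong uphill pattern of the data.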

Interpretation of Correlation Coefficient (r)
▪ The value of the correlation coefficient ‘r’ ranges from -1 to +1.
▪ If r = +1, then the correlation between the two variables is said to be perfect and positive.
▪ If r = -1, then the correlation between the two variables is said to be perfect and negative.
▪ If r = 0, then there exists no linear correlation between the variables.
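
These rules can be written as a small lookup. The Python sketch below is only an illustration: `describe_r` is a hypothetical helper, and it labels nothing beyond the cases listed above plus the sign of r.

```python
def describe_r(r):
    """Label a correlation coefficient using only the cases listed above."""
    if not -1.0 <= r <= 1.0:
        raise ValueError("r must lie between -1 and +1")
    if r == 1.0:
        return "perfect positive correlation"
    if r == -1.0:
        return "perfect negative correlation"
    if r == 0.0:
        return "no linear correlation"
    # Any other value: only the direction is reported here.
    return "positive correlation" if r > 0 else "negative correlation"


for value in (1.0, -1.0, 0.0, 0.8, -0.2):
    print(value, "->", describe_r(value))
```

A coefficient computed from real data will rarely equal exactly +1, -1 or 0, so in practice the sign and the size of r are read together.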

High Degree of positive correlation
▪ Positive relationship

r = +.80

[Graph: scatter plot; x-axis: Height, y-axis: Weight]
Degree of correlation

▪ Moderate Positive Correlation

r = +0.4

[Graph: scatter plot; x-axis: Weight, y-axis: Shoe Size]
Degree of correlation

▪ Perfect Negative Correlation

r = -1.0

[Graph: scatter plot; x-axis: Exam score, y-axis: TV watching per week]
Degree of correlation

▪ Moderate Negative Correlation

r = -.80

[Graph: scatter plot; x-axis: Exam score, y-axis: TV watching per week]
Degree of correlation

▪ Weak Negative Correlation

r = -0.2

[Graph: scatter plot; x-axis: Weight, y-axis: Shoe Size]
Degree of correlation
▪ No Correlation (horizontal line)

r = 0.0

[Graph: scatter plot; x-axis: Height, y-axis: IQ]
Degree of correlation (r)
[Graphs: four scatter plots, with r = +.80, r = +.60, r = +.40 and r = +.20]
2) Direction of the Relationship
Indicated by sign: (+) or (-).
▪ Positive relationship – Variables change in the same direction.
– As X is increasing, Y is increasing
– As X is decreasing, Y is decreasing
► E.g., As height increases, so does weight.
▪ Negative relationship – Variables change in opposite directions.
– As X is increasing, Y is decreasing
– As X is decreasing, Y is increasing
► E.g., As TV time increases, grades decrease

Advantages of Scatter Diagram

▪ Simple and non-mathematical method
▪ Not influenced by the size of extreme items
▪ First step in investigating the relationship between two variables

Disadvantage of scatter diagram

Cannot establish the exact degree of correlation



The Least Squares (Regression) Line

A good line is one that minimizes the sum of squared differences between the points and the line.
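
A minimal sketch of this idea in Python follows. It uses the standard closed-form solution for a straight line (slope = sum of products of deviations divided by sum of squared x deviations), which is not spelled out on the slides, and it borrows the car age and value data from the negative correlation slide purely as sample input. The function names are illustrative.

```python
# Age of car and its value, as read from the car table earlier in the slides.
age = [1, 2, 3, 4, 5]
value = [30000, 27000, 23500, 18700, 15350]


def least_squares_line(xs, ys):
    """Return (intercept, slope) of the line minimizing squared vertical differences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def sum_squared_differences(xs, ys, intercept, slope):
    """Sum of squared vertical differences between the points and a line."""
    return sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))


a, b = least_squares_line(age, value)
best = sum_squared_differences(age, value, a, b)
# A line with a slightly different slope gives a larger sum.
other = sum_squared_differences(age, value, a, b + 100)
print(a, b, best < other)
```

The comparison at the end illustrates the definition: moving the line away from the fitted one increases the sum of squared differences.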

The Simple Linear Regression Line

Example
► A car dealer wants to find the relationship between the odometer reading and the selling price of used cars.
► A random sample of 100 cars is selected, and the data are recorded in a file called odometer. The odometer reading is the independent variable x and the selling price is the dependent variable y.
► Find the regression line.
Q3 Scatter Diagrams
The table shows the ages and arm spans of seven students in a school.

Age (years)         16  13  13  10  18  10  15
Arm Span (inches)   62  57  59  57  64  55  61

a. Draw a scatter graph of the results
b. Describe the type and strength of correlation
c. Write a sentence explaining the relationship

Q4 The table shows the ages and second-hand values of seven cars.

Age of car (years)  2     1     4     7     10   9     8
Value of car (£)    4200  4700  2800  1900  400  1100  2100

a. Draw a scatter graph of the results
b. Describe the type and strength of correlation
c. Write a sentence explaining the relationship
Worksheet
Scatter Diagrams
Q5 The table shows the daily rainfall and the number of sunbeds sold at a resort on the south coast.

Amount of rainfall (mm)  0    1    2    5    6    9    11
Number of sunbeds sold   380  320  340  210  220  110  60

a. Draw a scatter graph of the results
b. Describe the type and strength of correlation
c. Write a sentence explaining the relationship
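
Part b of each question can be checked against a computed coefficient. The Python sketch below applies the deviation-from-mean formula for r to the three worksheet tables (Q3, Q4 and Q5); `pearson_r` and the dictionary labels are names chosen for this illustration.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson's r from deviations about the actual means."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sum_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sum_x2 = sum((x - mean_x) ** 2 for x in xs)
    sum_y2 = sum((y - mean_y) ** 2 for y in ys)
    return sum_xy / sqrt(sum_x2 * sum_y2)


# Data copied from the worksheet tables.
worksheet = {
    "Q3 age vs arm span": ([16, 13, 13, 10, 18, 10, 15],
                           [62, 57, 59, 57, 64, 55, 61]),
    "Q4 car age vs value": ([2, 1, 4, 7, 10, 9, 8],
                            [4200, 4700, 2800, 1900, 400, 1100, 2100]),
    "Q5 rainfall vs sunbeds": ([0, 1, 2, 5, 6, 9, 11],
                               [380, 320, 340, 210, 220, 110, 60]),
}

results = {name: pearson_r(xs, ys) for name, (xs, ys) in worksheet.items()}
for name, r in results.items():
    print(f"{name}: r = {r:+.3f}")
```

The sign of each result gives the type of correlation, and how close it lies to +1 or -1 gives the strength.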

Q3, Q4 Scatter Diagrams

[Graph Q3: scatter plot; x-axis: Age (Years), y-axis: Arm Span (inches)]
[Graph Q4: scatter plot; x-axis: Age of Car (years), y-axis: Value of car (£)]



Q5, Q8 Scatter Diagrams

[Graph Q5: scatter plot; x-axis: Amount of rainfall (mm), y-axis: Number of sun beds sold]
[Graph Q8: scatter plot; x-axis: Fat (g), y-axis: Calories]
