Escolar Documentos
Profissional Documentos
Cultura Documentos
I. INTRODUCTION
A. The game of Cricket
The above graph [1] shows the percentage of
As mentioned in [1]. Cricket is a bat-and-ball team
resources remaining for a team to the number of overs
sport that is originated in England and is one of the
bowled. As we see it in an exponential graph reducing
most popular games in the world. Moving on from the
as more number of wickets keep falling and comes
conventional Test cricket, it has slowly ventured into
down to zero when the 9th wicket falls. Duckworth-
limited over formats like ODI and T20 so that a
Lewis observed a close connection between the
definite result is obtained, making it more entertaining
availability of these resources and team’s final score,
as a spectator sport. Sometimes due constraints like
which this algorithm tries to exploit.
bad weather (rain, sandstorms and bad lights),
In above table the remaining overs are plotted
floodlight failure and crowd issue certain amount of
against wickets lost.
overs are lost and hence a definite result isn’t
obtained. To overcome these obstacles methods have
been devised to revise target scores and/or declare a
winner.
E. Formula for D/L score calculation[6] in death over cause unfair result if inning gets
Let, truncated.
S: Team 1’s score [4] We are going to these tools to extract such
R1: Resources % available to team 1 (from R. P. patterns and will try to minimise bias. This is the
table) foundation for evaluation part of the project and base
R2: Resources % available to team 2 (from R. P. input for extension part.
table) Following are some observed patterns [2]:
T: Target score for team 2 1. Pattern 1: Team winning the toss wins the
matches in 66% cases.
Case 1: 2. Pattern 2: Team batting first wins the
If R1>R2, match in 64% cases.
T=S(R1/R2)+1; 3. Pattern 3: 54% of teams winning toss
Reduces team’s score in proportion to reduction in elects to field first in the rain affected
resources matches.
4. Pattern 4: Average of difference in run
Case 2: rate between winning and losing team
If R1=R2, scores is not significant
T=S+1;
No adjustment required A. Inferences from the above patterns[2]
1. D/L method has been biased towards the
Case 3: team batting 1st.
If R1<R2; 2. D/L method has been biased towards the
T=S+[G50*(R2-R1)/100]+1; team winning the toss.
Where G50 for matches involving ICC full 3. D/L method stresses more on wickets
member nations, at present is 235 rather than run rate and the runs scored.
Increase team 2 target score by the extra runs that
are predicted in accordance with the extra resources Based on the above observations, we have a
heuristic model to predict the winner in a match
F. Project Goals decided by D/L method. In this model, we calculate a
The primary goal of this project is to evaluate the weighted score for each of the two teams using the
Duckworth Lewis system and identify its limitations following formula[2]:
from scenarios and patterns. In addition to evaluation
of system we want to propose an interesting extension Weighted Score = (overall probability of winning)
which will be capable to address these limitations. We ×
are not needed to propose an entirely new model [α (toss winner or toss loser) +
some additional resources along with the former will β (batting or fielding) +
do the job. In the try to identify the shortcomings in γ (rainy season or no rainy season)
the existing D/L we are going to use the data mining × (home or away or neutral)]
classification algorithm C4.5 using WEKA tool and
propose an extension to D/L method. We would like Historically, in matches decided by Duckworth-
to show that data mining techniques (classification Lewis[2]:
using WEKA tool) can be used as effective tools to 1. The team batting first won on 64% of the
evaluate systems such as the D/L method and also occasions involving two of the top nine teams.
devise alternative models. 2. In 66% of the occasions, the team winning the
toss has won.
II. LITRATURE SURVEY 3. In 54% of the occasions, the team winning the
toss decides to field.
As mentioned above in order to extent the present 4. In 82% of the occasions, matches were affected
system WEKA tool was used to identify patterns in by rain.
the data. WEKA tools are mainly used to deal with
different predicators that different aspects of the game Where:
like pitch degradation, net run-rate, and toss. WEKA α=0.66 …from (1)
tools are applied on the sample dataset of matches β=0.64 …from (2)
which were affected by D/L and their scores were re- γ = 0.82 * 0.54 * 0.36 from (3 and 4)
evaluated.
In such matches the interruption of game for
multiple times in same match gives unfair prediction III. MOTIVATION
in target also while batting second the fall of wickets