Escolar Documentos
Profissional Documentos
Cultura Documentos
1. Basic idea
3. Rank-Aggregation Methods
5. Rating Aggregation
The dictum ,,the whole is greater than the sum of its parts
Merge several ranked lists into in order build a single new superior list
l1 l2 l3 lk
...
Aggregated list
Arrows theorem:
No voting system can ever simultaneously satisfy the four criteria of an
unrestricted domain,independence of irrelevant alternatives,the Pareto
priciple ,and non- dictatorship.
A rank aggregated list is only as good (or as bad ) as the lists from which it is
built.
Mathematical ideas (Llull count ) of Ramon Llull (1232-1315) and date back to 1770 by
Jean-Charles de Borda .
For each ranked list ,each candidate receives a score that is equal to the number of
candidates he or she outranks.
Massey Colley
OD
Example : 1st 1st 1st
2nd 2nd 2nd
3rd 3rd 3rd
4th 4th 4th
5th 5th 5th
Handle ties
Example:
ranked list from best to worst as {Miami ,VT,UNC/UVA,Duke} (the slash / indicates a tie)
Duke
Borda scores Miami
UNC
UVA
VT
The integers representing rank in several ordered lists are averaged to create
a rank aggregated list.
Example:
Example :
let l1 and l2 be two full ranked lists .
Team i is ranked 1st in l1 ,but 3rd in l2.
Team j is ranked 2nd in both lists.
=>Average rank produces a tie for second place.
1. Using of past data with pair-wise matchups(i and j played each other,then
the winner should be ranked ahead of the losing team).But a tie-breaking
team is more difficult when averaging more than two lists.
Example : i defeated j ,j defeated k ,k defeated i =>circular tie.
2. Tie breaking list (the superior).We have a method for determining which
ranke list is better (chapter 16).
Average rank can be applied only if all lists are full.
Produce an aggregated list that does not contain ranks .(An average rating
vector ->a ranking vector).
1st 1st
2nd 2nd
. A is stronger . A is much
. than B . stronger than B
. .
last last
Each ranked list of length n provides data for =n(n-1)/2 simulated games.
A margin of victory are related to the difference in the ranked position of the
two teams.
Combiner Method
Aggregated List
Combiner Method
Simulated data method of rank aggregation using OD,Massey ,and Colley as the combiner method
1.The ranked aggregated list is only as good (or as bad ) as the input lists.
2. The combiner method acts as a ,,smoother in that it minimizes the effect of outliers
,which are lists containing anomalies that seem inconsistent with the rankings in other
lists.
Example:
We removed Duke from data set
-Massey and Colley methods are used as the combiner methods
=>swap in the rankings between UNC and UVA.
-OD method is the combiner method =>create a consistent ranking.
the OD rank-aggregation method is robust.
Combiner Method
OD(r=o/d) Massey Colley
7 6 5 1st
1 2nd
UNC 3rd
UVA
4th
5th
2
Apply the PagRank method
Two new rank aggregation methods are much more sophisticated then
two old methods.
After several lists l1 ,l2 ,l3 ,...,lk haven been aggregated into one list , a
refinement step called local Kemenization can be implementd to the further
improve the list .
The sum of Kendall tau measures between input list li and where
i=1,...,k.
Dont exist pair-wise swaps of items in the list -> is locally Kemeny
optimal.
Check #3
Question :Does the fourth place item(UVA) beat the third place item (UNC) in the majority of the input lists?
Answer :Yes,UVA beats UNC in two out of the three input lists ->Action :Swap UNC and UVA in
Check #4
Question :Does the fifth place item(Duke) beat the fourth place item (UVA) in the majority of the input lists?
Answer :No,Duke never beats UVA in the three input lists ->Action :No
1st
2nd
The locally Kemenized list is 3rd
4th
5th
.65 VT
.027 Miami
.50 UNC
-3.4 UVA
-8.0 UNC
.36 UVA .012 UVA
.006 UNC
Duke
rMassey = Miami
UNC
UVA
VT
Duke Duke
Miami Miami
rColley = UNC rOD = UNC
UVA UVA
VT VT
Rave = Miami
UNC
UVA
VT
Collapsing the information in the two dimensional matrix into a one dimensional
rating vector.
Method 1:
The row sums of Rave ->a measures of a teams offensive output
The column sums of Rave ->a measures of a teams defensive output
The offensive rating vector o= Rave e ,with e=(1,1...,1)T
The defensive rating vector d= eT Rave
The rating vector r=o/d
Method 2:
Apply Markov method
Normalize row of matrix RTave
Method 3:
Using of the dominant eigenvectors of Rave (the Perron vector of Rave )
The dominant eigenvector of an inrreducible nonnegative matrix ->nonnegative
The result
Combining human-generated lists ,or merge data from both human and
computer sources.