Escolar Documentos
Profissional Documentos
Cultura Documentos
ABSTRACT between the user and the item), in this approach, the authors are
Collaborative filtering is one of the most popular and effective particularly interested in the ratio or implicative relationship
techniques available today in the recommender system. However, between the user and the data item in a particular context in order
most of them use symmetric similarity measures. Therefore, the to make recommendations to the user more effective. Another study
default effect and the role of the pair of users are the same, but in in the application of statistical implication analysis to the
practice this may not be true. In addition, they only logically recommender system was the user-based collaborative filtering
demonstrate the existence of a priority relationship between two recommender system using association rules combined implication
users rather than the level of the relationship in practice. In this cohesion measure [16] to calculate the similarity for each pair of
paper, we propose a new approach for the collaborative filtering users in collaborative filtering. Recently, in [5], we was proposed
based on the variation analysis of the implication index. An recommendation based on the variance of implication index in
asymmetric measure is developed which can be used to rank or implication field to user for solving these issues.
filter information based on the variation of the implication index by In this paper we also use statistical implication analysis to propose
a counter-example. This measure provides a meaningful a new approach to collaborative filtering based on threshold value
recommendation with a certain level of implication. Experimental of the equivalence plane in the implication field [8] to continue to
results shown that the proposed approach can overcome the solve the problems of asymmetric user influence and the
drawbacks in the traditional recommender systems. implication relationship between the users in the recommender
systems.
Keywords
The paper is organized in five parts, the first one introduces the
Implication index; implication field; collaborative filtering; context and issues to be solved by the present system as well as
implication threshold; equipotential plane. proposing our proposed approached, and the second part presents
the related contents. To the statistical implication analysis and the
1. INTRODUCTION extended studies in the implication field, the third part presents the
Because of the rapid increase of data in era of information model of the recommender system based on the variance of the
explosion today, recommender systems [1][2] become a tool that is implied index in the implication field, the next part is the
extremely necessary and widely used more in electronic trade and experimental section model with scenarios and finally conclusions.
services such as Amazon, Pandora, Netflix, etc. The objective of
the recommender systems is to filter useful information from a 2. IMPLICATION STATISTICAL FIELD
large amount of information so that it is predictable, user will use
them to rate for an item and thereby recommendation items 2.1 Implication statistical analysis
(products, services, etc.) suitable for the user. Algorithms for the Statistical implication analysis (SIA) theory [11] [13] [14],
recommender system have attracted the attention of the researchers proposed by Regis Gras, studies the implication relationship of data
for practical application. Among them, the collaborative filter variables. Measures in the analysis implicative statistical us
algorithms [17] are the most widely used. Most of these algorithms implication index (aka Gras implication index) and implication
are based on the measure of symmetry for filtering information and intensity, are used to detect the rule or R-rule (rule of the rule)
recommendations for users. Recently, several solutions have been strong implicative relationship between the two sides of the rule, or
proposed that use asymmetric similarity to the recommendation to measure the correlation between two variables (individual,
system, such as asymmetric similarity for collaborative filtering via attribute ...), these measures are asymmetric. In addition, statistical
matrix factorization [3][4][6]. Recommendation with asymmetric implication analysis focuses on counter example factor analysis. It
user influence and global importance value [18] to address the can be presented as follows:
asymmetric effects of users in the recommendation system. Let 𝐸 be a finite set of binary variables, A and B are two subsets of
Another new trend is the use of statistical implication analysis in 𝐸 , respectively, which contain the elements 𝑎 ∈ 𝐴 such that
the recommendation system, which addresses the problem of 𝐴(𝑎) = 𝑡𝑟𝑢𝑒 and 𝑏 ∈ 𝐵, such that 𝐵(𝑏) = 𝑡𝑟𝑢𝑒, sets 𝐴̅ , 𝐵̅ is the
asymmetric user influence and solves the problem of assessing the complement of sets 𝐴 and 𝐵 respectively, let 𝑛𝑎 = 𝑐𝑎𝑟𝑑(𝐴), 𝑛𝑏 =
occurrence or functional relationship. Interaction between users 𝑐𝑎𝑟𝑑(𝐵) is the cardinality of 𝐴 and 𝐵 respectively, 𝑛𝑎̅ =
and data items in practice, such as the recommender system model 𝑐𝑎𝑟𝑑( 𝐴̅), 𝑛𝑏̅ = 𝑐𝑎𝑟𝑑( 𝐵̅) is the cardinality of the set 𝐴̅ and the set
based on approach to association rules combined implicative 𝐵̅ and 𝑛𝑎𝑏̅ = 𝑐𝑎𝑟𝑑(𝐴 ∩ 𝐵̅) is the cardinality of the set 𝐴 ∩ 𝐵̅, that
measure [15], to overcome the disadvantage of traditional is a set containing the elements that satisfy the properties 𝑎 = 𝑡𝑟𝑢𝑒
recommender systems (They only focus on the logic that and 𝑏 = 𝑓𝑎𝑙𝑠𝑒 , 𝑛𝑎𝑏̅ also called counter-example, and also
demonstrates the existence or absence of a priority relationship randomly and independently selects subsets of X and Y same
cardinality with 𝐴, 𝐵 respectively, meaning 𝑐𝑎𝑟𝑑(𝑋) = 𝑛𝑎 and Now, to further examine the relationship between the implication
𝑐𝑎𝑟𝑑(𝑌) = 𝑛𝑏 . Let 𝑋̅ and 𝑌̅ respectively be the complement of 𝑋 index and implication intensity. Take the primitive of the equation
and 𝑌 in 𝐸 and have corresponding cadinality as 𝑛𝑎̅ = 𝑛 − 𝑛𝑎 (1), we have:
𝑛𝑏̅ = 𝑛 − 𝑛𝑏 . dφ 1 −q2
=- e 2 <0 (5)
The implication relationship between 𝐴 and 𝐵 is modeled in the dq √2π
statistical implication analysis as follows (see Figure 1). This confirms that the implication intensity increases as 𝑞
decreases, but the rate of increase is determined by formula (5),
which allows for a more rigorous study of the variability of 𝜑.
2.3 Implication Field
2.3.1 Implication statistical field
Figure 1. The illustration of the components of statistical analysis Consider the implication index 𝑞(𝑎, 𝑏̅) in the four-dimensional
implicated by Venn diagrams space 𝐸, with the point 𝑀 whose coordinates are the parameters
associated with the binary variables a and b are (𝑛, 𝑛𝑎 , 𝑛𝑏 , 𝑛𝑎𝑏̅ ),
Implication intensity measure 𝜑(𝑎, 𝑏) of rule 𝐴 → 𝐵 is defined by
then 𝑞(𝑎, 𝑏̅) is a scalar field by applying the mapping from space
[11][13]:
𝑅 4 to space 𝑅. For the vector 𝑔𝑟𝑎𝑑. 𝑞 contains the partial
𝜑(𝑎, 𝑏) = 1 − Pr( 𝑄(𝑎, 𝑏̅) ≤ 𝑞(𝑎, 𝑏̅)) derivatives of 𝑞 for the variables 𝑛, 𝑛𝑎 , 𝑛𝑏 , 𝑛𝑎𝑏̅ is a special gradient
𝑛𝑎𝑏
̅ ∞ field is called implication field, because it satisfies the Schwartz
𝜆𝑠 −𝜆 1 𝑡2
(1)
1−∑ 𝑒 = ∫ 𝑒 − 2 𝑑𝑡 , với 𝑛𝑦 < 𝑛 criteria for the mixed differential, that is, The mixed derivative
= 𝑠! √2𝜋
𝑠=0 ̅) 𝑞(𝑎,𝑏 event of each pair of variables [8], is:
{ 0, other wise δ δq δ δq
( )= ( ) (6)
𝒏𝒂 𝒏𝒃̅ δna∧b
̅ δnb δnb δna∧b̅
where 𝝀 = and 𝑞(𝑋, 𝑌̅) is implication index and is definded
𝒏
by: Similar to each other pairs in the variables (𝑛, 𝑛𝑎 , 𝑛𝑏 , 𝑛𝑎𝑏̅ ) ,
𝒏𝒂 𝒏̅
𝒃
g𝑟𝑎𝑑 𝑞 is considered to be the potential of 𝑞. Vector grad q is
𝒏𝒂𝒃
̅− 𝒏
̅) =
𝒒(𝒂, 𝒃 (2) performed to change the space of the confidentiality of(3) the case, it's
𝒏𝒂𝒏̅
𝒃
√ sort of the low value to a higher value. At each point of the gradient,
𝒏
In terms of approximation (e.g. 𝜆 ≥ 4 ), 𝑞(𝑎, 𝑏̅) is the we observe an increase in the implied density of space and to what
approximation of the normal distribution 𝑁(0,1). extent the rate at which it changes under the influence of one or
more parameters.
The implication rule that X→Y is admissible at the confidence
level 𝛼 if and only if 𝜑(𝑋, 𝑌) ≥ 1 − 𝛼. [11][13]. 2.3.2 Implication index equipotential plane
Consider the implication index as a function of four
2.2 Implication index variation variables𝑞(𝑛, 𝑛𝑎 , 𝑛𝑏 , 𝑛𝑎^𝑏̅ ). A line or plane of equipotential in field
Let consider small variations in the neighborhood of all four 𝐶 is curved in 𝐸, an ordered 4-dimensionals space along which or
observed values of variables: 𝑛, 𝑛𝑎 , 𝑛𝑏 , 𝑛𝑎𝑏̅ . These variables must at which point a variable 𝑀 maintains the same value of potential
be considered as real numbers and q as a continuously of 𝑞. The equation of this curve is shown in [8]:
differentiable function with respect to these variables constrained
𝑛𝑎 𝑛𝑏̅
to respect inequalities: 0 ≤ 𝑛𝑎 ≤ 𝑛𝑏 ; 𝑛𝑎^𝑏̅ ≤ inf{𝑛𝑎 , 𝑛𝑏 } and 𝑛𝑎𝑏̅ −
𝑞(𝑎, 𝑏̅) − 𝑛 =0 (7)
sup{𝑛𝑎 , 𝑛𝑏 } ≤ 𝑛. The differential of q in Frechet’s geometry is 𝑛𝑎 𝑛𝑏̅
expressed in the following way: √
𝑛
𝜕𝑞 𝜕𝑞 𝜕𝑞 𝜕𝑞 Next, on that curve, the scalar product of grad q and 𝑑𝑀 are 0 (in
𝑑𝑞 = 𝑑𝑛 + 𝑑𝑛 + 𝑑𝑛 + 𝑑𝑛 ̅ = 𝑔𝑟𝑎𝑑𝑞. 𝑑𝑀 (3) (4)
𝜕𝑛 𝜕𝑛𝑎 𝑎 𝜕𝑛𝑏 𝑏 𝜕𝑛𝑎^𝑏̅ 𝑎^𝑏 (3) and (7)). This is understood as the orthogonal of a gradient
tangent or a hyperplastic tangent to that curve, that is, to the
with 𝑀 the point with the coordinates (𝑛, 𝑛𝑎 , 𝑛𝑏 , 𝑛𝑎𝑏̅ ) belong to
equipotential plane.
the scalar vector field 𝐶, 𝑑𝑀 is the differential component vector of
the instance variables and grad q is the partial differential vector of To illustrate, considering the relationship from a potential F
the variables. depends only on two variables, figure 2 shows, for example, the
orthogonal orientation of the gradient to the difference of
From (3), the differential of the function 𝑞 appears as a scalar isomorphism, along the constant F but changes from F7 to F10 [8].
product between gradient q and the increase of 𝑞 on the surface
representing the variables of the function 𝑞(𝑛, 𝑛𝑎 , 𝑛𝑏 , 𝑛𝑎𝑏̅ ). 𝑔𝑟𝑎𝑑 𝑞 Grad F in M’
𝒚
denotes the variability of the function of four variables, which is
the cardinalities of the sets 𝐸, 𝐴, 𝐵, and 𝐴 ∩ 𝐵̅, which points to the Grad F in M
direction of the function 𝑞 in four dimensions space. In fact, the M’
M
value of this differential lies in the estimation of the increase F7
’
(positive or negative) of q that we note 𝛥𝑞 relative to the respective F8
variations ∆𝑛, ∆𝑛𝑎 , ∆𝑛𝑏 , and ∆𝑛𝑎𝑏̅ . So we have: F9
𝜕𝑞 𝜕𝑞 𝜕𝑞 𝜕𝑞
F10
∆𝑞 =
𝜕𝑛
∆𝑛 +
𝜕𝑛𝑎
∆𝑛𝑎 +
𝜕𝑛𝑏
∆𝑛𝑏 +
𝜕𝑛𝑎^𝑏
∆𝑛𝑎𝑏̅ + 𝑜(∆𝑞) (4) 𝒙
̅
with 𝑜(𝑞) is an infinitely small. Figure 2. The illustrating potentials relationship depends only on 2
variables.
In this case, the potential q forms the equipotential plane (shown in considered are still acceptable when the number of counter-
Figure 2 for easy representation). examples remains in the "acceptable" threshold, because in these
We can understand that this case stronger for strict plane and situations rules are still active and effective. In data analysis, the
weakening in the more sparse. To get a value q in this case, fixed 3 problem is to define a consensus standard, thereby quantifying the
variables, such as 𝑛 , 𝑛𝑎 , 𝑛𝑏 and q values compatible with the confidence threshold of the rule according to user requirements.
constraints of field. In this section, we propose a recommendation based on the
variation of the implication index depending on the variation of the
3. RECOMMENDATION BASED ON counter example in the implication field for determining the
EQUIPOTENTIAL PLANE IN equipotential plane of implication index set, from there, the item
(or k-top items list) consultant is suitable for the user with a definite
IMPLICATION FIELD implication threshold. The threshold θ is the tolerance value of q in
3.1 Implication statistical rules the same equipotential plane, θ is defined by byFactor. To
Let 𝐷 is a dataset that consists of 𝑇 transactions, each transaction determine θ, it is necessary to consider how the dependent variable
𝑇𝑖 consists of objects or items that are objects that appear in 𝑞(𝑎, 𝑏) varies when an element 𝑥 is added to (or removed from) the
transactions such as (products, services, ...). The itemset 𝐼 is a set sample data, with four occurrences.
consisting of m items. The frequency of occurrence of a data item
To more specific, the partial derivative according to 𝑛, 𝑛𝑎 , 𝑛𝑏 , 𝑛𝑎𝑏̅ :
in the database is denoted δ. The support of the set 𝑋, denoted
by 𝑠𝑢𝑝𝑝𝑜𝑟𝑡(𝑥), is the percentage of transactions that contain 𝑋 in 𝜕𝑞 1 𝑛𝑎 𝑛𝑏̅
= (𝑛𝑎⋀𝑏̅ + ) (8a)
database 𝐷.The association rule is a rule of the form 𝑋 → 𝑌, where 𝜕𝑛 2√𝑛 𝑛
𝑋, 𝑌 ⊂ 𝐼 are itemsets, 𝑋 is called premise, 𝑌 is the consequence. 𝜕𝑞 1 𝑛𝑎𝑏̅ 𝑛 2 1 𝑛𝑏̅
3
(8b)
Association rules is usually evaluated by two metric is the support =− ( ) − √ (5d)
𝜕𝑛𝑎 2 𝑛𝑏̅ 𝑛𝑎 2 𝑛𝑎
(support - S) and reliability (confidence - 𝐶). The support of the √
𝑛
rule 𝑋 → 𝑌 denoted by sup (𝑋 → 𝑌) is the ratio of transactions 1
𝛿(𝑋∪𝑌 ) 𝜕𝑞 1 𝑛𝑎 2 3 1 𝑛𝑎 1 1
including X and Y to total transactions. 𝑠𝑢𝑝(𝑋 → 𝑌 ) = = 𝑛 ̅ ( ) (𝑛 − 𝑛𝑏 )2 + ( )2 (𝑛 − 𝑛𝑏 )2 (8c)
|𝑇| 𝜕𝑛𝑏 2 𝑎𝑏 𝑛 2 𝑛
The confidence of the rule 𝑋 → 𝑌, symbol: conf (𝑋 → 𝑌) is the
𝜕𝑞 1 1
probability that a transaction contains 𝑋 will contain 𝑌 defined by: = =
𝛿(𝑋∪𝑌 ) 𝑠𝑢𝑝(𝑋→𝑌 ) 𝜕𝑛𝑎^𝑏̅ 𝑛 𝑛̅ 𝑛 𝑛 − 𝑛𝑏 ) (8d)
𝑐𝑜𝑛𝑓(𝑋 → 𝑌) = = = 𝑃(𝑌|𝑋). √ 𝑎 𝑏 √ 𝑎(
𝛿(𝑋) 𝑠𝑢𝑝(𝑥) 𝑛 𝑛
The rule in statistical implication analysis for recommnender From (8d), if 𝑛𝑎𝑏̅ increases, the implication index increased, and
system is as follows: Assume I is the set of m items, 𝐴 ⊂ 𝐼 that is thus the intensity implies decreased.
the set of items rated by the user 𝑢𝑎 ; 𝐴̅ is the complement of A. The Let 𝜆1 , 𝜆2 and 𝑞1 , 𝑞2 , corresponding to 𝜆, 𝑞 are related to the
set 𝐵 ⊂ 𝐼 is the set of items rated by the user 𝑢𝑏 ; 𝐵̅ is the original data sample and the extended data sample as Table 1A
complement of B; 𝑛𝑎 = 𝑐𝑎𝑟𝑑(𝐴) is the number of data items (Value ± 1 corresponds to 1 in the additional case and -1 when
rated by the user 𝑢𝑎 , which is the number of elements of the set A); removing the 𝑥 in the dataset) and variability ∆𝑞 = 𝑞2 − 𝑞1 as
𝑛𝑏 = 𝑐𝑎𝑟𝑑(𝐵) is the number of data items rated by user 𝑢𝑏 Table 1B
(number of elements of set B); 𝑛𝑎𝑏̅ = 𝑐𝑎𝑟𝑑(𝐴 ∩ 𝐵̅) is the number
Table 1A. Table of variability of 𝑛𝑎 , 𝑛𝑏 , 𝑛𝑎𝑏̅ , 𝜆
of data items rated by the user 𝑢𝑎 but not rated by the user 𝑢𝑏 . In
a b ∆𝑛𝑎 ∆𝑛𝑏̅ ∆𝑛𝑎𝑏̅ 𝜆1 𝜆2 ∆𝜆
addition to the usual measures, in the above paragraph, the specific
measure of the implicative rules is implication index, this measure (𝑖) 0 0 0 ±1 0 𝑛𝑎 (𝑛𝑏̅ ± 1) >0
expressed the degree to implication which association rules are not, 𝑛±1
an implication rule expressed by a set of four variables (𝑖𝑖) 0 ±1 0 0 0 𝑛𝑎 𝑛𝑏̅ <0
(𝑛, 𝑛𝑎 , 𝑛𝑏 , 𝑛𝑎𝑏̅ ). They are called the cardinalities of the implication 𝑛𝑎 𝑛𝑏̅ 𝑛±1
rules. In other words, the relationship between the user 𝑢𝑎 and 𝑢𝑏 𝑛 (𝑛𝑎 ± 1)(𝑛𝑏̅ ± 1) >0
(𝑖𝑖𝑖) ±1 0 ±1 ±1 ±1
is the relationship between the item set 𝐴 is liked by the user 𝑢𝑎
𝑛±1
and the item set 𝐵 is liked by the user 𝑢𝑏 represented by the set of
(𝑖𝑣) ±1 ±1 ±1 0 0 (𝑛𝑎 ± 1)𝑛𝑏̅ >0
four elements (𝑛, 𝑛𝑎 , 𝑛𝑏 , 𝑛𝑎𝑏̅ ).
𝑛±1
3.2 Threshold implication variability between Table 1B. Table of variability of 𝑞
equipotential planes ∆𝑞
As discussed in the previous section, statistical implication analysis
(𝑖) 𝑛𝑎 (𝑛𝑏̅ + 1) 𝑛𝑎 𝑛𝑏̅ (a)
𝑛𝑎𝑏̅ −
focuses on counter example factor to analysis. 𝑛 + 1 − 𝑛𝑎𝑏̅ − 𝑛
𝑛 𝑛̅
It is difficult to replace an original rule by another rule when a few √𝑛𝑎 (𝑛𝑏̅ + 1) √ 𝑎 𝑏
counter examples (unlikelihood) appear, only when the counter 𝑛+1 𝑛
example higher the confidence of the rule decreases and the rule (𝑖𝑖) 𝑛𝑎 𝑛𝑏̅ 𝑛𝑎 𝑛𝑏̅ (b)
𝑛𝑎𝑏̅ −
can be denied. However, when the number of example (likelihood) 𝑛 + 1 − 𝑛𝑎𝑏̅ − 𝑛
is numerous and the number of counter examples is rarer, the rule 𝑛 𝑛̅ 𝑛 𝑛̅
√ 𝑎 𝑏 √ 𝑎 𝑏
𝑛+1 𝑛
becomes stronger and is recognized. For example, let's look at the
rules that are acceptable. “Ferrari cars are red." Even if one or two (𝑖𝑖𝑖)
(𝑛𝑎𝑏̅ + 1) −
(𝑛𝑎 + 1)(𝑛𝑏̅ + 1) 𝑛 𝑛̅
𝑛𝑎𝑏̅ − 𝑎 𝑏
(c)
of the counter examples appear (Ferrari cars are not red), this rule 𝑛+1 − 𝑛
𝑛 𝑛̅
is maintained, and it will be even confirmed once again by the √(𝑛𝑎 + 1)(𝑛𝑏̅ + 1) √ 𝑎 𝑏
𝑛+1 𝑛
release of new examples. Thus, contrary to mathematics, where
rules are not allowed to have any exceptions, the rules here
(𝑖𝑣) (𝑛𝑎 + 1)𝑛𝑏̅ 𝑛𝑎 𝑛𝑏̅ (d) Step 1: generate rules set from set of transactions by using data
𝑛𝑎𝑏̅ −
𝑛 + 1 − 𝑛𝑎𝑏̅ − 𝑛 mining algorithms (such as apriori).
𝑛 𝑛̅
√(𝑛𝑎 + 1)𝑛𝑏̅ √ 𝑎 𝑏 Step 2: Calculating cardinalities of implication rules, Details are as
𝑛+1 𝑛
follows: Count number of transactions n. Generating two binary
To determine the variation threshold of implication index θ on the (True/False) matrixes 𝑙ℎ𝑠𝑅𝑢𝑙𝑒𝑠, 𝑟ℎ𝑠𝑅𝑢𝑙𝑒𝑠, with true value if item
𝜕𝑞 Δ𝑞
equipotential planes, let
𝜕𝜉
and Δ𝜉 respectively partial derivatives j belong to left hand side for 𝑙ℎ𝑠𝑅𝑢𝑙𝑒𝑠 (respectively, right hand
side for 𝑟ℎ𝑠𝑅𝑢𝑙𝑒𝑠). Then, for each rule[𝑖]:
and increment of q according to ξ, where 𝜉 ∈ {𝑛, 𝑎, 𝑏, 𝑎𝑏̅}. A
𝑙ℎ𝑠𝑃𝑟𝑜𝑑𝑢𝑐𝑡 = 𝑙ℎ𝑠𝑅𝑢𝑙𝑒𝑠 × (𝑑𝑎𝑡𝑎)𝑇
1
variation of 𝑞 from the addition (or eliminate) of an individual on
the dataset can change the number of k implication rules based on 𝑛𝑎[𝑖] = 𝑟𝑜𝑤𝑆𝑢𝑚(𝑙ℎ𝑠𝑃𝑟𝑜𝑑𝑢𝑐𝑡[𝑖])
Δ𝑞
the dataset, this leads to an increase in𝜃 = 𝑘 , it mean:
Δ𝜉 𝑟ℎ𝑠𝑃𝑟𝑜𝑑𝑢𝑐𝑡 = 𝑟ℎ𝑠𝑅𝑢𝑙𝑒𝑠 × (𝑑𝑎𝑡𝑎)𝑇
𝜕𝑞 Δ𝑞 (9) 𝑛𝑏[𝑖] = 𝑟𝑜𝑤𝑆𝑢𝑚(𝑟ℎ𝑠𝑃𝑟𝑜𝑑𝑢𝑐𝑡[𝑖])
=𝑘 + 𝑜(𝑞)
𝜕𝜉 Δ𝜉 𝑛𝑎𝑏 [𝑖]: The calculation is the same as 𝑛𝑎[𝑖], 𝑛𝑏[𝑖]
𝜕𝑞 Δ𝑞 but on both the left and right sides.
where 𝑜(𝑞) is an infinitely small. , are definded with
𝜕𝜉 Δ𝜉 𝑛𝑎𝑏̅ [𝑖] = 𝑛𝑎 [𝑖] − 𝑛𝑎𝑏 [𝑖]
formulas from (8a) to (8d) and (a) to (d) of table 1B.Threshold θ is
Δ𝑞 Step 3: Return (𝑛[𝑖], 𝑛𝑎 [𝑖], 𝑛𝑏 [𝑖], 𝑛𝑎𝑏̅ [𝑖])
defined as 𝑘 from (9).
Δ𝜉 Algorithm 2. RBEP (Recommendation by Equipotential Plane)
3.3 Recommendation based on variation Input: dataset, threshold θ, variant factor byFactor
implication index with threshold value of Output: recommendation: item/ top k item list
equipotential plane Step 1: call IRG(dataset) for generating rules set and calculating
To provide a formal definition of the recommendation task, it is 𝑛, 𝑛𝑎 , 𝑛𝑏 , 𝑛𝑎𝑏̅ .
necessary to introduce some concepts of the consulting system.
Accordingly, the set of users in the system will be denoted by U, Step 2: With each 𝑟𝑢𝑙𝑒(𝑖) , calculate implication index 𝑞(𝑖)
and the set of items is I. In addition, the set of ratings in the system according to formula (2). After that, calculate partial derivatives of
is denoted by R, and the set of possible scales for a rating is 𝒮 (eg, 𝑞(𝑖) follow byFactor according to formula (5).
𝒮 = [1, 5] or 𝒮 = {𝑙𝑖𝑘𝑒, 𝑑𝑖𝑠𝑙𝑖𝑘𝑒}). Also, assume that no more than Step 3: Determine the set of recSet containing q on the same
one rating can be performed by any user 𝑢 ∈ 𝑈 for a particular item equipotential plane follows byFactor: (|∆𝑞(𝑎, 𝑏̅)| ≤ 𝜃), is defined
𝑖 ∈ Ι and written to 𝑟𝑢𝑖 ∈ 𝑅 for this rating. To determine the by equation (10).
subsets of users 𝑢 have rated an item 𝑖, the symbols 𝑈𝑖 are used.
Step 4: return recommended: item or k items from recSet set.
Likewise, Ι𝑢 represents the subset of the line items evaluated by a
user. Finally, the items were rated by two people 𝑢 and 𝑣, that The algorithms described above serves as the basis for a
is, Ι𝑢 ⋂Ι𝑣 , which is an important concept in the presentation of the recommendation model for a statistical-based recommendation
article, and Ι𝑢𝑣 is used to denote this concept, ie Ι𝑢𝑣 = Ι𝑢 ⋂Ι𝑣 . In a system RSIF (Recommender System based on Implication Field)
similar representation, 𝑈𝑖𝑗 is used to denote the set of users that as figure 3:
have rated both items 𝑖 and 𝑗, i.e. 𝑈𝑖𝑗 = U𝑖 ⋂U𝑗 Dataset Implication
field
One of the most important issues related to the recommender
system is the best recommendation and recommended items list n Algorithms Recom-mendation
Data mining ISA (IRG,
best items for the user. This issue included in the search, for a RBEP.etc.) item
particular user u, the new item 𝑖 ∈ Ι\Ι𝑢 which user u most likely algorithms
Knowledge
have interest in. When ratings are available, this task is often
defined as a regression or classification problem (multilayer) whose
purpose is to understand a function: Association
Rule set RSIF
𝑓: 𝑈 × 𝐼 → 𝑆 k-Recommen-
Model dation list (topk-
In which predicts the rating of 𝑓(𝑢, 𝑖) of a user 𝑢 for a new item𝑖. list)
This function is used to recommendation the active user 𝑢𝑎 to an Implication rules
set
item 𝑖 ∗ that rating the highest estimated value [1][2].
𝑖 ∗ = 𝑎𝑟𝑔 max 𝑓(𝑢𝑎 , 𝑗) (10) Figure 3. RSISF Recommender system model
𝑗∈𝐼\𝐼𝑢