Escolar Documentos
Profissional Documentos
Cultura Documentos
ISSN: 1992-8645
www.jatit.org
E-ISSN: 1817-3195
ABSTRACT
Complicated text understanding technology which extracts opinion, and sentiment analysis is called
opinion mining. Building systems to collect/examine opinions about a product in blog posts, comments,
and reviews/tweets is sentiment analysis. Product reviews are the focus of existing work on review mining
and summarization. This study focuses on movie reviews, investigating opinion classification of online
movie reviews based on opinion/corpus words used regularly in reviewed documents.
Keywords: Opinion Mining, Classification Accuracy, Sentiment analysis, Movie reviews, Support Vector
Machine, Bagging
1.
INTRODUCTION
291
ISSN: 1992-8645
www.jatit.org
E-ISSN: 1817-3195
2.
RELATED WORKS
ISSN: 1992-8645
www.jatit.org
E-ISSN: 1817-3195
IDF ( a ) = log
xa
1 +
x
xa
3.3 Classifier
SVM classification has roots in structural risk
minimization (SRM) that determines classification
decision function through empirical risk
minimizing [16]
R
1
l
f (
i = 1
f w ,b = s i g n ( w x + b ) .
Optimal separating hyperplane in SVM is
determined through largest separation margin
between classes bisecting the shortest line between
two classs convex hulls. Optimal hyperplane
satisfies constrained minimization as
1 T
w w,
2
yi ( w xi + b ) 1 .
M in
( xi , yi ), i=1,...n , where xi d is a
y {+1, 1} indicates class
feature vector and i
data
i = 1
( x
f ( x ) =
i = 1
y iK ( si, x ) + b
ISSN: 1992-8645
www.jatit.org
E-ISSN: 1817-3195
( E y ) = ( E y + ( x ) )
= ( E y ) + 2 E ( y ) E ( ( x ) ) + E
= ( E y + E ( x ) )
= ( E y ) + var ience ( ( x ) )
( E y )
2
K ( x i , x j ) = ( x iT x j + r ) d , w h e re > 0
K ( xi , x j ) = ex p
xi x
2
j
), w h e re > 0
3.4 Bagging
Table 1: Classification Accuracy And RNSE For
Various Classifiers Used
Technique
used
294
Classification
Accuracy
RMSE
SVM with
Polykernel
87.00%
0.3606
SVM with
RBF Kernel
73.33%
0.5164
Bagging with
SVM
88.00%
0.2836
( x )
ISSN: 1992-8645
www.jatit.org
E-ISSN: 1817-3195
True positives
True positives + false positives
True positives
True positives + false negatives
fMeasure = 2 *
precision * recall
precision + recall
Figure 3: f Measure
5.
Technique
used
Precision
Recall
F
Measure
SVM with
Polykernel
0.87
0.87
0.87
SVM with
RBF Kernel
0.76
0.733
0.726
Bagging
with SVM
0.881
0.88
0.88
CONCLUSION
ISSN: 1992-8645
[6].
[7].
[8].
[9].
[10].
[11].
[12].
[13].
[14].
[15].
www.jatit.org
[16].
[17].
[18].
[19].
296
E-ISSN: 1817-3195