Escolar Documentos
Profissional Documentos
Cultura Documentos
Clintonvs.Trump2016:AnalyzingandVisualizingTweetsand
SentimentsofDonaldTrumpandHillaryClinton
Siddharth Grover
OklahomaStateUniversity,Stillwater,Oklahoma
ABSTRACT
ANALYSIS
The United States 2016 presidential campaign has seen an unprecedented amount of media coverage, numerous
presidential candidates, and acrimonious debates over wideranging topics from candidates of both the Republican
and the Democratic parties. Twitter is the dominant social medium for people to understand, express, relate and
support the policies proposed by their favorite political leaders. In this poster, we analyzed the sentiment of the
tweets posted by Hillary Clinton and Donald Trump on their Twitter feeds. We also identified the most frequent policy
related keywords used by these candidates, along with the twitter handles they most frequently mentioned in their
tweets. We discovered that Donald Trump was more concerned with media coverage, as he frequently tweeted at and
mentioned media handles and used social space to negatively talk about the other presidential nominees. Trump also
falls short on positive sentiment and had an overall negative Twitter sentiment. Hillary Clinton, on the other hand,
used the same space to discuss some of her current policies and events. Though Clinton did not show the same ability
to drive the mainstream media narrative on Twitter as Trump, she did generate more overall positive sentiment.
TEXTMINING
Textfrom200,000tweetswasextractedandsavedastextfilesusingtheTextImportnodeofSASEnterpriseMiner.
Manyofthetweetswereretweetsandredundant,andwerethereforeremovedtogetawidevarietyoftopicsfor
betteranalysis.Textminingwasinitiatedbyparsingthedatatofindtokens(terms),partsofspeechtags,entities,etc.
Weignoredpartsofspeech,whichfiltersprepositions,determinants,auxiliaryverbs,etc.,alongwithnumericvalues
andpunctuation,asthesecontainfarlessinformation.DatawasthenfilteredusingtheTextFilternodeandIDFwas
setasthetermweightanddefaultsettingforthefrequency.Webuilttwoconceptlinksaroundofficialtwitter
accounts of both candidates to reflect what are
they are talking about.
METHODOLOGY
We extracted about 200,000 tweets accessing the live stream API of Twitter, using a java program, mytwitterscraper,
which is an open source realtime Twitter scraper. The timeline for the analysis was from April 2016 to June 2016. We
concentrated on @realdonaldtrump and @hillaryclinton Twitter handles and also on trending hashtags like
#trump2016 and #clinton2016. We also collected information on the number of followers, retweets, and favourited
tweets for both candidates.
TheconceptlinkinFigure2showstheassociation
betweentermsinthetweetsHillaryClinton
made.Thereisastrongrelationshipamongthe
termsstatus,hillaryclinton,supporterand
love",whichmightimplyshetweetsaboutherlove
forhersupporters.Termslikecampaign,money,
andvotermightimplyaskingvoterstovoteor
donatemoneyforhercampaign.
Figure2.Conceptlinkof@hillaryclinton
TheconceptlinkinFigure3showstheterms
mentionedinthetweetsbyDonaldTrump.Thereis
astrongrelationshipamongtermsfoxnews,cnn,
trump2016,andhillaryclinton,suggestingthat
thesemightbemostmentionedtermsinDonalds
tweets.ThismightimplythatTrumpfrequently
tweetsatmediahandlesandalsotalksaboutterms
likeamerican,gun,andpotus.
Textminingandsentimentanalysiswereperformedto
helpfocusonthefollowingkeypointstobettermake
thecomparisonbetweenbothcandidates.
Most used phrases on Twitter
Which Twitter handles candidates most
tweeted at
Policy key words mentioned on Twitter
Comparing sentiments on Clinton and Trumps
tweets
Figure 1. Text Mining and Sentiment Analysis Framework
Figure3.Conceptlinkof@realdonaldtrump
#analyticsx
Clintonvs.Trump2016:AnalyzingandVisualizingTweetsand
SentimentsofDonaldTrumpandHillaryClinton
Siddharth Grover
OklahomaStateUniversity,Stillwater,Oklahoma
TEXTMININGRESULTS
SENTIMENTANALYSISRESULTS
ANALYSISOFMOSTRECENTTWEETS
Wedidananalysisonthemostrecentoriginal3000tweetsfrombothHillaryClintonandDonaldTrump.
TRUMP
CLINTON
@CNN
16% @POTUS
36%
@FoxNews
14% @billclinton
20%
@foxandfriends
8% @realDonaldTrump 16%
@nytimes
7% @BernieSanders
@JebBush
7% @HFA
10%
4%
Figure5.Mostfrequenttwitterhandlesmentioned
TRUMP
Terror
Immigration
Jobs
Taxes
Guns
Education
Health
Figure5.WordCloudofmostfrequenttermsbyTrump&Clinton
COUNT
133
78
78
53
13
13
8
CLINTON
Guns
Health
Taxes
Immigration
Education
Foreign
Vets
COUNT
250
150
83
80
53
53
28
Figure6.CountofPolicyFocusedKeyWords
METHODOLOGY
SENTIMENT ANALYSIS
WefocusedonperformingsentimentanalysisontweetspostedfromofficialTwitterhandlesofbothcandidates
(@realdonaldtrump and@hillaryclinton).Fromthedatacollectedfortextmining,weextractedtworandomsamples
asmodelingdatasetswith5,000tweetseach.Wefurtherextractedtwoadditionalsetsofrandomdatasetswith
5,000tweetsthatwouldbeusedtotesttheresults.Weusedthemostrecenttweetsfordataexplorationandinitial
trendsbeforedivingintosentimentanalytics.Wethenbuiltdifferentstatisticalmodelsonthose5,000tweetsin
boththemodelingdatasetsusingSASSentimentAnalysisStudiotoidentifythesentimentdistributionfortweets
postedbyTrumpandClinton.Differentruleswerespecifiedforpositive,negative,anddescriptorsinaccordance
withthedatacollected.Finally,allofthemodelsweretestedagainstthetestdatatoseehowtheyllholdand
predictoverallsentimentexpressedbybothcandidates.
Model Results
Positive
Negative
Overall
Precision
Precision
Precision
Trump
94%
90%
92%
Clinton
85%
95%
90%
Figure7.ModelComparison
CONCLUSION
Intheworldofrealtimeinformation,Twitterplaysanimportantroleindisseminatingnewsandopinions.Inorderto
capturethevaryingpoliticalviewsofleadingpresidentialcandidates,weperformedtextminingandsentimentanalysis
onHillaryClintonandDonaldTrumptweetsduringthetimeperiodfromApril,2016 June,2016.Wecreatedconcept
linkstounderstandassociationbetweentermsmentionedinthetweetsbybothcandidates.Wediscoveredthat
DonaldTrumpfrequentlytweetedatandmentionedmediahandlesinhistweetsandthetweetshadanoverall
negativesentiment.HillaryClinton,ontheotherhand,hadmorepolicyfocusedkeywordsandfrequentlytweeted
moretothepoliticalestablishment.Clintonfellshortonengagement,measuredthroughretweets,butgenerated
moreoverallpositivesentiment.
REFERENCES
1. Chakraborty,Goutam,Murali Pagolu andSatishGarla.November2013.BookTextMiningandAnalysis:Practical
Methods,Examples,andCaseStudiesUsingSASSASInstitute.
2. SASInstituteInc.2014.GettingStartedwithSASTextMiner13.2.Cary,NC:SASInstituteInc.