

k-means MATLAB code: feed own data source
{Visits (97), posted by Garrith Graham, 1.3k}
I want to try this k-means clustering code on my own file. How do I change it so it doesn't create random information but reads it from my own data source?
%% generate sample data
K = 3;
numObservarations = 100;
dimensions = 3;
data = rand([numObservarations dimensions]);

%% cluster
opts = statset('MaxIter', 500, 'Display', 'iter');
[clustIDX, clusters, interClustSum, Dist] = kmeans(data, K, 'options', opts, ...
    'distance', 'sqEuclidean', 'EmptyAction', 'singleton', 'replicates', 3);

%% plot data + clusters
figure, hold on
scatter3(data(:,1), data(:,2), data(:,3), 50, clustIDX, 'filled')
scatter3(clusters(:,1), clusters(:,2), clusters(:,3), 200, (1:K)', 'filled')
hold off, xlabel('x'), ylabel('y'), zlabel('z')

%% plot clusters quality
figure
[silh, h] = silhouette(data, clustIDX);
avrgScore = mean(silh);

%% Assign data to clusters
% calculate distance (squared) of all instances to each cluster centroid
D = zeros(numObservarations, K);    % init distances
for k = 1:K
    % d = sum((x-y).^2).^0.5
    D(:,k) = sum(((data - repmat(clusters(k,:), numObservarations, 1)).^2), 2);
end

% find for all instances the cluster closest to it
[minDists, clusterIndices] = min(D, [], 2);

% compare it with what you expect it to be
sum(clusterIndices == clustIDX)

Answer (1) matlab data cluster-analysis k-means

#1
{Answered by Tal Darom}
The line that creates the random data is:
data = rand([numObservarations dimensions]);

Just replace this line with code that reads your data (probably using a MATLAB command such as textscan) into a variable named data.
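
As a rough sketch of that suggestion, the snippet below reads a numeric matrix with textscan and stores it in data. The file name mydata.csv, the comma delimiter, and the three-column layout are assumptions made for illustration, not details from the question; adjust them to your own file. Note that the second argument to textscan must be a format string (one %f per column here), not a size vector.

```matlab
% Hypothetical file name and layout: a comma-separated text file with
% one observation per row and one numeric column per dimension.
filename = 'mydata.csv';    % assumption: replace with your own file

% Write a tiny sample file so this sketch runs on its own.
% Skip this step when you already have a data file.
sample = [0.1 0.2 0.3; 0.4 0.5 0.6; 0.7 0.8 0.9; 0.2 0.1 0.4];
dlmwrite(filename, sample);

% Read the file: the format string holds one %f per column.
dimensions = 3;             % assumption: number of columns in the file
fid = fopen(filename, 'rt');
C = textscan(fid, repmat('%f', 1, dimensions), ...
    'Delimiter', ',', 'CollectOutput', true);
fclose(fid);

% With CollectOutput the numeric columns arrive as a single matrix.
data = C{1};
[numObservarations, dimensions] = size(data);
```

Taking numObservarations and dimensions from size(data) keeps the rest of the script consistent with whatever the file contains. The plotting section still assumes at least three columns, and K must not exceed the number of rows.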


Copyright (c) 2015 questioninbox.com. All rights reserved.
