Você está na página 1de 13

8/14/2015

healthcarecosts

HealthcareCosts
IstumbleduponsomedataregardinghealthcarecostsforvariousmedicalproceduresandconditionsintheUSA.
Andso,Iamgoingtoanalyzethedataandseewhatwecanfindoutfromit.
First,letmesetupthisIpythonnotebookwiththenecessaryrequirementsforourwork.Iamalsogoingtoopenthe
filecontainingthedata(thecsvfile)andpresentthefirst3rowsofdatainordertohaveanideaofwhatweare
workingwith.Somecomputercodewillbefeatured.Butifyouarenotacomputerprogrammer,don'tworryaboutit.
Youwillnotneedthatskilltounderstandthiswork.Herewego.
In[5]: #importingthevariousmoduleswewillbeusing
importpandasaspd
importnumpyasnp
pd.set_option("display.mpl_style","default")
#readingthedatafileandshowingthefirst3rows
healthdata=pd.read_csv("healthcarecosts.csv")
healthdata[:3]
Out[5]:
DRGDefinition

Provider Provider
Id
Name

Provider
Street
Address

Provider
City

Hospital
Provider
Provider
Referral
Zip
State
Region
Code
Description

039
EXTRACRANIAL
0
10001
PROCEDURES
W/OCC/MCC

SOUTHEAST
ALABAMA
MEDICAL
CENTER

1108
ROSS
CLARK
CIRCLE

DOTHAN

AL

36301

ALDothan

039
EXTRACRANIAL
1
10005
PROCEDURES
W/OCC/MCC

MARSHALL
MEDICAL
CENTER
SOUTH

2505US
HIGHWAY
431
NORTH

BOAZ

AL

35957

AL
Birmingham

039
EXTRACRANIAL
2
10006
PROCEDURES
W/OCC/MCC

ELIZA
COFFEE
MEMORIAL
HOSPITAL

205
MARENGO FLORENCE AL
STREET

35631

AL
Birmingham

Oknow,wecanseewhatkindofdatawehaveavailable.Wehavethedefinitionofthevariousmedicalprocedures
andconditions"DRGDefinition".Wehavetheproviderid,providername,theiraddress,theirstate.Also,theAverage
CoveredCharges.Intheoriginalfile,theysaythattheAverageCoveredChargesisthetotalamountthattheprovider
charges.So,wewillusethisasthecostforthevariousprocedures.
Wecanalsofindouthowmanyrowsofdatawehavewiththecodebelow.Andwefindoutthatwehave163065rows
ofdataand12columns.
In[9]: #findingnumberofrowsandcolumns
healthdata.shape
Out[9]: (163065,12)

Butwedon'tneedallthosedifferentkindsofdataforourwork.Weareonlyinterestedin"DRGDefinition","Provider
State"and"AverageCoveredCharges".So,let'smanipulatethedatainordertoshowonlywhatweneed.Also,for
thoseinclinedtowardscomputerprogramming,youmaynotethatthevaluesin"AverageCoveredCharges"are
strings.Well,ifyouhavenoticed,don'tworryaboutit.Iwillalsotransformthosevaluesintonumbers(floats)sowe
candocalculationswiththem.
WhenIamdone,youwillnolongerseethe$infrontoftheirnumbers

http://localhost:8888/nbconvert/html/healthcarecosts.ipynb?download=false

1/13

8/14/2015

healthcarecosts

In[11]: #combiningonly"DRGDefinition","ProviderState"and"AverageCoveredCharges"da
ta
healthdata2=healthdata[["DRGDefinition","ProviderState","AverageCoveredChar
ges"]]
#firstmakingsurethatdatain"AverageCoveredCharges"arestringsbyconverting
themintostrings,thenconvertingtofloats
healthdata2["AverageCoveredCharges"]=healthdata2["AverageCoveredCharges
"].str[1:].astype(float)
#showingfirst10rowsofthenewdata
healthdata2[:10]
C:\Users\Ricardy\Anaconda\lib\sitepackages\IPython\kernel\__main__.py:5:SettingWit
hCopyWarning:
AvalueistryingtobesetonacopyofaslicefromaDataFrame.
Tryusing.loc[row_indexer,col_indexer]=valueinstead
Seethethecaveatsinthedocumentation:http://pandas.pydata.org/pandasdocs/stabl
e/indexing.html#indexingviewversuscopy
Out[11]:
DRGDefinition

Provider
State

AverageCovered
Charges

039EXTRACRANIALPROCEDURESW/O
CC/MCC

AL

32963.07

039EXTRACRANIALPROCEDURESW/O
CC/MCC

AL

15131.85

039EXTRACRANIALPROCEDURESW/O
CC/MCC

AL

37560.37

039EXTRACRANIALPROCEDURESW/O
CC/MCC

AL

13998.28

039EXTRACRANIALPROCEDURESW/O
CC/MCC

AL

31633.27

039EXTRACRANIALPROCEDURESW/O
CC/MCC

AL

16920.79

039EXTRACRANIALPROCEDURESW/O
CC/MCC

AL

11977.13

039EXTRACRANIALPROCEDURESW/O
CC/MCC

AL

35841.09

039EXTRACRANIALPROCEDURESW/O
CC/MCC

AL

28523.39

039EXTRACRANIALPROCEDURESW/O
CC/MCC

AL

75233.38

Ignorethewarning.Everythingisalright.Now,beforewegofurther,Iaminterestinginfindingoutwhatunique
values/nameswehaveforthemedicalprocedures.So,let'screatealistthatshowsonlyuniquevalues.Seebelow
In[13]: #convertingthevaluesfrom"DRGDefinition"intoalist.Butthiswillgiveussev
eralinstancesofthesamevalues
newlist=healthdata2["DRGDefinition"].tolist()
#retrievingtheuniquevaluesbyconvertingthepreviouslistintoaset
dataset=set(newlist)
#thenconvertingthesetbackintoalistagain,butwithuniquevaluesthistime,f
oreaseofoperation.Andsortingthelist
#thenshowingthelistwithitsuniquevalues
datalist=[aforaindataset]
datalist=sorted(datalist)
datalist
Out[13]: ['039EXTRACRANIALPROCEDURESW/OCC/MCC',
'057DEGENERATIVENERVOUSSYSTEMDISORDERSW/OMCC',
'064INTRACRANIALHEMORRHAGEORCEREBRALINFARCTIONWMCC',
'065INTRACRANIALHEMORRHAGEORCEREBRALINFARCTIONWCC',
'066INTRACRANIALHEMORRHAGEORCEREBRALINFARCTIONW/OCC/MCC',
'069TRANSIENTISCHEMIA',
'074CRANIAL&PERIPHERALNERVEDISORDERSW/OMCC',
'101SEIZURESW/OMCC',
'149DYSEQUILIBRIUM',
http://localhost:8888/nbconvert/html/healthcarecosts.ipynb?download=false

2/13

8/14/2015

healthcarecosts

'176PULMONARYEMBOLISMW/OMCC',
'177RESPIRATORYINFECTIONS&INFLAMMATIONSWMCC',
'178RESPIRATORYINFECTIONS&INFLAMMATIONSWCC',
'189PULMONARYEDEMA&RESPIRATORYFAILURE',
'190CHRONICOBSTRUCTIVEPULMONARYDISEASEWMCC',
'191CHRONICOBSTRUCTIVEPULMONARYDISEASEWCC',
'192CHRONICOBSTRUCTIVEPULMONARYDISEASEW/OCC/MCC',
'193SIMPLEPNEUMONIA&PLEURISYWMCC',
'194SIMPLEPNEUMONIA&PLEURISYWCC',
'195SIMPLEPNEUMONIA&PLEURISYW/OCC/MCC',
'202BRONCHITIS&ASTHMAWCC/MCC',
'203BRONCHITIS&ASTHMAW/OCC/MCC',
'207RESPIRATORYSYSTEMDIAGNOSISWVENTILATORSUPPORT96+HOURS',
'208RESPIRATORYSYSTEMDIAGNOSISWVENTILATORSUPPORT<96HOURS',
'238MAJORCARDIOVASCPROCEDURESW/OMCC',
'243PERMANENTCARDIACPACEMAKERIMPLANTWCC',
'244PERMANENTCARDIACPACEMAKERIMPLANTW/OCC/MCC',
'246PERCCARDIOVASCPROCWDRUGELUTINGSTENTWMCCOR4+VESSELS/STENTS',
'247PERCCARDIOVASCPROCWDRUGELUTINGSTENTW/OMCC',
'249PERCCARDIOVASCPROCWNONDRUGELUTINGSTENTW/OMCC',
'251PERCCARDIOVASCPROCW/OCORONARYARTERYSTENTW/OMCC',
'252OTHERVASCULARPROCEDURESWMCC',
'253OTHERVASCULARPROCEDURESWCC',
'254OTHERVASCULARPROCEDURESW/OCC/MCC',
'280ACUTEMYOCARDIALINFARCTION,DISCHARGEDALIVEWMCC',
'281ACUTEMYOCARDIALINFARCTION,DISCHARGEDALIVEWCC',
'282ACUTEMYOCARDIALINFARCTION,DISCHARGEDALIVEW/OCC/MCC',
'286CIRCULATORYDISORDERSEXCEPTAMI,WCARDCATHWMCC',
'287CIRCULATORYDISORDERSEXCEPTAMI,WCARDCATHW/OMCC',
'291HEARTFAILURE&SHOCKWMCC',
'292HEARTFAILURE&SHOCKWCC',
'293HEARTFAILURE&SHOCKW/OCC/MCC',
'300PERIPHERALVASCULARDISORDERSWCC',
'301PERIPHERALVASCULARDISORDERSW/OCC/MCC',
'303ATHEROSCLEROSISW/OMCC',
'305HYPERTENSIONW/OMCC',
'308CARDIACARRHYTHMIA&CONDUCTIONDISORDERSWMCC',
'309CARDIACARRHYTHMIA&CONDUCTIONDISORDERSWCC',
'310CARDIACARRHYTHMIA&CONDUCTIONDISORDERSW/OCC/MCC',
'312SYNCOPE&COLLAPSE',
'313CHESTPAIN',
'314OTHERCIRCULATORYSYSTEMDIAGNOSESWMCC',
'315OTHERCIRCULATORYSYSTEMDIAGNOSESWCC',
'329MAJORSMALL&LARGEBOWELPROCEDURESWMCC',
'330MAJORSMALL&LARGEBOWELPROCEDURESWCC',
'372MAJORGASTROINTESTINALDISORDERS&PERITONEALINFECTIONSWCC',
'377G.I.HEMORRHAGEWMCC',
'378G.I.HEMORRHAGEWCC',
'379G.I.HEMORRHAGEW/OCC/MCC',
'389G.I.OBSTRUCTIONWCC',
'390G.I.OBSTRUCTIONW/OCC/MCC',
'391ESOPHAGITIS,GASTROENT&MISCDIGESTDISORDERSWMCC',
'392ESOPHAGITIS,GASTROENT&MISCDIGESTDISORDERSW/OMCC',
'394OTHERDIGESTIVESYSTEMDIAGNOSESWCC',
'418LAPAROSCOPICCHOLECYSTECTOMYW/OC.D.E.WCC',
'419LAPAROSCOPICCHOLECYSTECTOMYW/OC.D.E.W/OCC/MCC',
'439DISORDERSOFPANCREASEXCEPTMALIGNANCYWCC',
'460SPINALFUSIONEXCEPTCERVICALW/OMCC',
'469MAJORJOINTREPLACEMENTORREATTACHMENTOFLOWEREXTREMITYWMCC',
'470MAJORJOINTREPLACEMENTORREATTACHMENTOFLOWEREXTREMITYW/OMCC',
'473CERVICALSPINALFUSIONW/OCC/MCC',
'480HIP&FEMURPROCEDURESEXCEPTMAJORJOINTWMCC',
'481HIP&FEMURPROCEDURESEXCEPTMAJORJOINTWCC',
'482HIP&FEMURPROCEDURESEXCEPTMAJORJOINTW/OCC/MCC',
'491BACK&NECKPROCEXCSPINALFUSIONW/OCC/MCC',
'536FRACTURESOFHIP&PELVISW/OMCC',
'552MEDICALBACKPROBLEMSW/OMCC',
'563FX,SPRN,STRN&DISLEXCEPTFEMUR,HIP,PELVIS&THIGHW/OMCC',
'602CELLULITISWMCC',
'603CELLULITISW/OMCC',
'638DIABETESWCC',
'640MISCDISORDERSOFNUTRITION,METABOLISM,FLUIDS/ELECTROLYTESWMCC',
'641MISCDISORDERSOFNUTRITION,METABOLISM,FLUIDS/ELECTROLYTESW/OMCC',
'682RENALFAILUREWMCC',
'683RENALFAILUREWCC',
'684RENALFAILUREW/OCC/MCC',
'689KIDNEY&URINARYTRACTINFECTIONSWMCC',
'690KIDNEY&URINARYTRACTINFECTIONSW/OMCC',
'698OTHERKIDNEY&URINARYTRACTDIAGNOSESWMCC',
'699OTHERKIDNEY&URINARYTRACTDIAGNOSESWCC',
'811REDBLOODCELLDISORDERSWMCC',
'812REDBLOODCELLDISORDERSW/OMCC',
'853INFECTIOUS&PARASITICDISEASESWO.R.PROCEDUREWMCC',
http://localhost:8888/nbconvert/html/healthcarecosts.ipynb?download=false

3/13

8/14/2015

healthcarecosts

'870SEPTICEMIAORSEVERESEPSISWMV96+HOURS',
'871SEPTICEMIAORSEVERESEPSISW/OMV96+HOURSWMCC',
'872SEPTICEMIAORSEVERESEPSISW/OMV96+HOURSW/OMCC',
'885PSYCHOSES',
'897ALCOHOL/DRUGABUSEORDEPENDENCEW/OREHABILITATIONTHERAPYW/OMCC',
'917POISONING&TOXICEFFECTSOFDRUGSWMCC',
'918POISONING&TOXICEFFECTSOFDRUGSW/OMCC',
'948SIGNS&SYMPTOMSW/OMCC']

Now,weseethelistwithuniquevalues.Wecanalsoaddtheindexinfrontofthevalues.Thiswillbeusefulincase
wewanttousesomevaluesfromthelistandwanttoknowwhatvalueweareusing.
In[15]: #showingindexinfrontofthevalue
indexlist=[indexforindex,sinenumerate(datalist)]
withindex=[str(indexlist[i])+""+str(datalist[i])foriinrange(100)]
withindex
Out[15]: ['0039EXTRACRANIALPROCEDURESW/OCC/MCC',
'1057DEGENERATIVENERVOUSSYSTEMDISORDERSW/OMCC',
'2064INTRACRANIALHEMORRHAGEORCEREBRALINFARCTIONWMCC',
'3065INTRACRANIALHEMORRHAGEORCEREBRALINFARCTIONWCC',
'4066INTRACRANIALHEMORRHAGEORCEREBRALINFARCTIONW/OCC/MCC',
'5069TRANSIENTISCHEMIA',
'6074CRANIAL&PERIPHERALNERVEDISORDERSW/OMCC',
'7101SEIZURESW/OMCC',
'8149DYSEQUILIBRIUM',
'9176PULMONARYEMBOLISMW/OMCC',
'10177RESPIRATORYINFECTIONS&INFLAMMATIONSWMCC',
'11178RESPIRATORYINFECTIONS&INFLAMMATIONSWCC',
'12189PULMONARYEDEMA&RESPIRATORYFAILURE',
'13190CHRONICOBSTRUCTIVEPULMONARYDISEASEWMCC',
'14191CHRONICOBSTRUCTIVEPULMONARYDISEASEWCC',
'15192CHRONICOBSTRUCTIVEPULMONARYDISEASEW/OCC/MCC',
'16193SIMPLEPNEUMONIA&PLEURISYWMCC',
'17194SIMPLEPNEUMONIA&PLEURISYWCC',
'18195SIMPLEPNEUMONIA&PLEURISYW/OCC/MCC',
'19202BRONCHITIS&ASTHMAWCC/MCC',
'20203BRONCHITIS&ASTHMAW/OCC/MCC',
'21207RESPIRATORYSYSTEMDIAGNOSISWVENTILATORSUPPORT96+HOURS',
'22208RESPIRATORYSYSTEMDIAGNOSISWVENTILATORSUPPORT<96HOURS',
'23238MAJORCARDIOVASCPROCEDURESW/OMCC',
'24243PERMANENTCARDIACPACEMAKERIMPLANTWCC',
'25244PERMANENTCARDIACPACEMAKERIMPLANTW/OCC/MCC',
'26246PERCCARDIOVASCPROCWDRUGELUTINGSTENTWMCCOR4+VESSELS/STENTS',
'27247PERCCARDIOVASCPROCWDRUGELUTINGSTENTW/OMCC',
'28249PERCCARDIOVASCPROCWNONDRUGELUTINGSTENTW/OMCC',
'29251PERCCARDIOVASCPROCW/OCORONARYARTERYSTENTW/OMCC',
'30252OTHERVASCULARPROCEDURESWMCC',
'31253OTHERVASCULARPROCEDURESWCC',
'32254OTHERVASCULARPROCEDURESW/OCC/MCC',
'33280ACUTEMYOCARDIALINFARCTION,DISCHARGEDALIVEWMCC',
'34281ACUTEMYOCARDIALINFARCTION,DISCHARGEDALIVEWCC',
'35282ACUTEMYOCARDIALINFARCTION,DISCHARGEDALIVEW/OCC/MCC',
'36286CIRCULATORYDISORDERSEXCEPTAMI,WCARDCATHWMCC',
'37287CIRCULATORYDISORDERSEXCEPTAMI,WCARDCATHW/OMCC',
'38291HEARTFAILURE&SHOCKWMCC',
'39292HEARTFAILURE&SHOCKWCC',
'40293HEARTFAILURE&SHOCKW/OCC/MCC',
'41300PERIPHERALVASCULARDISORDERSWCC',
'42301PERIPHERALVASCULARDISORDERSW/OCC/MCC',
'43303ATHEROSCLEROSISW/OMCC',
'44305HYPERTENSIONW/OMCC',
'45308CARDIACARRHYTHMIA&CONDUCTIONDISORDERSWMCC',
'46309CARDIACARRHYTHMIA&CONDUCTIONDISORDERSWCC',
'47310CARDIACARRHYTHMIA&CONDUCTIONDISORDERSW/OCC/MCC',
'48312SYNCOPE&COLLAPSE',
'49313CHESTPAIN',
'50314OTHERCIRCULATORYSYSTEMDIAGNOSESWMCC',
'51315OTHERCIRCULATORYSYSTEMDIAGNOSESWCC',
'52329MAJORSMALL&LARGEBOWELPROCEDURESWMCC',
'53330MAJORSMALL&LARGEBOWELPROCEDURESWCC',
'54372MAJORGASTROINTESTINALDISORDERS&PERITONEALINFECTIONSWCC',
'55377G.I.HEMORRHAGEWMCC',
'56378G.I.HEMORRHAGEWCC',
'57379G.I.HEMORRHAGEW/OCC/MCC',
'58389G.I.OBSTRUCTIONWCC',
'59390G.I.OBSTRUCTIONW/OCC/MCC',
'60391ESOPHAGITIS,GASTROENT&MISCDIGESTDISORDERSWMCC',
'61392ESOPHAGITIS,GASTROENT&MISCDIGESTDISORDERSW/OMCC',
'62394OTHERDIGESTIVESYSTEMDIAGNOSESWCC',
http://localhost:8888/nbconvert/html/healthcarecosts.ipynb?download=false

4/13

8/14/2015

healthcarecosts

'63418LAPAROSCOPICCHOLECYSTECTOMYW/OC.D.E.WCC',
'64419LAPAROSCOPICCHOLECYSTECTOMYW/OC.D.E.W/OCC/MCC',
'65439DISORDERSOFPANCREASEXCEPTMALIGNANCYWCC',
'66460SPINALFUSIONEXCEPTCERVICALW/OMCC',
'67469MAJORJOINTREPLACEMENTORREATTACHMENTOFLOWEREXTREMITYWMCC',
'68470MAJORJOINTREPLACEMENTORREATTACHMENTOFLOWEREXTREMITYW/OMCC',
'69473CERVICALSPINALFUSIONW/OCC/MCC',
'70480HIP&FEMURPROCEDURESEXCEPTMAJORJOINTWMCC',
'71481HIP&FEMURPROCEDURESEXCEPTMAJORJOINTWCC',
'72482HIP&FEMURPROCEDURESEXCEPTMAJORJOINTW/OCC/MCC',
'73491BACK&NECKPROCEXCSPINALFUSIONW/OCC/MCC',
'74536FRACTURESOFHIP&PELVISW/OMCC',
'75552MEDICALBACKPROBLEMSW/OMCC',
'76563FX,SPRN,STRN&DISLEXCEPTFEMUR,HIP,PELVIS&THIGHW/OMCC',
'77602CELLULITISWMCC',
'78603CELLULITISW/OMCC',
'79638DIABETESWCC',
'80640MISCDISORDERSOFNUTRITION,METABOLISM,FLUIDS/ELECTROLYTESWMCC',
'81641MISCDISORDERSOFNUTRITION,METABOLISM,FLUIDS/ELECTROLYTESW/OMCC',
'82682RENALFAILUREWMCC',
'83683RENALFAILUREWCC',
'84684RENALFAILUREW/OCC/MCC',
'85689KIDNEY&URINARYTRACTINFECTIONSWMCC',
'86690KIDNEY&URINARYTRACTINFECTIONSW/OMCC',
'87698OTHERKIDNEY&URINARYTRACTDIAGNOSESWMCC',
'88699OTHERKIDNEY&URINARYTRACTDIAGNOSESWCC',
'89811REDBLOODCELLDISORDERSWMCC',
'90812REDBLOODCELLDISORDERSW/OMCC',
'91853INFECTIOUS&PARASITICDISEASESWO.R.PROCEDUREWMCC',
'92870SEPTICEMIAORSEVERESEPSISWMV96+HOURS',
'93871SEPTICEMIAORSEVERESEPSISW/OMV96+HOURSWMCC',
'94872SEPTICEMIAORSEVERESEPSISW/OMV96+HOURSW/OMCC',
'95885PSYCHOSES',
'96897ALCOHOL/DRUGABUSEORDEPENDENCEW/OREHABILITATIONTHERAPYW/OMCC',
'97917POISONING&TOXICEFFECTSOFDRUGSWMCC',
'98918POISONING&TOXICEFFECTSOFDRUGSW/OMCC',
'99948SIGNS&SYMPTOMSW/OMCC']

Oknow,weareallsetforsomestatistics.HowaboutwetaketheaveragecostofhealthcareproceduresintheUSA
In[16]: healthdata3=healthdata2.groupby("DRGDefinition").aggregate(np.mean)
healthdata3[:25]

http://localhost:8888/nbconvert/html/healthcarecosts.ipynb?download=false

5/13

8/14/2015

healthcarecosts

Out[16]:

AverageCovered
Charges
DRGDefinition
039EXTRACRANIALPROCEDURESW/OCC/MCC

30481.984921

057DEGENERATIVENERVOUSSYSTEMDISORDERSW/OMCC

24307.758568

064INTRACRANIALHEMORRHAGEORCEREBRALINFARCTIONW
MCC

48552.609210

065INTRACRANIALHEMORRHAGEORCEREBRALINFARCTIONW
CC

29492.120670

066INTRACRANIALHEMORRHAGEORCEREBRALINFARCTION
W/OCC/MCC

23592.924701

069TRANSIENTISCHEMIA

21195.604506

074CRANIAL&PERIPHERALNERVEDISORDERSW/OMCC

25147.270306

101SEIZURESW/OMCC

21513.097106

149DYSEQUILIBRIUM

18870.966569

176PULMONARYEMBOLISMW/OMCC

26738.109950

177RESPIRATORYINFECTIONS&INFLAMMATIONSWMCC

50648.951705

178RESPIRATORYINFECTIONS&INFLAMMATIONSWCC

35060.272038

189PULMONARYEDEMA&RESPIRATORYFAILURE

34276.191312

190CHRONICOBSTRUCTIVEPULMONARYDISEASEWMCC

29140.944246

191CHRONICOBSTRUCTIVEPULMONARYDISEASEWCC

23397.796103

192CHRONICOBSTRUCTIVEPULMONARYDISEASEW/OCC/MCC

17732.648424

193SIMPLEPNEUMONIA&PLEURISYWMCC

37408.917588

194SIMPLEPNEUMONIA&PLEURISYWCC

24549.381211

195SIMPLEPNEUMONIA&PLEURISYW/OCC/MCC

17886.490776

202BRONCHITIS&ASTHMAWCC/MCC

23201.770845

203BRONCHITIS&ASTHMAW/OCC/MCC

16885.022338

207RESPIRATORYSYSTEMDIAGNOSISWVENTILATORSUPPORT
96+HOURS

143428.051066

208RESPIRATORYSYSTEMDIAGNOSISWVENTILATORSUPPORT
<96HOURS

62394.721315

238MAJORCARDIOVASCPROCEDURESW/OMCC

85010.449213

243PERMANENTCARDIACPACEMAKERIMPLANTWCC

67865.711226

Thefirst25averagecostsinournewdata.WecanseethattransientischemiatreatmentcostsonaverageUSD
21195.MajorcardiovascularprocedurescostonaverageUSD85010.Let'splotthisdata(forall100different
procedures)

http://localhost:8888/nbconvert/html/healthcarecosts.ipynb?download=false

6/13

8/14/2015

healthcarecosts

In[17]: #plottingaveragecostperprocedure
%pylabinline
figsize(15,5)
healthdata3.plot(kind="bar")
Populatingtheinteractivenamespacefromnumpyandmatplotlib
Out[17]: <matplotlib.axes._subplots.AxesSubplotat0x125707f0>

Isn'tthatplotbeautiful?Well,Iagreethatitfeelsalittlecrowded.So,Iamgoingtoplotit25dataatatime.Butfirst,I
willsortthedatabyincreasingorder.Iwillshowfirstthe5procedureswiththelowestaveragecost
In[21]: #sortingtheaveragecostandshowingthe5procedureswiththelowestaveragecost
healthdata4=healthdata3.sort(["AverageCoveredCharges"])
healthdata4[:5]
Out[21]:

AverageCovered
Charges
DRGDefinition
310CARDIACARRHYTHMIA&CONDUCTIONDISORDERSW/O
CC/MCC

14933.904053

303ATHEROSCLEROSISW/OMCC

16640.698775

641MISCDISORDERSOF
NUTRITION,METABOLISM,FLUIDS/ELECTROLYTESW/OMCC

16703.611059

293HEARTFAILURE&SHOCKW/OCC/MCC

16864.201321

203BRONCHITIS&ASTHMAW/OCC/MCC

16885.022338

CardiacArrhythmiaandconductiondisordershavethelowestaveragecostoftreatmentwithaboutUSD14934.And
nowthe5procedureswithhighestaveragecosts.

http://localhost:8888/nbconvert/html/healthcarecosts.ipynb?download=false

7/13

8/14/2015

healthcarecosts

In[30]: #showingthe5procedureswiththehighestaveragecosts
healthdata4[5:]
Out[30]:

AverageCovered
Charges
DRGDefinition
246PERCCARDIOVASCPROCWDRUGELUTINGSTENTWMCCOR
4+VESSELS/STENTS

96348.806707

329MAJORSMALL&LARGEBOWELPROCEDURESWMCC

135330.939966

853INFECTIOUS&PARASITICDISEASESWO.R.PROCEDUREWMCC

139186.350937

207RESPIRATORYSYSTEMDIAGNOSISWVENTILATORSUPPORT
96+HOURS

143428.051066

870SEPTICEMIAORSEVERESEPSISWMV96+HOURS

163227.331725

Septicemia/severesepsishasthehighestaveragecostwithaboutUSD163227.
Now,let'splottheaveragecostsagain,butbyslocesof25thistimeandinincreasingorder.
In[31]: #plottingsortedaveragecostsbyslicesof25
figsize(15,5)
healthdata4[0:25].plot(kind="bar")
Out[31]: <matplotlib.axes._subplots.AxesSubplotat0x1a147e10>

http://localhost:8888/nbconvert/html/healthcarecosts.ipynb?download=false

8/13

8/14/2015

healthcarecosts

In[25]: healthdata4[25:50].plot(kind="bar")
Out[25]: <matplotlib.axes._subplots.AxesSubplotat0x19116828>

In[26]: healthdata4[50:75].plot(kind="bar")
Out[26]: <matplotlib.axes._subplots.AxesSubplotat0x19122c88>

http://localhost:8888/nbconvert/html/healthcarecosts.ipynb?download=false

9/13

8/14/2015

healthcarecosts

In[27]: healthdata4[75:].plot(kind="bar")
Out[27]: <matplotlib.axes._subplots.AxesSubplotat0x146dff98>

Thisshouldbebettertoread.Andifweweretoplotthetotalsorteddatanow,wewouldseethis.
In[29]: #plottingwholesortedaveragecostdata
healthdata4.plot(kind="bar")
Out[29]: <matplotlib.axes._subplots.AxesSubplotat0x14d7ceb8>

http://localhost:8888/nbconvert/html/healthcarecosts.ipynb?download=false

10/13

8/14/2015

healthcarecosts

Now,howabouttheaveragecostofcertainproceduresbystate.Let'splottheaveragecostofsepsis/septicemiaby
state.
In[54]: #separatingsepsis/septicemiadata
figsize(15,5)
datavalue=healthdata2["DRGDefinition"]==datalist[92]#datalist[92]isthestring
valueforsepsis/septicemia
#groupingbystateandtakingthemean/averageofthosevalues.sorting.showingfir
st5values
healthdata5=healthdata2[datavalue].groupby("ProviderState").aggregate(np.mean)
healthdata6=healthdata5.sort(["AverageCoveredCharges"])
healthdata6[:5]
Out[54]:

AverageCoveredCharges
ProviderState
MD

51667.938636

VT

80805.380000

WV

91167.078000

ND

93062.310000

MA

98737.674545

In[47]: #showingthelast5values
healthdata6[5:]
Out[47]:

AverageCoveredCharges
ProviderState
CO

192335.505000

KS

197442.948000

NV

241243.599167

NJ

247313.996667

CA

277004.642419

In[45]: #plottingaveragecostofsepsis/septicemiabystate
healthdata5.plot(kind="bar")
Out[45]: <matplotlib.axes._subplots.AxesSubplotat0x1adb0cc0>

Fromthisplot,weseethatMarylandhasthelowesttreatmentcostforSepsis/septicemiaatUSD51668and
CaliforniathehighestatUSD277005.Now,let'splottheaveragecostoftransientischemiatreatmentperstate.

http://localhost:8888/nbconvert/html/healthcarecosts.ipynb?download=false

11/13

8/14/2015

healthcarecosts

In[55]: #retrievingdatafortransientischemiacost,groupingbystate,finingaverageand
plotting
datavalue=healthdata2["DRGDefinition"]==datalist[5]#datalist[5]isthestringv
aluefortransientischemia
healthdata5=healthdata2[datavalue].groupby("ProviderState").aggregate(np.mean)
healthdata5.plot(kind="bar")
Out[55]: <matplotlib.axes._subplots.AxesSubplotat0x1f005b38>

Marylandstillhasthelowestcost.Now,let'splottheaveragecostofmajorgastrointestinaldisordersandperitoneal
infectionsperstate
In[56]: #retrievingdataformajorgastrointestinaldisordersandperitonealinfectionscos
t,groupingbystate,finingaverage
#andplotting
datavalue=healthdata2["DRGDefinition"]==datalist[54]#datalist[54]isthestring
valueforgastroinstestinal...
healthdata5=healthdata2[datavalue].groupby("ProviderState").aggregate(np.mean)
healthdata5.plot(kind="bar")
Out[56]: <matplotlib.axes._subplots.AxesSubplotat0x1f51ca20>

Marylandhasthelowestcostagain.Let'splotaveragecostformedicalbackproblemsperstate

http://localhost:8888/nbconvert/html/healthcarecosts.ipynb?download=false

12/13

8/14/2015

healthcarecosts

In[57]: #retrievingdataformedicalbackproblemscost,groupingbystate,finingaverage
andplotting
datavalue=healthdata2["DRGDefinition"]==datalist[75]#datalist[75]isthestring
valueformedicalbackproblems
healthdata5=healthdata2[datavalue].groupby("ProviderState").aggregate(np.mean)
healthdata5.plot(kind="bar")
Out[57]: <matplotlib.axes._subplots.AxesSubplotat0x1fad8f28>

Andagain,Marylandhasthelowestcost.Iamnotgoingtoplottheaveragecostperstateforall100healthcare
procedureshere.ButIhavedoneitalreadyandeverytime,Marylandhasthelowestaveragecost.AndNorthDakota
hasthesecondlowestaveragehealthcarecosts.The3stateswiththehighesthealthcarecostsaccordingtothisdata
areCalifornia,NevadaandNewJersey.Notnecessarilyinthatorder.
IwascuriousregardingMarylandhavingthelowesthealthcarecostsandIfoundthisarticlesheddingsomelighton
this:http://www.beckershospitalreview.com/finance/hasmarylandfoundasolutiontotheushealthcarecost
crisis.html(http://www.beckershospitalreview.com/finance/hasmarylandfoundasolutiontotheushealthcarecost
crisis.html)
Thankstodata.govforthedata.
RicardyRicot08/14/2015

http://localhost:8888/nbconvert/html/healthcarecosts.ipynb?download=false

13/13

Você também pode gostar