Você está na página 1de 11

MOLD START

FIRST_NAME
1442563 non-null object
LAST_NAME
1442563 non-null object
SR_CREATED
1442563 non-null object
SR_ID
1442563 non-null object
SR_NUM
1442563 non-null object
SR_TYPE
1442563 non-null object
SR_SUB_TYPE
1442563 non-null object
SR_STATUS
1442563 non-null object
CONTACT_ID
1442563 non-null object
UNIT_ID
1417999 non-null object
WORK_ORDER_NUM
1442563 non-null int64
LOCATION
1442563 non-null object
LOCATION_ID
1442563 non-null object
DESCRIPTION
1442563 non-null object
SCHEDULE_DATE
655311 non-null object
RESP_SCHEDULER
1437042 non-null object
PRIORITY
1442563 non-null int64
STATUS
1442563 non-null object
dtypes: int64(2), object(16)
memory usage: 198.1+ MB
In [88]: Location = r'/Applications/nychatest/sr_withandwithout_workorder.csv'
In [89]: df = pd.read_csv(Location)
In [90]: df.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 1483617 entries, 0 to 1483616
Data columns (total 18 columns):
FIRST_NAME
1483617 non-null object
LAST_NAME
1483617 non-null object
SR_CREATED
1483617 non-null object
SR_ID
1483617 non-null object
SR_NUM
1483617 non-null object
SR_TYPE
1483617 non-null object
SR_SUB_TYPE
1483617 non-null object
SR_STATUS
1483617 non-null object
CONTACT_ID
1483617 non-null object
UNIT_ID
1457428 non-null object
WORK_ORDER_NUM
1442584 non-null float64
LOCATION
1483617 non-null object
LOCATION_ID
1483617 non-null object
DESCRIPTION
1483617 non-null object
SCHEDULE_DATE
655321 non-null object
RESP_SCHEDULER
1437063 non-null object
PRIORITY
1457970 non-null float64
STATUS
1483617 non-null object
dtypes: float64(2), object(16)
memory usage: 203.7+ MB
In [91]: sorted(df['DESCRIPTION'].unique())
Out[91]:
['BBQ Area - Flooding',
'Basketball Courts - Flooding',
'Bathtub Shower - Faucet Leaking OOO',
'Bathtub Shower - Leak Under',
'Boiler Condensate Tank - Belly Leaking',
'Boiler Condensate Tank - Front Leaking',
'Boiler Condensate Tank - Rear Leaking',
4

'Boiler Hydro Test Leaks - Stay Bolts Leaking',


'Boiler Steam Stop - Passing Leaking Flange',
'Canopy - Flooding',
'Catch Basin - Flooding',
'Ceiling - Ceiling Leak From Above',
'Ceiling - Lead Paint',
'Ceiling - Mildew Mold',
'Compactor - Compactor Leaking',
'Compactor - Fitting Oil Leak',
'Compactor - Hose Oil Leak',
'Dish Dryer - Flooding',
'Dishwasher - Flooding',
'Drains - Flooding',
'Elevator - Flooding',
'Elevator Out Of Order - Flooding',
'Elevator Running With Problem - Flooding',
'Excessive Moisture - Excessive Moisture',
'Exterior Compactor - Compactor Leaking',
'Exterior Compactor - Hose Oil Leak',
'Exterior Compactor - Machine Oil Leak',
'Fire Hydrant - Hydrant Leaking',
'Fire Sprinklers - Leaking',
'Heat - Radiator Leaking',
'Heat - Steam Leaking',
'Hot Water Generator - Steam Leaking',
'Kitchen Cabinet - Leak Under',
'Lawn Sprinklers - Leaking',
'Leak From Above - ',
'Leak From Above - Above The Sink',
'Leak From Above - Constant Dripping',
'Leak From Above - Constant Leaking',
'Leak From Above - Flooding',
'Leak From Above - Leadbend Leaking',
'Leak From Above - Leak Over Bathtub',
'Leak From Above - Needs Asbestos Testing',
'Leak From Above - Needs Lead Testing',
'Leak From Above - Over Toilet',
'Leak From Above - Over Window',
'Leak From Above - Shower Leaking',
'Leak From Above - Water Penetration',
'Mildew - ',
'Mildew Condition - ',
'Mildew Condition - Mildew',
'Mildew Condition - Needs Cleaning',
'Mildew Condition - Needs Painting',
'Mildew Condition - Paint After Repair',
'Mildew Condition - Vent OOO',
'Paint - Peeling',
'Pipes - Pipes Leaking',
'Pipes - Wall Leak',
'Pump - Pump Leaking',
'Ramp - Flooding',
'Refrigerator - Refrigerator Leaking',
'Roof Tank - Flooding',
'Roof Tank - Leaking',
'Rooftop - Flooding',
'Sill Sock - Sill Cock Leaking',
'Sink - Leak Under',
'Sink - Speedy Connect Leaking or DML',
5

'Sink - Stoppage With Flooding',


'Stand Pipe - Stand Pipe Leaking',
'Steam Distribution - Condensate Line Leaking',
'Steam Distribution - Steam Main Leak',
'Terrace - Flooding',
'TnkRm Circulator Pump - Pump Leaking',
'TnkRm Heel Trap - Heel Trap Leaking',
'TnkRm Hot Water Circulate Pmp - Hot Water Leaking',
'TnkRm Hot Water Temp Regulator - Hot Water Temp Regulator Leak',
'TnkRm Hot Water Trap - Hot Water Trap Leaking',
'TnkRm Low Vacuum Switch - Vacuum Switch Leaking',
'Toilet - Bowl Leaking',
'Toilet - Tank Leaking',
'Urinal - Urinal Leaking',
'Walls - Lead Paint',
'Walls - Mildew Mold',
'Walls - Wall Leak',
'Water Fountain - Leaking']
In [92]: walls = 'Walls - Mildew Mold'
In [93]: ceiling = 'Ceiling - Mildew Mold'
In [94]: df['SR_CREATED'] = pd.to_datetime(df['SR_CREATED'])
In [95]: maskmold = df['DESCRIPTION'].isin([walls, ceiling])
In [96]: df = df[maskmold]
In [97]: df.info()
<class 'pandas.core.frame.DataFrame'>
Int64Index: 6326 entries, 17 to 1482709
Data columns (total 18 columns):
FIRST_NAME
6326 non-null object
LAST_NAME
6326 non-null object
SR_CREATED
6326 non-null datetime64[ns]
SR_ID
6326 non-null object
SR_NUM
6326 non-null object
SR_TYPE
6326 non-null object
SR_SUB_TYPE
6326 non-null object
SR_STATUS
6326 non-null object
CONTACT_ID
6326 non-null object
UNIT_ID
6320 non-null object
WORK_ORDER_NUM
6295 non-null float64
LOCATION
6326 non-null object
LOCATION_ID
6326 non-null object
DESCRIPTION
6326 non-null object
SCHEDULE_DATE
2692 non-null object
RESP_SCHEDULER
6292 non-null object
PRIORITY
6303 non-null float64
STATUS
6326 non-null object
dtypes: datetime64[ns](1), float64(2), object(15)
memory usage: 939.0+ KB
In [98]: df.head()
Out[98]:
FIRST_NAME LAST_NAME
SR_CREATED
17
Yolanda
Ruiz 2009-12-31 01:23:15
26
Gary
Sims 2009-12-31 01:07:48

SR_ID
1-1G62RA
1-1G60Y0

SR_NUM
1-87623542
1-87621192

SR_TYPE
Complaint
Complaint

36
42
47

ANTHONY
Maria
Maria

VELAZQUEZ 2009-12-31 01:58:14


Cardenas 2009-12-31 01:47:46
Cardenas 2009-12-31 01:47:45

17
26
36
42
47

SR_SUB_TYPE SR_STATUS
Maintenance Request
Closed
Maintenance Request
Closed
Maintenance Request
Closed
Maintenance Request
Closed
Maintenance Request
Closed

17
26
36
42
47

LOCATION
Bathroom
Kitchen
Foyer/Hallway
Bedroom
Living Room

17
26
36
42
47

In [99]: df.tail()
Out[99]:
FIRST_NAME
1482284
RAMON
1482353 Elizabeth
1482514
Carrie
1482540
Aquilina
1482709
Yvette

LAST_NAME
BENITEZ
Hogan
Birchmore
Espada
Walker

1482284
1482353
1482514
1482540
1482709

SR_TYPE
Complaint
Complaint
Complaint
Complaint
Complaint

1482284
1482353
1482514
1482540
1482709

WORK_ORDER_NUM
10115126.0
11256522.0
16106337.0
16620828.0
20463027.0

1482284
1482353
1482514
1482540
1482709

Walls
Ceiling
Ceiling
Ceiling
Ceiling

CONTACT_ID
1-89G-70
1-33D-3739
1-P0-4547
1-P0-195
1-P0-195

LOCATION_ID
024.10.016.F04.04C.BTH01
056.25.078.F01.01B.KIT01
114.05.009.F05.05G.FH01
114.06.011.F08.08F.BD03
114.06.011.F08.08F.LIV01

SCHEDULE_DATE RESP_SCHEDULER
NaN
SIEBEL
NaN
SIEBEL
NaN
SIEBEL
NaN
SIEBEL
NaN
SIEBEL

1-1G6BRG
1-1G69J6
1-1G69IC

PRIORITY
7.0
7.0
7.0
7.0
7.0

UNIT_ID
1-KSI-5
1-3VP-733
1-DE-2521
1-DH-545
1-DH-545

Walls
Walls
Walls
Walls
Walls

Complaint
Complaint
Complaint

WORK_ORDER_NUM
7920827.0
7919679.0
7923326.0
7922533.0
7922535.0

DESCRIPTION
Mildew Mold
Mildew Mold
Mildew Mold
Mildew Mold
Mildew Mold

STATUS
Closed
Closed
Closed
Closed
Closed

SR_CREATED
2010-03-02 20:38:12
2010-06-09 22:20:34
2011-01-08 01:20:31
2011-02-24 23:22:40
2011-10-26 01:32:00

SR_SUB_TYPE SR_STATUS
Maintenance Request
Closed
Maintenance Request
Closed
Maintenance Request
Closed
Maintenance Request
Closed
Maintenance Request
Closed
LOCATION
Bathroom
Bathroom
Bathroom
Bathroom
Bathroom

1-87635212
1-87632322
1-87632292

SR_ID
1-1VOCJ7
1-2JABVO
1-3YAPV4
1-4VW6CC
1-BKAH8Q

SR_NUM
1-113670259
1-153327012
1-239005552
1-295434012
1-699209162

CONTACT_ID
1-10V89I
1-P0-1200
1-33D-2025
1-33Y-3310
1-34M-1347

LOCATION_ID
024.12.019.F01.01E.BTH01
164.01.015.F02.02K.BTH01
167.01.037.F15.15B.BTH01
010.10.019.F02.02F.BTH01
056.11.034.F03.03D.BTH01

DESCRIPTION SCHEDULE_DATE RESP_SCHEDULER


Mildew Mold
NaN
SIEBEL
Mildew Mold
07/01/2010
SIEBEL
Mildew Mold
NaN
SIEBEL
Mildew Mold
02/01/2012
SIEBEL
Mildew Mold
11/03/2011
SIEBEL

UNIT_ID
1-K3R-6
1-DK-1038
1-3VA-1368
1-3V4-742
1-3VJ-1353

PRIORITY
7.0
3.0
3.0
3.0
3.0

STATUS
Closed
Closed
Closed
Closed
Closed

In [100]: df.to_excel('maskmoldoutput.xlsx', index=False)


In [101]: df.info()
<class 'pandas.core.frame.DataFrame'>
7

Int64Index: 6326 entries, 17 to 1482709


Data columns (total 18 columns):
FIRST_NAME
6326 non-null object
LAST_NAME
6326 non-null object
SR_CREATED
6326 non-null datetime64[ns]
SR_ID
6326 non-null object
SR_NUM
6326 non-null object
SR_TYPE
6326 non-null object
SR_SUB_TYPE
6326 non-null object
SR_STATUS
6326 non-null object
CONTACT_ID
6326 non-null object
UNIT_ID
6320 non-null object
WORK_ORDER_NUM
6295 non-null float64
LOCATION
6326 non-null object
LOCATION_ID
6326 non-null object
DESCRIPTION
6326 non-null object
SCHEDULE_DATE
2692 non-null object
RESP_SCHEDULER
6292 non-null object
PRIORITY
6303 non-null float64
STATUS
6326 non-null object
dtypes: datetime64[ns](1), float64(2), object(15)
memory usage: 939.0+ KB
In [102]: df_moldfirstyr = df[(df.SR_CREATED >= '01/01/2009 00:00:00') &
(df.SR_CREATED <= '12/31/2009 23:59:59')]
In [103]: df_moldsecondyr = df[(df.SR_CREATED >= '01/01/2010 00:00:00') &
(df.SR_CREATED <= '12/31/2010 23:59:59')]
In [104]: df_moldthirdyr = df[(df.SR_CREATED >= '01/01/2011 00:00:00') &
(df.SR_CREATED <= '12/31/2011 23:59:59')]
In [105]: df_moldfourthyr = df[(df.SR_CREATED >= '01/01/2012 00:00:00') &
(df.SR_CREATED <= '12/31/2012 23:59:59')]
In [106]: df_moldfifthyr = df[(df.SR_CREATED >= '01/01/2013 00:00:00') &
(df.SR_CREATED <= '12/31/2013 23:59:59')]
In [107]: df_moldsixthyr = df[(df.SR_CREATED >= '01/01/2014 00:00:00') &
(df.SR_CREATED <= '12/31/2014 23:59:59')]
In [108]: df_moldseventhyr = df[(df.SR_CREATED >= '01/01/2015 00:00:00') &
(df.SR_CREATED <= '12/31/2015 23:59:59')]
In [109]: len(df_moldfirstyr['SR_CREATED'])
Out[109]: 9
In [110]: len(df_moldsecondyr['SR_CREATED'])
Out[110]: 2786
In [111]: len(df_moldthirdyr['SR_CREATED'])
Out[111]: 3316
In [112]: len(df_moldfourthyr['SR_CREATED'])
Out[112]: 204
In [113]: len(df_moldfifthyr['SR_CREATED'])
Out[113]: 2

In [114]: len(df_moldsixthyr['SR_CREATED'])
Out[114]: 7
In [115]: len(df_moldseventhyr['SR_CREATED'])
Out[115]: 2
In [116]: df.info()
<class 'pandas.core.frame.DataFrame'>
Int64Index: 6326 entries, 17 to 1482709
Data columns (total 18 columns):
FIRST_NAME
6326 non-null object
LAST_NAME
6326 non-null object
SR_CREATED
6326 non-null datetime64[ns]
SR_ID
6326 non-null object
SR_NUM
6326 non-null object
SR_TYPE
6326 non-null object
SR_SUB_TYPE
6326 non-null object
SR_STATUS
6326 non-null object
CONTACT_ID
6326 non-null object
UNIT_ID
6320 non-null object
WORK_ORDER_NUM
6295 non-null float64
LOCATION
6326 non-null object
LOCATION_ID
6326 non-null object
DESCRIPTION
6326 non-null object
SCHEDULE_DATE
2692 non-null object
RESP_SCHEDULER
6292 non-null object
PRIORITY
6303 non-null float64
STATUS
6326 non-null object
dtypes: datetime64[ns](1), float64(2), object(15)
memory usage: 939.0+ KB
In [117]: len(df['UNIT_ID'].unique())
Out[117]: 5214
In [118]:

MILDEW START

In [222]: df_NYDN.to_excel('NYDNleadtimeframeoutput.xlsx', index=False)


In [223]: Location = r'/Applications/nychatest/sr_with_workorder_only.csv.csv'
In [224]: Location = r'/Applications/nychatest/sr_with_workorder_only.csv'
In [225]: df = pd.read_csv(Location)
In [226]: df.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 1442563 entries, 0 to 1442562
Data columns (total 18 columns):
FIRST_NAME
1442563 non-null object
LAST_NAME
1442563 non-null object
SR_CREATED
1442563 non-null object
SR_ID
1442563 non-null object
SR_NUM
1442563 non-null object
SR_TYPE
1442563 non-null object
SR_SUB_TYPE
1442563 non-null object
SR_STATUS
1442563 non-null object
CONTACT_ID
1442563 non-null object
UNIT_ID
1417999 non-null object
WORK_ORDER_NUM
1442563 non-null int64
LOCATION
1442563 non-null object
LOCATION_ID
1442563 non-null object
DESCRIPTION
1442563 non-null object
SCHEDULE_DATE
655311 non-null object
RESP_SCHEDULER
1437042 non-null object
PRIORITY
1442563 non-null int64
STATUS
1442563 non-null object
dtypes: int64(2), object(16)
memory usage: 198.1+ MB
In [227]: Location = r'/Applications/nychatest/sr_withandwithout_workorder.csv'
In [228]: df = pd.read_csv(Location)
In [229]: df['SR_CREATED'] = pd.to_datetime(df['SR_CREATED'])
In [230]: pattern = 'Mildew'
In [231]: maskmildew = df['DESCRIPTION'].str.contains(pattern)
In [232]: df = df[maskmildew]
In [233]: df['DESCRIPTION'].unique()
Out[233]:
array(['Walls - Mildew Mold', 'Mildew Condition - Mildew',
'Mildew Condition - Vent OOO', 'Mildew Condition - Needs Painting',
'Mildew Condition - Needs Cleaning', 'Ceiling - Mildew Mold',
'Mildew Condition - ', 'Mildew - ',
'Mildew Condition - Paint After Repair'], dtype=object)
In [234]: pattern = 'Mildew Condition'
In [235]: maskmildew = df['DESCRIPTION'].str.contains(pattern)
In [236]: df = df[maskmildew]
5

In [237]: df['DESCRIPTION'].unique()
Out[237]:
array(['Mildew Condition - Mildew', 'Mildew Condition - Vent OOO',
'Mildew Condition - Needs Painting',
'Mildew Condition - Needs Cleaning', 'Mildew Condition - ',
'Mildew Condition - Paint After Repair'], dtype=object)
In [238]: Location = r'/Applications/nychatest/sr_withandwithout_workorder.csv'
In [239]: df = pd.read_csv(Location)
In [240]: df['SR_CREATED'] = pd.to_datetime(df['SR_CREATED'])
In [241]: mildew = 'Mildew - '
In [242]: mildewc = 'Mildew Condition - '
In [243]: mildewcm = 'Mildew Condition - Mildew'
In [244]: mildewcnc = 'Mildew Condition - Needs Cleaning'
In [245]: mildewcnp = 'Mildew Condition - Needs Painting'
In [246]: mildewcpar = 'Mildew Condition - Paint After Repair'
In [247]: mildewcv = 'Mildew Condition - Vent OOO'
In [248]: maskmildew = df['DESCRIPTION'].isin([mildew, mildewc, mildewcm,
mildewcnc, mildewcnp, mildewcpar, mildewcv])
In [249]: df = df[maskmildew]
In [250]: df.info()
<class 'pandas.core.frame.DataFrame'>
Int64Index: 139281 entries, 22 to 1483616
Data columns (total 18 columns):
FIRST_NAME
139281 non-null object
LAST_NAME
139281 non-null object
SR_CREATED
139281 non-null datetime64[ns]
SR_ID
139281 non-null object
SR_NUM
139281 non-null object
SR_TYPE
139281 non-null object
SR_SUB_TYPE
139281 non-null object
SR_STATUS
139281 non-null object
CONTACT_ID
139281 non-null object
UNIT_ID
139104 non-null object
WORK_ORDER_NUM
138734 non-null float64
LOCATION
139281 non-null object
LOCATION_ID
139281 non-null object
DESCRIPTION
139281 non-null object
SCHEDULE_DATE
77822 non-null object
RESP_SCHEDULER
138196 non-null object
PRIORITY
138949 non-null float64
STATUS
139281 non-null object
dtypes: datetime64[ns](1), float64(2), object(15)
memory usage: 20.2+ MB
In [251]: df['DESCRIPTION'].unique()
6

Out[251]:
array(['Mildew
'Mildew
'Mildew
'Mildew

Condition - Mildew', 'Mildew Condition - Vent OOO',


Condition - Needs Painting',
Condition - Needs Cleaning', 'Mildew Condition - ',
- ', 'Mildew Condition - Paint After Repair'], dtype=object)

In [252]: df_mildewfirstyr = df[(df.SR_CREATED >= '01/01/2009 00:00:00') &


(df.SR_CREATED <= '12/31/2009 23:59:59')]
In [253]: df_mildewsecondyr = df[(df.SR_CREATED >= '01/01/2010 00:00:00') &
(df.SR_CREATED <= '12/31/2010 23:59:59')]
In [254]: df_mildewthirdyr = df[(df.SR_CREATED >= '01/01/2011 00:00:00') &
(df.SR_CREATED <= '12/31/2011 23:59:59')]
In [255]: df_mildewfourthyr = df[(df.SR_CREATED >= '01/01/2012 00:00:00') &
(df.SR_CREATED <= '12/31/2012 23:59:59')]
In [256]: df_mildewfifthyr = df[(df.SR_CREATED >= '01/01/2013 00:00:00') &
(df.SR_CREATED <= '12/31/2013 23:59:59')]
In [257]: df_mildewsixthyr = df[(df.SR_CREATED >= '01/01/2014 00:00:00') &
(df.SR_CREATED <= '12/31/2014 23:59:59')]
In [258]: df_mildewseventhyr = df[(df.SR_CREATED >= '01/01/2015 00:00:00') &
(df.SR_CREATED <= '12/31/2015 23:59:59')]
In [259]: len(df_mildewfirstyr['SR_CREATED'])
Out[259]: 34
In [260]: len(df_mildewsecondyr['SR_CREATED'])
Out[260]: 5484
In [261]: len(df_mildewthirdyr['SR_CREATED'])
Out[261]: 5904
In [262]: len(df_mildewfourthyr['SR_CREATED'])
Out[262]: 8703
In [263]: len(df_mildewfifthyr['SR_CREATED'])
Out[263]: 31841
In [264]: len(df_mildewsixthyr['SR_CREATED'])
Out[264]: 40654
In [265]: len(df_mildewseventhyr['SR_CREATED'])
Out[265]: 46661
In [266]: df.info()
<class 'pandas.core.frame.DataFrame'>
Int64Index: 139281 entries, 22 to 1483616
Data columns (total 18 columns):
FIRST_NAME
139281 non-null object
LAST_NAME
139281 non-null object
SR_CREATED
139281 non-null datetime64[ns]
SR_ID
139281 non-null object
SR_NUM
139281 non-null object
SR_TYPE
139281 non-null object
SR_SUB_TYPE
139281 non-null object
7

SR_STATUS
139281 non-null object
CONTACT_ID
139281 non-null object
UNIT_ID
139104 non-null object
WORK_ORDER_NUM
138734 non-null float64
LOCATION
139281 non-null object
LOCATION_ID
139281 non-null object
DESCRIPTION
139281 non-null object
SCHEDULE_DATE
77822 non-null object
RESP_SCHEDULER
138196 non-null object
PRIORITY
138949 non-null float64
STATUS
139281 non-null object
dtypes: datetime64[ns](1), float64(2), object(15)
memory usage: 20.2+ MB
In [267]: df['DESCRIPTION'].str.contains(mildew).sum()
Out[267]: 1
In [268]: df['DESCRIPTION'].str.contains(mildewc).sum()
Out[268]: 139280
In [269]: df['DESCRIPTION'].str.contains(mildewcm).sum()
Out[269]: 92571
In [270]: df['DESCRIPTION'].str.contains(mildewcnc).sum()
Out[270]: 37478
In [271]: df.info()
<class 'pandas.core.frame.DataFrame'>
Int64Index: 139281 entries, 22 to 1483616
Data columns (total 18 columns):
FIRST_NAME
139281 non-null object
LAST_NAME
139281 non-null object
SR_CREATED
139281 non-null datetime64[ns]
SR_ID
139281 non-null object
SR_NUM
139281 non-null object
SR_TYPE
139281 non-null object
SR_SUB_TYPE
139281 non-null object
SR_STATUS
139281 non-null object
CONTACT_ID
139281 non-null object
UNIT_ID
139104 non-null object
WORK_ORDER_NUM
138734 non-null float64
LOCATION
139281 non-null object
LOCATION_ID
139281 non-null object
DESCRIPTION
139281 non-null object
SCHEDULE_DATE
77822 non-null object
RESP_SCHEDULER
138196 non-null object
PRIORITY
138949 non-null float64
STATUS
139281 non-null object
dtypes: datetime64[ns](1), float64(2), object(15)
memory usage: 20.2+ MB
In [272]: df['DESCRIPTION'].isin(mildew).sum()
--------------------------------------------------------------------------TypeError
Traceback (most recent call last)
<ipython-input-272-c45d756bdb48> in <module>()
----> 1 df['DESCRIPTION'].isin(mildew).sum()
/Applications/anaconda/lib/python3.5/site-packages/pandas/core/series.py in
isin(self, values)
8

2413
2414
"""
-> 2415
result = algos.isin(_values_from_object(self), values)
2416
return self._constructor(result,
index=self.index).__finalize__(self)
2417
/Applications/anaconda/lib/python3.5/site-packages/pandas/core/algorithms.py in
isin(comps, values)
114
raise TypeError("only list-like objects are allowed to be passed"
115
" to isin(), you passed a "
--> 116
"[{0}]".format(type(values).__name__))
117
if not isinstance(values, np.ndarray):
118
values = list(values)
TypeError: only list-like objects are allowed to be passed to isin(), you passed a
[str]
In [273]: df['DESCRIPTION'].isin([mildew]).sum()
Out[273]: 1
In [274]: df['DESCRIPTION'].isin([mildewc]).sum()
Out[274]: 7
In [275]: df['DESCRIPTION'].isin([mildewcm]).sum()
Out[275]: 92571
In [276]: df['DESCRIPTION'].isin([mildewcnc]).sum()
Out[276]: 37478
In [277]: df['DESCRIPTION'].isin([mildewcnp]).sum()
Out[277]: 7221
In [278]: df['DESCRIPTION'].isin([mildewcpar]).sum()
Out[278]: 1
In [279]: df['DESCRIPTION'].isin([mildewcv]).sum()
Out[279]: 2002
In [280]: df.to_excel('maskmildewoutput.xlsx', index=False)
In [281]:

Você também pode gostar