Escolar Documentos
Profissional Documentos
Cultura Documentos
FIRST_NAME
1442563 non-null object
LAST_NAME
1442563 non-null object
SR_CREATED
1442563 non-null object
SR_ID
1442563 non-null object
SR_NUM
1442563 non-null object
SR_TYPE
1442563 non-null object
SR_SUB_TYPE
1442563 non-null object
SR_STATUS
1442563 non-null object
CONTACT_ID
1442563 non-null object
UNIT_ID
1417999 non-null object
WORK_ORDER_NUM
1442563 non-null int64
LOCATION
1442563 non-null object
LOCATION_ID
1442563 non-null object
DESCRIPTION
1442563 non-null object
SCHEDULE_DATE
655311 non-null object
RESP_SCHEDULER
1437042 non-null object
PRIORITY
1442563 non-null int64
STATUS
1442563 non-null object
dtypes: int64(2), object(16)
memory usage: 198.1+ MB
In [88]: Location = r'/Applications/nychatest/sr_withandwithout_workorder.csv'
In [89]: df = pd.read_csv(Location)
In [90]: df.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 1483617 entries, 0 to 1483616
Data columns (total 18 columns):
FIRST_NAME
1483617 non-null object
LAST_NAME
1483617 non-null object
SR_CREATED
1483617 non-null object
SR_ID
1483617 non-null object
SR_NUM
1483617 non-null object
SR_TYPE
1483617 non-null object
SR_SUB_TYPE
1483617 non-null object
SR_STATUS
1483617 non-null object
CONTACT_ID
1483617 non-null object
UNIT_ID
1457428 non-null object
WORK_ORDER_NUM
1442584 non-null float64
LOCATION
1483617 non-null object
LOCATION_ID
1483617 non-null object
DESCRIPTION
1483617 non-null object
SCHEDULE_DATE
655321 non-null object
RESP_SCHEDULER
1437063 non-null object
PRIORITY
1457970 non-null float64
STATUS
1483617 non-null object
dtypes: float64(2), object(16)
memory usage: 203.7+ MB
In [91]: sorted(df['DESCRIPTION'].unique())
Out[91]:
['BBQ Area - Flooding',
'Basketball Courts - Flooding',
'Bathtub Shower - Faucet Leaking OOO',
'Bathtub Shower - Leak Under',
'Boiler Condensate Tank - Belly Leaking',
'Boiler Condensate Tank - Front Leaking',
'Boiler Condensate Tank - Rear Leaking',
4
SR_ID
1-1G62RA
1-1G60Y0
SR_NUM
1-87623542
1-87621192
SR_TYPE
Complaint
Complaint
36
42
47
ANTHONY
Maria
Maria
17
26
36
42
47
SR_SUB_TYPE SR_STATUS
Maintenance Request
Closed
Maintenance Request
Closed
Maintenance Request
Closed
Maintenance Request
Closed
Maintenance Request
Closed
17
26
36
42
47
LOCATION
Bathroom
Kitchen
Foyer/Hallway
Bedroom
Living Room
17
26
36
42
47
In [99]: df.tail()
Out[99]:
FIRST_NAME
1482284
RAMON
1482353 Elizabeth
1482514
Carrie
1482540
Aquilina
1482709
Yvette
LAST_NAME
BENITEZ
Hogan
Birchmore
Espada
Walker
1482284
1482353
1482514
1482540
1482709
SR_TYPE
Complaint
Complaint
Complaint
Complaint
Complaint
1482284
1482353
1482514
1482540
1482709
WORK_ORDER_NUM
10115126.0
11256522.0
16106337.0
16620828.0
20463027.0
1482284
1482353
1482514
1482540
1482709
Walls
Ceiling
Ceiling
Ceiling
Ceiling
CONTACT_ID
1-89G-70
1-33D-3739
1-P0-4547
1-P0-195
1-P0-195
LOCATION_ID
024.10.016.F04.04C.BTH01
056.25.078.F01.01B.KIT01
114.05.009.F05.05G.FH01
114.06.011.F08.08F.BD03
114.06.011.F08.08F.LIV01
SCHEDULE_DATE RESP_SCHEDULER
NaN
SIEBEL
NaN
SIEBEL
NaN
SIEBEL
NaN
SIEBEL
NaN
SIEBEL
1-1G6BRG
1-1G69J6
1-1G69IC
PRIORITY
7.0
7.0
7.0
7.0
7.0
UNIT_ID
1-KSI-5
1-3VP-733
1-DE-2521
1-DH-545
1-DH-545
Walls
Walls
Walls
Walls
Walls
Complaint
Complaint
Complaint
WORK_ORDER_NUM
7920827.0
7919679.0
7923326.0
7922533.0
7922535.0
DESCRIPTION
Mildew Mold
Mildew Mold
Mildew Mold
Mildew Mold
Mildew Mold
STATUS
Closed
Closed
Closed
Closed
Closed
SR_CREATED
2010-03-02 20:38:12
2010-06-09 22:20:34
2011-01-08 01:20:31
2011-02-24 23:22:40
2011-10-26 01:32:00
SR_SUB_TYPE SR_STATUS
Maintenance Request
Closed
Maintenance Request
Closed
Maintenance Request
Closed
Maintenance Request
Closed
Maintenance Request
Closed
LOCATION
Bathroom
Bathroom
Bathroom
Bathroom
Bathroom
1-87635212
1-87632322
1-87632292
SR_ID
1-1VOCJ7
1-2JABVO
1-3YAPV4
1-4VW6CC
1-BKAH8Q
SR_NUM
1-113670259
1-153327012
1-239005552
1-295434012
1-699209162
CONTACT_ID
1-10V89I
1-P0-1200
1-33D-2025
1-33Y-3310
1-34M-1347
LOCATION_ID
024.12.019.F01.01E.BTH01
164.01.015.F02.02K.BTH01
167.01.037.F15.15B.BTH01
010.10.019.F02.02F.BTH01
056.11.034.F03.03D.BTH01
UNIT_ID
1-K3R-6
1-DK-1038
1-3VA-1368
1-3V4-742
1-3VJ-1353
PRIORITY
7.0
3.0
3.0
3.0
3.0
STATUS
Closed
Closed
Closed
Closed
Closed
In [114]: len(df_moldsixthyr['SR_CREATED'])
Out[114]: 7
In [115]: len(df_moldseventhyr['SR_CREATED'])
Out[115]: 2
In [116]: df.info()
<class 'pandas.core.frame.DataFrame'>
Int64Index: 6326 entries, 17 to 1482709
Data columns (total 18 columns):
FIRST_NAME
6326 non-null object
LAST_NAME
6326 non-null object
SR_CREATED
6326 non-null datetime64[ns]
SR_ID
6326 non-null object
SR_NUM
6326 non-null object
SR_TYPE
6326 non-null object
SR_SUB_TYPE
6326 non-null object
SR_STATUS
6326 non-null object
CONTACT_ID
6326 non-null object
UNIT_ID
6320 non-null object
WORK_ORDER_NUM
6295 non-null float64
LOCATION
6326 non-null object
LOCATION_ID
6326 non-null object
DESCRIPTION
6326 non-null object
SCHEDULE_DATE
2692 non-null object
RESP_SCHEDULER
6292 non-null object
PRIORITY
6303 non-null float64
STATUS
6326 non-null object
dtypes: datetime64[ns](1), float64(2), object(15)
memory usage: 939.0+ KB
In [117]: len(df['UNIT_ID'].unique())
Out[117]: 5214
In [118]:
MILDEW START
In [237]: df['DESCRIPTION'].unique()
Out[237]:
array(['Mildew Condition - Mildew', 'Mildew Condition - Vent OOO',
'Mildew Condition - Needs Painting',
'Mildew Condition - Needs Cleaning', 'Mildew Condition - ',
'Mildew Condition - Paint After Repair'], dtype=object)
In [238]: Location = r'/Applications/nychatest/sr_withandwithout_workorder.csv'
In [239]: df = pd.read_csv(Location)
In [240]: df['SR_CREATED'] = pd.to_datetime(df['SR_CREATED'])
In [241]: mildew = 'Mildew - '
In [242]: mildewc = 'Mildew Condition - '
In [243]: mildewcm = 'Mildew Condition - Mildew'
In [244]: mildewcnc = 'Mildew Condition - Needs Cleaning'
In [245]: mildewcnp = 'Mildew Condition - Needs Painting'
In [246]: mildewcpar = 'Mildew Condition - Paint After Repair'
In [247]: mildewcv = 'Mildew Condition - Vent OOO'
In [248]: maskmildew = df['DESCRIPTION'].isin([mildew, mildewc, mildewcm,
mildewcnc, mildewcnp, mildewcpar, mildewcv])
In [249]: df = df[maskmildew]
In [250]: df.info()
<class 'pandas.core.frame.DataFrame'>
Int64Index: 139281 entries, 22 to 1483616
Data columns (total 18 columns):
FIRST_NAME
139281 non-null object
LAST_NAME
139281 non-null object
SR_CREATED
139281 non-null datetime64[ns]
SR_ID
139281 non-null object
SR_NUM
139281 non-null object
SR_TYPE
139281 non-null object
SR_SUB_TYPE
139281 non-null object
SR_STATUS
139281 non-null object
CONTACT_ID
139281 non-null object
UNIT_ID
139104 non-null object
WORK_ORDER_NUM
138734 non-null float64
LOCATION
139281 non-null object
LOCATION_ID
139281 non-null object
DESCRIPTION
139281 non-null object
SCHEDULE_DATE
77822 non-null object
RESP_SCHEDULER
138196 non-null object
PRIORITY
138949 non-null float64
STATUS
139281 non-null object
dtypes: datetime64[ns](1), float64(2), object(15)
memory usage: 20.2+ MB
In [251]: df['DESCRIPTION'].unique()
6
Out[251]:
array(['Mildew
'Mildew
'Mildew
'Mildew
SR_STATUS
139281 non-null object
CONTACT_ID
139281 non-null object
UNIT_ID
139104 non-null object
WORK_ORDER_NUM
138734 non-null float64
LOCATION
139281 non-null object
LOCATION_ID
139281 non-null object
DESCRIPTION
139281 non-null object
SCHEDULE_DATE
77822 non-null object
RESP_SCHEDULER
138196 non-null object
PRIORITY
138949 non-null float64
STATUS
139281 non-null object
dtypes: datetime64[ns](1), float64(2), object(15)
memory usage: 20.2+ MB
In [267]: df['DESCRIPTION'].str.contains(mildew).sum()
Out[267]: 1
In [268]: df['DESCRIPTION'].str.contains(mildewc).sum()
Out[268]: 139280
In [269]: df['DESCRIPTION'].str.contains(mildewcm).sum()
Out[269]: 92571
In [270]: df['DESCRIPTION'].str.contains(mildewcnc).sum()
Out[270]: 37478
In [271]: df.info()
<class 'pandas.core.frame.DataFrame'>
Int64Index: 139281 entries, 22 to 1483616
Data columns (total 18 columns):
FIRST_NAME
139281 non-null object
LAST_NAME
139281 non-null object
SR_CREATED
139281 non-null datetime64[ns]
SR_ID
139281 non-null object
SR_NUM
139281 non-null object
SR_TYPE
139281 non-null object
SR_SUB_TYPE
139281 non-null object
SR_STATUS
139281 non-null object
CONTACT_ID
139281 non-null object
UNIT_ID
139104 non-null object
WORK_ORDER_NUM
138734 non-null float64
LOCATION
139281 non-null object
LOCATION_ID
139281 non-null object
DESCRIPTION
139281 non-null object
SCHEDULE_DATE
77822 non-null object
RESP_SCHEDULER
138196 non-null object
PRIORITY
138949 non-null float64
STATUS
139281 non-null object
dtypes: datetime64[ns](1), float64(2), object(15)
memory usage: 20.2+ MB
In [272]: df['DESCRIPTION'].isin(mildew).sum()
--------------------------------------------------------------------------TypeError
Traceback (most recent call last)
<ipython-input-272-c45d756bdb48> in <module>()
----> 1 df['DESCRIPTION'].isin(mildew).sum()
/Applications/anaconda/lib/python3.5/site-packages/pandas/core/series.py in
isin(self, values)
8
2413
2414
"""
-> 2415
result = algos.isin(_values_from_object(self), values)
2416
return self._constructor(result,
index=self.index).__finalize__(self)
2417
/Applications/anaconda/lib/python3.5/site-packages/pandas/core/algorithms.py in
isin(comps, values)
114
raise TypeError("only list-like objects are allowed to be passed"
115
" to isin(), you passed a "
--> 116
"[{0}]".format(type(values).__name__))
117
if not isinstance(values, np.ndarray):
118
values = list(values)
TypeError: only list-like objects are allowed to be passed to isin(), you passed a
[str]
In [273]: df['DESCRIPTION'].isin([mildew]).sum()
Out[273]: 1
In [274]: df['DESCRIPTION'].isin([mildewc]).sum()
Out[274]: 7
In [275]: df['DESCRIPTION'].isin([mildewcm]).sum()
Out[275]: 92571
In [276]: df['DESCRIPTION'].isin([mildewcnc]).sum()
Out[276]: 37478
In [277]: df['DESCRIPTION'].isin([mildewcnp]).sum()
Out[277]: 7221
In [278]: df['DESCRIPTION'].isin([mildewcpar]).sum()
Out[278]: 1
In [279]: df['DESCRIPTION'].isin([mildewcv]).sum()
Out[279]: 2002
In [280]: df.to_excel('maskmildewoutput.xlsx', index=False)
In [281]: