Escolar Documentos
Profissional Documentos
Cultura Documentos
© 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 1037
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 09 | Sep 2018 www.irjet.net p-ISSN: 2395-0072
Sampling is the process where appropriate data are used • Test Sample: Model performances will be validated on
which may reduce the running time for the algorithm. Using this sample. 30% or 20% of the data goes here
python, the preprocessing is done.
Model Selection
2.2 Functional Diagram of Proposed Work
Based on the defined goal(s) (supervised or unsupervised)
It can be divided into 4 parts: we have to select one of or combinations of modeling
techniques. Such as
1. Descriptive analysis on the Data
• KNN Classification
2. Data treatment (Missing value and outlier fixing)
• Logistic Regression
3. Data Modelling
• Decision Trees
4. Estimation of performance
• Random Forest
• Bayesian methods
Build/Develop/Train Models
© 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 1038
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 09 | Sep 2018 www.irjet.net p-ISSN: 2395-0072
Check Model performance - Error, Accuracy The results are obtained after undergoing various processes
that comes under machine learning
Validate/Test Model
Predictive modelling.
Score and Predict using Test Sample
Table 1- Dataset
Check Model Performance: Accuracy etc.
3. IMPLEMENTATION
© 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 1039
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 09 | Sep 2018 www.irjet.net p-ISSN: 2395-0072
As we can see from the results obtained from the table the
algorithm which can be used for the predictive modeling will
be KNN algorithms with accuracy of 0.787 highest among the
rest of the algorithm.
Crime Visualization
This section deals with the analysis done on the dataset and
plotting them into various graphs like bar, pie, Figure 3 – No of crimes committed over months in a
scatter.
Analysis done are year
5. Types of crimes committed over Time (Month/ Hour).
The graph below shows the arrest ratio made in the city.
67.2 % of crimes committed by the criminals are not
6. No of crimes of all types of crime over the whole city of arrested and rest 32.8 % are arrested.
Chicago.
7. Arrested ratio.
© 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 1040
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 09 | Sep 2018 www.irjet.net p-ISSN: 2395-0072
5. CONCLUSION
REFERENCES
Figure 7- Details of the Major crimes (theft and 3. Sivaranjani, S., Sivakumari, S., & Aasha, M. (2016,
battery) committed October). Crime prediction and forecasting in
Tamilnadu using clustering approaches. In
Emerging Technological Trends (ICETT),
© 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 1041
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 09 | Sep 2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 1042