International Journal of New Developments in Engineering and Society, 2025, 9(1); doi: 10.25236/IJNDES.2025.090102.
Yining Xiong
School of International Police Studies, People's Public Security University of China, Beijing, 100038, China
The establishment of crime prediction models using big data has become a key part of current police work. This paper combines binary classification with SVM, Random Forest, XGBoost, and GBDT methods based on the data of burglary, battery, assault, and criminal damage in Chicago from 2015 to 2020, and compares the prediction results. In this experiment, GBDT and XGBoost presented relatively stable and excellent data, and the scores of F1-score were 0.69 and 0.71 respectively, with high scores. It means that the model has strong generalization ability, can better adapt to different data distributions, and can be used more in the future to provide decision support for local police work.
Machine learning, Crime occurrence prediction, XGBoost, and GBDT
Yining Xiong. Research on Crime Occurrence Prediction Using Machine Learning Methods—Considering Four Types of Crime in Chicago. International Journal of New Developments in Engineering and Society (2025) Vol.9, Issue 1: 7-14. https://doi.org/10.25236/IJNDES.2025.090102.
[1] HE Rixing, LU Yumei, JIANG Chao, DENG Yue, LI Xinran, SHI Dong. Progress in Research and Practice of Spatial-temporal Crime Prediction over the Past Decade[J]. Geo-Information Science, 2023, 25(4): 866-882
[2] Yan Jinghua, Hou Miaomiao. Research on Time Series Prediction of Theft Crime Based on LSTM Network [J]. Data Analysis and Knowledge Discovery, 2020, 4(11): 84-91.
[3] Shen Hanlei, Zhang Hu, Zhang Yaofeng, et al. Research on Burglary Crime Prediction Based on Long Short-Term Memory Model[J]. Statistics and Information Forum, 2019, 34(11): 107-115.
[4] Han Yishi, Fan Yingsheng, Li Guojun, et al. Analysis of the Growth Trend of Communication Network Fraud Crime Based on ARIMA Model: A Case Study of Quzhou City, Zhejiang Province[J]. Theoretical Observation, 2017, (05): 101-103.
[5] Hou Miaomiao, Hu Xiaofeng. Crime Prediction Research Based on Time Series Model SARIMA[J]. Journal of People's Public Security University of China (Natural Science Edition), 2021, 27(02): 67-73.
[6] Liu Xiaojuan, Gao Liansheng. Application of Grey System Theory in Dynamic Crime Prediction[J]. Journal of People's Public Security University of China (Social Science Edition), 2005, 66(1): 44-48.
[7] Alves L G, Ribeiro H V, Rodrigues F A. Crime Prediction Through Urban Metrics and Statistical Learning [J]. Physica A: Statistical Mechanics and Its Applications, 2018, 505: 435-443.
[8] Uttley J, Canwell R, Smith J,et al.Does darkness increase the risk of certain types of crime? A registered report protocol[J].PLoS ONE (v.1;2006), 2024, 19(1):18. DOI:10.1371/journal. pone. 0291971.
[9] Huang Na, He Jingsha, Sun Jingchao, et al. Crime Trend Prediction Method Based on Improved LSTM Network[J]. Journal of Beijing University of Technology, 2019, 45(08): 742-748.
[10] Yu Hongzhi, Liu Fengxin, Zou Kaiqi. Improved Fuzzy BP Neural Network and Its Application in Crime Prediction[J]. Journal of Liaoning University of Engineering and Technology (Natural Science Edition), 2012, 31(02): 244-247.
[11] Mwaniki B, Mwalili T, Ogada K. Crime Prediction Using Decision Trees, Random Forests, and Hybrid Algorithm: A Comparative Analysis[C]//Proceedings of the 2023 7th International Conference on New Media Studies (CONMEDIA), Bali, Indonesia, 2023: 99-104.
[12] Zhou Z H. Machine Learning [M]. Beijing: Tsinghua University Press, 2016: 121-139, 298-300.
[13] Li H. Statistical Learning Methods[M]. Beijing: Tsinghua University Press, 2012: Chapter 7, 95-135.
[14] Biau G, Devroye L. On the layered nearest neighbour estimate, the bagged nearest neighbour estimate and the random forest method in regression and classification[J]. Journal of Multivariate Analysis, 2010, 101: 2499-2518.
[15] Kim Y A, Wo J C, Hipp J R .Estimating Age-Graded Effects of Businesses on Crime in Place [J]. Justice Quarterly, 2023, 40(3):403-426. DOI:10.1080/07418825.2022.2107943.
[16] Rainio O, Teuho J, Klén, Riku. Evaluation metrics and statistical tests for machine learning[J]. Scientific Reports, 14(1):1-14[2024-11-26].
[17] Hand D J, Christen P, Kirielle N .F*: an interpretable transformation of the F-measure[J]. Machine Learning, 2021(1).
[18] Mandavkar Y K .Crime Analysis and Prediction using Data Mining[J].International Journal of Advanced Research in Science, Communication and Technology, 2023.DOI:10.48175/ijarsct-12027.
[19] Parisineni S R A, Pal M .Enhancing trust and interpretability of complex machine learning models using local interpretable model agnostic shap explanations[J].International Journal of Data Science and Analytics, 2024, 18(4):457-466.