Welcome to Francis Academic Press

Academic Journal of Mathematical Sciences, 2023, 4(4); doi: 10.25236/AJMS.2023.040403.

Report on the Analysis and Prediction of Wordle Data Based on the SIR Model


Hangyu Zeng

Corresponding Author:
Hangyu Zeng

School of Statistics and Mathematics, Central University of Finance and Economics, Beijing, 102206, China


Wordle is a New York Times crossword puzzle that has become very popular recently. In this paper, a SIR infectious disease model was developed to explain the reasons for the variation in the number of daily reported results and to predict the number of future reports. By building the SIR contagion model, this paper explains that the main reason for the change in the number of daily reports over time is Twitter publicity, and predicts that the number of reported results on March 1, 2023 will be approximately 9411. Further, by analyzing the correlation and feature importance ratings from decision tree regression, we conclude that words with common letters and high frequency are more likely to be guessed by players with fewer guesses, whereas words with repeated letters were less favorable. Also, the presence or absence of common letters in words had the most significant effect on the difficulty of the game.


Wordle, SIR Infectious Disease Model, Decision Tree

Cite This Paper

Hangyu Zeng. Report on the Analysis and Prediction of Wordle Data Based on the SIR Model. Academic Journal of Mathematical Sciences (2023) Vol. 4, Issue 4: 17-21. https://doi.org/10.25236/AJMS.2023.040403.


[1] Yang Chaobo, Xie Weihong, Wang Lizang. Research on optimization and intervention of SIR model for online public opinion [J]. Frontiers of Data and Computing Development, 2023, 5(01): 115-127. 

[2] Qu Y, Zhai JW. Analysis of project implicit knowledge transfer factors based on improved SIR model [J]. Industrial Engineering, 2023, 26(01): 146-152. 

[3] Zheng M. M., Zhang W. N., Liu Y., Wu X. T. Research on the propagation process of pedestrian crossing violation based on migration-based SIR model [J]. Journal of Dalian Jiaotong University, 2023, 44(01): 53-57+63. 

[4] Yang JH, Li X, Liu H. Diagnostic efficacy of serum sTREM-1 combined with SIRS score for sepsis in burn patients with complications [J]. Chinese general medicine, 2023, 21(02): 234-237. 

[5] Chen J, Xiong Y, Tong J, et al. Analysis and prediction of COVID-19 in the US based on the time-varying parameters SIR model[J]. Journal of Physics: Conference Series, 2020, 1678(1):012082 (8pp).DOI:10.1088/1742-6596/1678/1/012082.

[6] Sang R, Zhang L, Wu H. A two-patch SIRS contagion model for media-induced migration rate change [J]. Journal of Xinjiang University (Natural Science Edition) (in English and Chinese), 2023, 40(01): 49-56+60. 

[7] Zhao Yanjun, Sun Xiaohui, Su Li, Li Wenxuan. Qualitative analysis of stochastic SIRS infectious disease models with logistic growth and Beddington-DeAngelis incidence [J]. Journal of Mathematical Physics, 2022, 42(06): 1861-1872. 

[8] Zhao Xiaoqiang, Luo Weilan, Liang Haopeng. Bearing fault diagnosis based on SIR multilevel residual connected dense network [J]. Journal of Lanzhou University of Technology, 2022, 48(06): 46-54. 

[9] Cang Linqing. Modeling and research of Si-SIR rumor propagation model on complex social networks [D]. Nanjing University of Posts and Telecommunications, 2022. 

[10] Chen Wenhao, Cui Ruiwen, Liu Mengna. Research on the cross-contagion mechanism of risk among banks in China - based on SIRS contagion model [J]. North China Finance, 2022, (11): 24-34.