Welcome to Francis Academic Press

Academic Journal of Computing & Information Science, 2022, 5(3); doi: 10.25236/AJCIS.2022.050303.

Research on Early Warning of Customer Churn Based on Mutual Information and Integrated Learning—Taking Ctrip as an Example


Wei Yang

Corresponding Author:
Wei Yang

School of Mathematics and Statistics, Northeastern University at Qinhuangdao, Qinhuangdao, Hebei, 066099, China


With the increasing competition pressure among tourism e-commerce platforms, how to reduce customer churn to the greatest extent is of great significance to tourism e-commerce platforms. Based on this, this paper takes Ctrip's hotel customer-related data as an example, and first uses a supervised feature selection method based on mutual information to select features that have an important impact on customer churn. Then, by using the cross-validation method, combined with evaluation indicators such as accuracy rate, F1-sorce, AUC, etc., select the best model set from Logistic regression, Support Vector Machine, Decision Tree, Random Forest, GBDT, XGBoost, and LightGBM. A subset of the optimal models, and then the optimal model fusion is performed. The empirical results show that the multi-model fusion has higher accuracy and stability. In addition, based on the model fusion results, this paper obtains the importance ranking of customer personal characteristics. Finally, this paper puts forward relevant suggestions on how to accurately manage Ctrip and reduce the customer churn rate.


customer churn, feature selection, machine learning

Cite This Paper

Wei Yang. Research on Early Warning of Customer Churn Based on Mutual Information and Integrated Learning—Taking Ctrip as an Example. Academic Journal of Computing & Information Science (2022), Vol. 5, Issue 3: 23-27. https://doi.org/10.25236/AJCIS.2022.050303.


[1] Aronoff S. Voting system [J]. Advances in Computers, 2021, 121: 495-500.

[2] Chen Peng. Study on customer loss prediction and influencing factors of Ctrip hotel based on Stacking [D]. Central University for Nationalities, 2021.

[3] Du Le. B2C E-commerce Enterprise Customer Classification Research [D]. Northern University of Technology, 2014.

[4] Fernandez-Peralta R, Massanet S, Mir A. A New Edge Detector Based on SMOTE and Logistic Regression [J]. 2017.

[5] Lee Jin. Crisis analysis of Ctrip network customer loss [D]. Guizhou University of Finance and Economics.

[6] Qiu Y, Li C. Research on E-commerce User Churn Prediction Based on Logistic Regression [C]// 0.

[7] Wirtz B W, Lihotzky N. Customer Retention Management in the B2C Electronic Business[J]. Long Range Planning, 2003, 36(6): 517-532.

[8] Yu Xiaobing, Cao Jie, Gong in Wu. Customer Loss Research Review [J]. Computer Integrated Manufacturing System, 2012, 18 (10): 11.