Kaiyao Tan¹, Zhikun Luo²
¹Guizhou University, School of Computer Science, Guizhou 550025, China
²Hunan University of Science and Technology, School of Resource & Environment and Safety Engineering, Xiangtan 411201, China
Advances in computing have made it practical to apply machine learning algorithms to classification and prediction problems in medicine, helping professionals judge and diagnose disease more quickly. In this paper, we propose a breast cancer prediction model based on the stacking algorithm, which integrates several traditional machine learning algorithms, and we compare it with AdaBoost, SVM, and other algorithms in terms of accuracy, ROC curve, PR curve, and F1 score. The experiments show that the accuracy of the stacking-based breast cancer classification model can reach 97.23%, which is 6% higher than the classification accuracy of SVM, AdaBoost, and the other algorithms, and that the AUC of the ROC curve improves by up to 0.26. The model can therefore serve as a reference for breast cancer prediction and examination.
Stacking, Ensemble Learning
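To make the stacking idea concrete, here is a minimal sketch of the general technique, not the authors' implementation. The abstract does not name the base learners, the meta-learner, or the dataset, so everything below is an assumption chosen for illustration: synthetic two-feature data (`make_data`), two single-feature threshold classifiers as base learners (`Stump`), and a logistic regression meta-learner (`fit_logistic`) trained on out-of-fold base scores.

```python
import math
import random


def make_data(n, seed):
    """Synthetic two-feature binary data; a hypothetical stand-in for real tumour features."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n):
        label = i % 2
        centre = 2.0 if label == 1 else 0.0
        X.append([rng.gauss(centre, 1.0), rng.gauss(centre, 1.0)])
        y.append(label)
    return X, y


class Stump:
    """Hypothetical base learner: signed distance of one feature to a threshold."""

    def __init__(self, feature):
        self.feature = feature
        self.threshold = 0.0
        self.sign = 1.0

    def fit(self, X, y):
        pos = [x[self.feature] for x, t in zip(X, y) if t == 1]
        neg = [x[self.feature] for x, t in zip(X, y) if t == 0]
        mean_pos, mean_neg = sum(pos) / len(pos), sum(neg) / len(neg)
        self.threshold = (mean_pos + mean_neg) / 2.0
        self.sign = 1.0 if mean_pos >= mean_neg else -1.0
        return self

    def score(self, x):
        return self.sign * (x[self.feature] - self.threshold)


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_logistic(Z, y, lr=0.1, epochs=300):
    """Meta-learner: logistic regression trained by batch gradient descent."""
    w = [0.0] * (len(Z[0]) + 1)  # last entry is the bias
    for _ in range(epochs):
        grad = [0.0] * len(w)
        for z, t in zip(Z, y):
            err = sigmoid(sum(wi * zi for wi, zi in zip(w, z)) + w[-1]) - t
            for j, zj in enumerate(z):
                grad[j] += err * zj
            grad[-1] += err
        w = [wi - lr * g / len(Z) for wi, g in zip(w, grad)]
    return w


def stack_fit(X, y, make_bases, k=5):
    """Train the meta-learner on out-of-fold base scores, then refit the bases on all data."""
    n = len(X)
    Z = [None] * n
    for fold in range(k):
        train = [i for i in range(n) if i % k != fold]
        held_out = [i for i in range(n) if i % k == fold]
        bases = [b.fit([X[i] for i in train], [y[i] for i in train]) for b in make_bases()]
        for i in held_out:
            Z[i] = [b.score(X[i]) for b in bases]
    meta = fit_logistic(Z, y)
    bases = [b.fit(X, y) for b in make_bases()]
    return bases, meta


def stack_predict(model, x):
    bases, w = model
    z = [b.score(x) for b in bases]
    return 1 if sum(wi * zi for wi, zi in zip(w, z)) + w[-1] >= 0 else 0


def accuracy(predict, X, y):
    return sum(predict(x) == t for x, t in zip(X, y)) / len(y)


X_train, y_train = make_data(200, seed=0)
X_test, y_test = make_data(200, seed=1)
model = stack_fit(X_train, y_train, lambda: [Stump(0), Stump(1)])
stacked_acc = accuracy(lambda x: stack_predict(model, x), X_test, y_test)
print(f"stacked accuracy: {stacked_acc:.3f}")
```

The meta-learner sees only base scores produced for samples the base learners were not trained on, which is what separates stacking from simply averaging models fitted on the same data. The accuracy printed here describes the synthetic data only and says nothing about the results reported in the paper.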
Kaiyao Tan, Zhikun Luo. Predictive Analysis of Breast Cancer Based on Stacking Algorithm. Academic Journal of Medicine & Health Sciences (2021) Vol. 2, Issue 1: 36-41. https://doi.org/10.25236/AJMHS.2021.020107.