Welcome to Francis Academic Press

Academic Journal of Computing & Information Science, 2022, 5(3); doi: 10.25236/AJCIS.2022.050312.

Analysis of Stock Data Based on Clustering Method


Tianhao Fu1, Jixin She2, Yuhao Guo3

Corresponding Author:
Tianhao Fu

1School of Economy, Fudan University, Shanghai, China

2Academy of Reading, Nanjing University of Information Science and Technology, Nanjing, China

3Huaer Zizhu Academy, Shanghai, China


Number of researches on stock data are based on machine learning method. However, former researches mainly applied all kinds of supervised learning method as well as some kind of basial clustering method, such as neuro net-work and k-means algorithm. We use an emerging clustering algorithm called AP algorithm to cluster the A-share stock data before 2017. And finally, according to the market performance, the selected 69 A-share stocks are divided into 9 classification clusters based on their market performances. Such classification has a strong guiding role for investment decision-making.


Affinity Propagation Cluster, Investment, Stock Data

Cite This Paper

Tianhao Fu, Jixin She, Yuhao Guo. Analysis of Stock Data Based on Clustering Method. Academic Journal of Computing & Information Science (2022), Vol. 5, Issue 3: 85-88. https://doi.org/10.25236/AJCIS.2022.050312.


[1] Cai Qingquan, Ma yunyun, and Li Jinmei (2019) Prediction Model of Stock Excess Return Based on Machine Learning. Information System Engineering, 6-17.

[2] Meng Ye (2019) Research and Application of Machine Learning Stock Selection Algorithm Integrating Behavioral Financ. Qingdao University, DOI:10.27262/d.cnki. gqdau. 2019.000890.

[3] Sun Fuxiong, Liu Guangming, Zeng Zixuan, and Peng Mengqi (2020) Research on Stock Suspension Prediction Based on Portfolio Model. Computer Engineering and Application, 18, 272-278.

[4] Zhang Suxiang, Liu Jianming, Zhao Bingzhen, and Cao Jinping (2013) Research on analysis Model of Residential Power Consumption Behavior Based on Cloud Computing. Power Grid Technology, 06: 1542-1546 DOI:10.13335/j.1000-3673. pst. 2013.06.010.

[5] Zhang Qian, Gao Liqun, Hu Xiangpei, and Wu Wei (2003) Clustering Improved Genetic Algorithm for Multi-objective Optimization of Logistics Distribution Path. Control and Decision Making, 04: 418-422 DOI:10.13195/j.cd. 2003.04.34. zhangq. Seve

[6] Sun Shengping (2011) Research on Hot Topic Detection and Tracking Technology of Chinese Weibo. Beijing Jiaotong University.

[7] Guo Chonghui, Jia Hongfeng, and Zhang Na (2008) Time Series Clustering Method Based on ICA and Its Application in Stock Data Analysis. Operations Research and Management, 05, 120-124.

[8] Hannah (2011) Research and Application of Clustering Algorithm in Time Series. Guangdong University of technology.

[9] Song Zongxiang (2017) Application of Fuzzy C-means Clustering in Stock Investment. Northeast Petroleum University.

[10] Hai Mo, Niu Yihan, and Zhang Yuejin (2015) Application of Parallel Clustering Algorithm For Big Data in Stock Sector Division. Big data, 04, 9-17.

[11] Frey, BJ, Dueck, D (2007) Clustering by Passing Messages between Data Points. Science, pp.972-976.