
Academic Journal of Computing & Information Science, 2019, 2(1); doi: 10.25236/AJCIS.010024.

Online Comment Text Analysis with Improved Feature Weight


Chaoju Hu*, Xiaojie Yang

Corresponding Author:
Chaoju Hu

Department of Computer, North China Electric Power University, Baoding 071000, China
*Corresponding author e-mail: y2296665134@163.com


Abstract

On online shopping platforms, a product review often carries both a star rating and a text comment from the same user. When a poorly designed scoring system processes these data, it can give merchants an inaccurate rating and mislead consumers. To improve the scoring system, this paper proposes an improved feature weighting method that combines the text of a review with its star rating. First, the weighting rules are defined. Second, feature words are selected according to a part-of-speech evaluation function for short text, combining the star rating with the content of the text comment. The CBOW model is then used to train the word vectors. Finally, an SVM text classifier produces the final result. The model was applied to two text datasets and compared with traditional TF-IDF feature selection. The results show that the algorithm improves both the accuracy and the F1 value of text classification.
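As a rough illustration of the comparison mentioned in the abstract, the sketch below computes plain TF-IDF weights for tokenized reviews and then rescales them by star rating. The abstract does not state the paper's weighting rules, so the `star_scaled` rule, its neutral point, and all function names are assumptions made purely for illustration. The part-of-speech evaluation function, CBOW training, and the SVM step are omitted.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document (plain TF-IDF)."""
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


def star_scaled(weights, stars, max_stars=5):
    """HYPOTHETICAL rule (not from the paper): boost the weights of a review
    in proportion to how far its star rating lies from the neutral midpoint."""
    neutral = (max_stars + 1) / 2
    scaled = []
    for doc_weights, rating in zip(weights, stars):
        factor = 1 + abs(rating - neutral) / (max_stars - neutral)
        scaled.append({term: w * factor for term, w in doc_weights.items()})
    return scaled


# Toy data: two tokenized reviews with their star ratings (made up).
reviews = [["good", "phone"], ["bad", "phone"]]
stars = [5, 3]

baseline = tf_idf(reviews)
combined = star_scaled(baseline, stars)
print(baseline)
print(combined)
```

Under this assumed rule, a 3-star review keeps its TF-IDF weights unchanged, while a 1-star or 5-star review has them doubled. A term that occurs in every review, such as "phone" here, receives zero weight either way.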


Keywords

Online user review, SVM, short text classification, sentiment analysis

Cite This Paper

Chaoju Hu, Xiaojie Yang, Online Comment Text Analysis with Improved Feature Weight. Academic Journal of Computing & Information Science (2019) Vol. 2: 110-119. https://doi.org/10.25236/AJCIS.010024.

