Welcome to Francis Academic Press

Academic Journal of Computing & Information Science, 2023, 6(1); doi: 10.25236/AJCIS.2023.060112.

Research on keyword extraction based on abstract extraction


Zihao Yan

Corresponding Author:
Zihao Yan

Nanjing University of Finance & Economics, Nanjing, China, 210023


In order to improve the accuracy of text keyword extraction, this paper combined with the relevant methods of abstract extraction, aiming to extract key sentences through the abstract extraction method, and then optimize the effect of keyword extraction. The experimental results show that the effect of the keyword extraction algorithm on the text after abstract extraction is improved by 6.92 percentage points.


Keyword Extraction, Abstract Extraction, Supervised

Cite This Paper

Zihao Yan. Research on keyword extraction based on abstract extraction. Academic Journal of Computing & Information Science (2023), Vol. 6, Issue 1: 77-82. https://doi.org/10.25236/AJCIS.2023.060112.


[1] Turney, P.D. (2000) Learning algorithms for keyphrase extraction. Information Retrieval, 2(4), 303-336.

[2] Zhang, C.Z. (2007) Review and Prospect of Automatic Indexing Research. New Technology of Library and Information Service, 11, 33-39.

[3] Cohen, J.D. (1995) Highlights: language- and domain-independent automatic indexing terms for abstracting. Journal of the American Society for Information Science, 46(3), 162-174.

[4] Salton, G., Yang, C.S., and Yu, C.T. (1975) A Theory of Term Importance in Automatic Text Analysis. Journal of the American Society for Information Science, 26(1), 33-44.

[5] Matsuo, Y., and Ishizuka, M. (2008) Keyword Extraction from a Single Document Using Word Co-occurrence Statistical Information[J]. International Journal on Artificial Intelligence Tools, 13(1), 157-169.

[6] Barker, K., and Cornacchia, N. (2000) Using noun phrase heads to extract document keyphrases. Springer Berlin Heidelberg.

[7] Mihalcea, R., and Tarau, P. (2004) Textrank: bringing order into texts. Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 404-411.

[8 Bougouin, A., Boudin, F., and Daille, B. (2013) TopicRank: Graph-Based Topic Ranking for Keyphrase Extraction. International Joint Conference on Natural Language Processing, 543-551.

[9] Radev, D.R. (2004) Lexrank: graph-based lexical centrality as salience in text summarization. Journal of Qiqihar Junior Teachers College, 22, 457-479.

[10] Yasunaga, M., Zhang, R., Meelu, K., Pareek, A., Srinivasan, K., and Radev, D. (2017) Graph-based Neural Multi-Document Summarization. Proceedings of the 21st Conference on Computational Natural Language Learning, 452-462.

[11] Xu, J., Liu, J., Cheng, Y., and Gan, Z. (2020) Discourse-Aware Neural Extractive Text Summarization. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 5021-5031.

[12] Liu, Y., and Lapata, M. (2019) Text Summarization with Pretrained Encoders. Association for Computational Linguistics, 3728-3738.

[13] Dai, Z., and Callan, J. (2020) Context-Aware Term Weighting For First Stage Passage Retrieval. SIGIR '20: The 43rd International ACM SIGIR conference on research and development in Information Retrieval. ACM.

[14] Nguyen, T., Rosenberg, M., Song, X., Gao, J., Tiwary, S., and Majumder, R., et al. (2016) Ms marco: a human generated machine reading comprehension dataset, In CoCo@ NIPs.