Welcome to Francis Academic Press

International Journal of New Developments in Engineering and Society, 2023, 7(8); doi: 10.25236/IJNDES.2023.070809.

Research on Automatic Terminological Dictionary Extraction Based on Bilingual Parallel Corpus


Fangting Liu

Corresponding Author:
Fangting Liu

College of Foreign Languages, Bohai University, Jinzhou, 121013, China


Bilingual dictionaries have always been a basic resource in the fields of machine translation and cross-language information retrieval. The text in parallel corpus is a pair of sentences translated into each other. It is a set of text composed of the source language text and the corresponding translated text, which has strong text alignment characteristics. By making full use of rich corpus information in parallel corpora, a bilingual automatic term extraction system is constructed by designing reasonable algorithms to realize automatic or semi-automatic term extraction, which is used to solve the difficult problems in machine translation and cross-language natural language processing. Based on the method of deep learning, this paper constructs the LSTM model based on RNN. Based on LSTM, a new BLSTM model is formed, which has the advantages of comprehensive information, strong robustness, and the ability to take into account both front and back data in natural language processing, so that the trained machine can learn more abstract samples.


Bilingual Parallel Corpus; Terminological Dictionary; Automatic Extraction; BLSTM; Recurrent Neural Network

Cite This Paper

Fangting Liu. Research on Automatic Terminological Dictionary Extraction Based on Bilingual Parallel Corpus. International Journal of New Developments in Engineering and Society (2023) Vol.7, Issue 8: 54-59. https://doi.org/10.25236/IJNDES.2023.070809.


[1] Li Y, Li Z.Research on College English Translation Teaching Based on Parallel Corpus[J].International Journal of New Developments in Education, 2023, 5(18): 55-59.

[2] L. Sun, Y. B. Jin, L. Du, et al. Automatic Extraction of Bilingual Term Lexicon from Parallel Corpora[J]. Journal of Chinese Information Processing, 2000, 15(06): 34-39.

[3] Prakash C, Sumit C. An intelligent chatbot design and implementation model using long short-term memory with recurrent neural networks and attention mechanism[J]. Decision Analytics Journal, 2023, 9(1): 100359-100359.

[4] Aschale A A, Demilie M M, Olalekan A S. Towards audio-based identification of Ethio-Semitic languages using recurrent neural network[J]. Scientific reports, 2023, 13(1): 19346-19346.

[5] Nilkanth N P, Suresh B K, S. P P. Adaptive membership enhanced fuzzy classifier with modified LSTM for automated rainfall prediction model[J]. Intelligent Decision Technologies, 2023, 17(4): 1031-1060.

[6] Yongping Z, Achyut S. Enhancing Supply Chain Transparency and Risk Management Using CNN-LSTM With Transfer Learning[J]. Journal of Organizational and End User Computing (JOEUC), 2023, 35(1): 1-22.

[7] KafiKang M, Hendawi A. Drug-Drug Interaction Extraction from Biomedical Text Using Relation BioBERT with BLSTM[J]. Machine Learning and Knowledge Extraction, 2023, 5(2): 669-683.

[8] J. Liu, J. Yang, S. S. Song. Collaborative Filtering Recommendation Algorithm Based on Purchasing Intention of Users[J]. Journal of Jilin University(Science Edition), 2021, 59(6): 1432-1438.