The Frontiers of Society, Science and Technology, 2023, 5(11); doi: 10.25236/FSST.2023.051109.

Key Technologies for Constructing Bilingual Corpus for English-Chinese Translation

Author(s)

Yanming Li

Corresponding Author

Yanming Li

Affiliation(s)

School of Applied Technology, University of Science and Technology Liaoning, Anshan, China

Abstract

A bilingual corpus provides translators with rich language resources and translation examples. It allows the original text and its corresponding translation to be retrieved together, uses parallel-corpus-based translation as a way to cross the boundary between source language and target language, and uses a bilingual dictionary as the main knowledge source for query translation. Drawing on research results in corpus linguistics, this paper studies the key technologies for constructing a bilingual corpus for English-Chinese translation. For corpus collection, the paper designs a collection process based on the basic principles of web crawler technology, which provides the basis for system programming. For corpus alignment, which finds the corresponding source and target text in bilingual or multilingual texts, the paper recommends the ABBYY Aligner tool. For corpus annotation, the paper examines the classification and basic principles of annotation and addresses its core problems.

Keywords

English-Chinese Translation; Bilingual Corpus; Key Technologies; Corpus Collection; Corpus Alignment; Corpus Annotation

Cite This Paper

Yanming Li. Key Technologies for Constructing Bilingual Corpus for English-Chinese Translation. The Frontiers of Society, Science and Technology (2023) Vol. 5, Issue 11: 55-59. https://doi.org/10.25236/FSST.2023.051109.
