Welcome to Francis Academic Press

International Journal of Frontiers in Engineering Technology, 2024, 6(6); doi: 10.25236/IJFET.2024.060601.

The Design and Implementation of an End-side Dictionary-based Chinese-English Word Segmentation Algorithm

Author(s)

Gao Qun

Corresponding Author:
Gao Qun
Affiliation(s)

School of Intelligent Transportation Modern Industry, Anhui Sanlian University, Hefei, 230601, China

Abstract

By analyzing the Chinese-English word segmentation requirements on the device side, this paper elaborates on the design and implementation process of a dictionary-based Chinese-English word segmentation algorithm on the device side. Through in-depth analysis of the word segmentation requirements on the device side and combining advanced algorithm strategies, an efficient and accurate word segmentation algorithm is designed. Verified by experiments, this algorithm performs well on different devices. On the manually annotated Chinese and English test sets, the accuracy of whole-sentence word segmentation reaches more than 91.3% and 82.1% respectively, providing an implementation idea for the implementation on the device side.

Keywords

Device Side; Chinese-English Word Segmentation Algorithm; Good Performance

Cite This Paper

Gao Qun. The Design and Implementation of an End-side Dictionary-based Chinese-English Word Segmentation Algorithm. International Journal of Frontiers in Engineering Technology (2024), Vol. 6, Issue 6: 1-7. https://doi.org/10.25236/IJFET.2024.060601.

References

[1] Chao Shen, Jinxia Dai, Mengge Mao, et al. Research on Chinese Word Segmentation Algorithm based on Dictionary and Statistical Method and Its Application in the field of Power Grid Control[C]//3rd International Symposium on Information Science and Engineering Technology(siset2022), 2022: 19-23.

[2] Huang Linjieqiong, Li Xingshan. The effects of lexical- and sentence-level contextual cues on Chinese word segmentation. [J]. Psychonomic Bulletin & Review, 2023: null.

[3] Liu Yang, Yu Tian, Ding Yi. A New Chinese Word Segmentation Based on Maximum Probability Path [J]. Computer & Digital Engineering, 2022, 50(03):591-596.

[4] Zou Zhimin, Guo Heqing, Gao Ying. English String Segmentation Method[J].Application Research of Computers, 2007, (07):52-54. 

[5] Zhang Jun, Lai Zhipeng, Li Xue. Cross-domain Chinese Word Segmentation Based on New Word Discovery[J].Journal of Electronics & Information Technology, 2022, 44(09):3241-3248.