Chinese Named Entity Recognition Method for Domain-Specific Text

Liu, He; Ma, Yuekun; Gao, Chang; Qi, Jia; Zhang, Dezheng

doi:10.17559/TV-20230324000477

Tehnički vjesnik, Vol. 30 No. 6, 2023.

Original scientific paper

https://doi.org/10.17559/TV-20230324000477

He Liu ; College for Artificial Intelligence, North China University of Science and Technology, Tangshan 063210, China
Yuekun Ma ; 1) College for Artificial Intelligence, North China University of Science and Technology, Tangshan 063210, China 2) School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China 3) Hebei Key Laboratory of Industrial Intelligent Perception, Tangshan 063210, China *
Chang Gao ; College for Artificial Intelligence, North China University of Science and Technology, Tangshan 063210, China
Jia Qi ; Inspur Electronic Information Industry Co., Ltd.
Dezheng Zhang ; School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China *

* Corresponding author.

Full text: English, PDF, 634 KB

pp. 1799-1808

Abstract

Chinese named entity recognition (NER) is a critical task in natural language processing that aims to identify and classify named entities in text. However, because domain texts are highly specific and large-scale labelled datasets are lacking, NER methods trained on public domain corpora perform poorly on domain texts. This paper proposes a named entity recognition method that incorporates sentence semantic information: an attention mechanism and a gating mechanism adaptively fuse sentence semantic information into character semantic information, enhancing the entity feature representation while attenuating the noise generated by irrelevant character information. In addition, to address the lack of large-scale labelled samples, we use data self-augmentation methods to expand the training samples. Because the low-quality samples generated by data self-augmentation can have a negative impact on the model, we also introduce a Weighted Strategy. Experiments on the TCM prescriptions corpus show that our method achieves higher F1 values than the comparison methods.
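
The abstract does not give the fusion equations, so the following is only a minimal sketch of one common way an attention weight and a sigmoid gate can blend a sentence vector into per-character vectors. The function name `fuse_sentence_into_chars`, the parameters `w_gate` and `b_gate`, the mean-pooled sentence vector, and all dimensions are assumptions made for illustration, not the authors' formulation.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def fuse_sentence_into_chars(chars, sentence, w_gate, b_gate):
    """Hypothetical gated fusion (illustrative, not the paper's equations).

    chars:    (n, d) character representations
    sentence: (d,)   sentence representation
    w_gate:   (2d, d) gate weights, b_gate: (d,) gate bias
    """
    d = chars.shape[1]
    # Attention: scaled dot-product score of each character against the sentence.
    scores = chars @ sentence / np.sqrt(d)
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()  # softmax over the n characters
    # Sentence information delivered to each character, scaled by its weight.
    sent_info = weights[:, None] * sentence[None, :]
    # Gate: decides, per dimension, how much character vs. sentence signal to keep.
    gate = sigmoid(np.concatenate([chars, sent_info], axis=1) @ w_gate + b_gate)
    fused = gate * chars + (1.0 - gate) * sent_info
    return fused, weights, gate


# Toy usage with random inputs (assumed sizes: 6 characters, 8 dimensions).
rng = np.random.default_rng(0)
n, d = 6, 8
chars = rng.normal(size=(n, d))
sentence = chars.mean(axis=0)  # assumption: mean pooling as the sentence vector
w_gate = rng.normal(scale=0.1, size=(2 * d, d))
b_gate = np.zeros(d)
fused, weights, gate = fuse_sentence_into_chars(chars, sentence, w_gate, b_gate)
print(fused.shape)
```

In a sketch like this, characters that align poorly with the sentence vector receive small attention weights, and the gate can further suppress them, which is one plausible reading of "attenuating the noise generated by irrelevant character information".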

Keywords

attention mechanism; data augmentation; domain text; meta-learning; named entity recognition

Hrčak ID:

309230

URI

https://hrcak.srce.hr/309230

Publication date:

25.10.2023.
