hrcak mascot   Srce   HID

Izvorni znanstveni članak
https://doi.org/10.17559/TV-20190420161815

Naive Bayesian Automatic Classification of Railway Service Complaint Text Based on Eigenvalue Extraction

Lifeng Li ; School of Economics and Management, Beijing Jiaotong University, No. 3, Shangyuancun, Haidian District, Beijing, China
Wenxing Li ; School of Economics and Management, Beijing Jiaotong University, No. 3, Shangyuancun, Haidian District, Beijing, China

Puni tekst: engleski, pdf (485 KB) str. 778-785 preuzimanja: 111* citiraj
APA 6th Edition
Li, L. i Li, W. (2019). Naive Bayesian Automatic Classification of Railway Service Complaint Text Based on Eigenvalue Extraction. Tehnički vjesnik, 26 (3), 778-785. https://doi.org/10.17559/TV-20190420161815
MLA 8th Edition
Li, Lifeng i Wenxing Li. "Naive Bayesian Automatic Classification of Railway Service Complaint Text Based on Eigenvalue Extraction." Tehnički vjesnik, vol. 26, br. 3, 2019, str. 778-785. https://doi.org/10.17559/TV-20190420161815. Citirano 19.11.2019.
Chicago 17th Edition
Li, Lifeng i Wenxing Li. "Naive Bayesian Automatic Classification of Railway Service Complaint Text Based on Eigenvalue Extraction." Tehnički vjesnik 26, br. 3 (2019): 778-785. https://doi.org/10.17559/TV-20190420161815
Harvard
Li, L., i Li, W. (2019). 'Naive Bayesian Automatic Classification of Railway Service Complaint Text Based on Eigenvalue Extraction', Tehnički vjesnik, 26(3), str. 778-785. https://doi.org/10.17559/TV-20190420161815
Vancouver
Li L, Li W. Naive Bayesian Automatic Classification of Railway Service Complaint Text Based on Eigenvalue Extraction. Tehnički vjesnik [Internet]. 2019 [pristupljeno 19.11.2019.];26(3):778-785. https://doi.org/10.17559/TV-20190420161815
IEEE
L. Li i W. Li, "Naive Bayesian Automatic Classification of Railway Service Complaint Text Based on Eigenvalue Extraction", Tehnički vjesnik, vol.26, br. 3, str. 778-785, 2019. [Online]. https://doi.org/10.17559/TV-20190420161815

Sažetak
Railways have developed rapidly in China for several decades. The hardware of railways has already reached the world's leading level, but the level of service of these railways still has room for improvement. The railway management department receives a large number of passenger complaints every year and records them in text, which needs to be classified and analyzed. The text of railway complaints includes characteristics spanning wide business coverage, various events, serious colloquialisms, interference and useless information. When using the direct classification via traditional text categorization, the classification accuracy is low. The key to the automatic classification of such text lies in an eigenvalue extraction. The more accurate the eigenvalue extraction, the higher the accuracy of text classification. In this paper, the TF-IDF algorithm, TextRank algorithm and Word2vec algorithm are selected to extract text eigenvalues, and a railway complaint text classification method is constructed with a naive Bayesian classifier. The three types of eigenvalue extraction algorithms are compared. The TF-IDF algorithm, based on eigenvalue extraction, achieves the highest automatic text classification accuracy.

Ključne riječi
automatic classification; eigenvalue; naive Bayes; railway complaint text; TextRank; TF-IDF; Word2vec

Hrčak ID: 221004

URI
https://hrcak.srce.hr/221004

Posjeta: 197 *