Research on model of network information extraction based on improved topic-focused Web crawler key technology

Chen, Mo; Yang, Xiao-Ping

doi:10.17559/TV-20150314134638

Tehnički vjesnik, Vol. 23 No. 4, 2016.

Original scientific paper

https://doi.org/10.17559/TV-20150314134638

Mo Chen ; School of Information, Renmin University of China, Beijing 100872, China
Xiao-Ping Yang ; School of Information, Renmin University of China, Beijing 100872, China

Full text: Croatian pdf 1.329 Kb

pp. 1025-1035

downloads: 532

Full text: English pdf 1.329 Kb

pp. 1025-1035

downloads: 420

Abstract

With the arrival of the big data era, characterized by semi-structured or unstructured text, the accurate extraction of network information has attracted wide attention from researchers. This paper proposes a model of network information extraction based on improved topic-focused Web crawler key technology, taking Web news as the object of extraction. The authors describe in detail the main functions, methods and technologies used or implemented at each layer of the model, and focus on how to efficiently extract topic-oriented network information from a large number of Web news instances, in order to explore a research method for network information extraction. The experimental results show the feasibility, validity and superiority of the model design. The model plays an important role in constructing a topic-focused Web news corpus that provides a real-time data source for trust analysis, currency analysis, hot topic detection and topic evolution tracking of Web news.

Keywords

network information extraction; relativity calculation; search strategy; topic-focused web crawler
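
The keywords "relativity calculation" and "search strategy" name the two core pieces of any topic-focused crawler: a relevance score for each page and a rule for choosing which link to follow next. The abstract does not give the authors' formulas, so the following is only a generic sketch under stated assumptions, not the paper's method: relevance is assumed to be cosine similarity between term-frequency vectors, the search strategy is assumed to be best-first (links inherit the score of the page that linked to them), and the "web" is a hypothetical in-memory dictionary rather than real HTTP fetching. All function and parameter names are illustrative.

```python
import heapq
import math
import re
from collections import Counter


def term_vector(text):
    """Bag-of-words term-frequency vector over lowercased alphabetic tokens."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def focused_crawl(pages, seeds, topic, threshold=0.1, limit=10):
    """Best-first crawl over a hypothetical in-memory web.

    pages: dict mapping url -> (page_text, list_of_outlinks)  (assumed shape)
    Returns a list of (url, relevance) for pages judged on-topic.
    """
    topic_vec = term_vector(topic)
    # heapq is a min-heap, so priorities are negated to pop the best first.
    frontier = [(-1.0, url) for url in seeds]
    heapq.heapify(frontier)
    seen = set(seeds)
    kept = []
    while frontier and len(kept) < limit:
        _, url = heapq.heappop(frontier)
        if url not in pages:
            continue  # dangling link in the toy graph
        text, links = pages[url]
        score = cosine(term_vector(text), topic_vec)
        if score < threshold:
            continue  # off-topic page: neither kept nor expanded
        kept.append((url, score))
        for link in links:
            if link not in seen:
                seen.add(link)
                # Assumption: a link is as promising as the page it came from.
                heapq.heappush(frontier, (-score, link))
    return kept
```

Calling `focused_crawl(pages, seeds=["a"], topic="election news")` on a small dictionary of news-like pages returns the on-topic pages in the order they were visited. Pruning at the threshold keeps the crawl focused, at the cost of missing relevant pages that are reachable only through off-topic ones.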

Hrčak ID:

163814

URI

https://hrcak.srce.hr/163814

Publication date:

16 August 2016

Data in other languages: Croatian

Visits: 2,406 *
