Skoči na glavni sadržaj

Izvorni znanstveni članak

https://doi.org/10.17559/TV-20241202002164

Research on Key Technology of Web Topic Detection Based on Tuple Semantic Description Analysis for Big Data

Mo Chen ; Big Data Management and Application Major, School of Business, Beijing Union University, No. 3, Yanjing Dongli, Chaoyang District, Beijing *

* Dopisni autor.


Puni tekst: engleski pdf 1.291 Kb

str. 225-236

preuzimanja: 124

citiraj


Sažetak

In the context of the numerical intelligence age, how to further promote research on Web topic detection with heterogeneous big data as an important research object has attracted widespread attention from scholars. This paper takes network heterogeneous big data as the main research object and proposes a Web topic detection idea based on tuple semantic description analysis. Firstly, the main implementation methods are explained, including time item analysis, named entity item analysis, event item analysis, and semantic formal analysis. Secondly, the Web topic detection algorithm based on tuple semantic description analysis is elaborated. Through experiments, firstly, the impact of the quantity of Web news on the quality of time item description and five tuple semantic description is analysed, the influence of the adjustment parameters in the algorithm on the quality of the described Web news named entity items is also analysed to obtain the optimal adjustment range for these parameters, the impact of the long and short time selection range in the algorithm on the calculation results of the popularity weight of Web news named entity items is also analysed to obtain its optimal adjustment range, the impact of the adjustment parameter in the algorithm on the quality of the described Web news event items is also analysed to obtain the optimal adjustment range of the parameter. Secondly, the quality of topic detection under different semantic descriptions in Web news, the time consumed in the topic detection process under different methods, and the quality of topic detection under different datasets are analyzed. The experimental analysis process shows that the Web topic detection idea proposed in this paper is feasible, verifiable, and superior, and can play an important role in reconfiguring the Web topic corpus, inferring the Web hierarchical big data propagation path, and providing the numerical intelligence warehouses based on network information detection.

Ključne riječi

big data; tuple semantic description; web topic detection

Hrčak ID:

342644

URI

https://hrcak.srce.hr/342644

Datum izdavanja:

31.12.2025.

Posjeta: 255 *