Skoči na glavni sadržaj

Izvorni znanstveni članak

https://doi.org/10.31341/jios.42.1.5

The Use of Support Vector Machines When Designing a User-Defined Niche Search Engine

Maria Jakovljevic ; School of Computing, University of South Africa, South Africa
Howard Sommerfeld ; Platform45, Technical Consultant, Johannesburg, South Africa
Alfred Coleman ; School of Computing, University of South Africa, South Africa


Puni tekst: engleski pdf 1.669 Kb

str. 87-109

preuzimanja: 461

citiraj


Sažetak

This study presents the construction of a niche search engine, whose search topic domain is to be user-defined. The specific focus of this study is the investigation of the role that a Support Vector Machine plays when classifying textual data from web pages. Furthermore, the aim is to establish whether this niche search engine can return results that are more relevant to a user than when compared to those returned by a commercial search engine Through the conduction of various experiments across a number of appropriate datasets, the suitability of the SVM to classify web pages has been proven to meet the needs of a niche search engine. A subset of the most useful webpage-specific features has been discovered, with the best performing feature being a web pages’ Text & Title component. The user defined niche search engine was successfully designed and an experiment showed that it returned more relevant results than a commercial search engine.

Ključne riječi

Support vector machine; search engine; text classification and processing; information retrieval

Hrčak ID:

202594

URI

https://hrcak.srce.hr/202594

Datum izdavanja:

26.6.2018.

Posjeta: 950 *