INFORMATION RETRIEVAL USING LATENT SEMANTIC INDEXING

Dobša, Jasminka

Journal of Information and Organizational Sciences, Vol. 26 No. 1-2, 2002.

Preliminary communication

INFORMATION RETRIEVAL USING LATENT SEMANTIC INDEXING

Jasminka Dobša orcid.org/0000-0002-1684-1010 ; Faculty of Organization and Informatics, University of Zagreb, Varaždin, Croatia

Full text: english pdf 5.049 Kb

page 13-23

downloads: 713

cite

APA 6th Edition

Dobša, J. (2002). INFORMATION RETRIEVAL USING LATENT SEMANTIC INDEXING. Journal of Information and Organizational Sciences, 26 (1-2), 13-23. Retrieved from https://hrcak.srce.hr/78428

MLA 8th Edition

Dobša, Jasminka. "INFORMATION RETRIEVAL USING LATENT SEMANTIC INDEXING." Journal of Information and Organizational Sciences, vol. 26, no. 1-2, 2002, pp. 13-23. https://hrcak.srce.hr/78428. Accessed 2 Jun. 2026.

Chicago 17th Edition

Dobša, Jasminka. "INFORMATION RETRIEVAL USING LATENT SEMANTIC INDEXING." Journal of Information and Organizational Sciences 26, no. 1-2 (2002): 13-23. https://hrcak.srce.hr/78428

Harvard

Dobša, J. (2002). 'INFORMATION RETRIEVAL USING LATENT SEMANTIC INDEXING', Journal of Information and Organizational Sciences, 26(1-2), pp. 13-23. Available at: https://hrcak.srce.hr/78428 (Accessed 02 June 2026)

Vancouver

Dobša J. INFORMATION RETRIEVAL USING LATENT SEMANTIC INDEXING. Journal of Information and Organizational Sciences [Internet]. 2002 [cited 2026 June 02];26(1-2):13-23. Available from: https://hrcak.srce.hr/78428

IEEE

J. Dobša, "INFORMATION RETRIEVAL USING LATENT SEMANTIC INDEXING", Journal of Information and Organizational Sciences, vol.26, no. 1-2, pp. 13-23, 2002. [Online]. Available: https://hrcak.srce.hr/78428. [Accessed: 02 June 2026]

Abstract

Our capabilities for collecting and storing data of all kinds are greater then ever. On the other side analyzing, summarizing and extracting information from this data is harder than ever. That’s why there is a growing need for the fast and efficient algorithms for information retrieval.In this paper we present some mathematical models based on linear algebra used to extract the relevant documents for some subjects out of a large set of text document. This is a typical problem of a search engine on the World Wide Web. We use vector space model, which is based on literal matching of terms in the documents and the queries. The vector space model is implemented by creating the term-document matrix. Literal matching of terms does not necessarily retrieve all relevant documents. Synonymy (multiple words having the same meaning) and polysemy (words having multiple meaning) are two major obstacles for efficient information retrieval. Latent Semantic Indexing represents documents by approximations and tends to cluster documents on similar topics even if their term profiles are somewhat different. This approximate representation is accomplished using a low-rank singular value decomposition (SVD) approximation of the term-document matrix. In this paper we compare the precision of information retrieval for different ranks of SVD representation of term-document matrix.

Keywords

information retrieval; singular value decomposition; vector space model; lowrank approximation; latent semantic indexing

Hrčak ID:

78428

URI

https://hrcak.srce.hr/78428

Publication date:

13.12.2002.

Visits: 1.677 *

Login and registration

Journal of Information and Organizational Sciences, Vol. 26 No. 1-2, 2002.

Abstract

Keywords

Hrčak ID:

URI

Publication date: