
Original scientific paper
https://doi.org/10.2498/cit.2006.04.08

Comparison of Collocation Extraction Measures for Document Indexing

Bojana Dalbelo Basic
Mladen Kolar
Jan Snajder
Sasa Petrovic

Full text: English, PDF (211 KB), pp. 321-327, downloads: 569*
APA 6th Edition
Dalbelo Basic, B., Kolar, M., Snajder, J., & Petrovic, S. (2006). Comparison of Collocation Extraction Measures for Document Indexing. Journal of Computing and Information Technology, 14 (4), 321-327. https://doi.org/10.2498/cit.2006.04.08
MLA 8th Edition
Dalbelo Basic, Bojana, et al. "Comparison of Collocation Extraction Measures for Document Indexing." Journal of Computing and Information Technology, vol. 14, no. 4, 2006, pp. 321-327. https://doi.org/10.2498/cit.2006.04.08. Accessed 24 Feb. 2020.
Chicago 17th Edition
Dalbelo Basic, Bojana, Mladen Kolar, Jan Snajder, and Sasa Petrovic. "Comparison of Collocation Extraction Measures for Document Indexing." Journal of Computing and Information Technology 14, no. 4 (2006): 321-327. https://doi.org/10.2498/cit.2006.04.08
Harvard
Dalbelo Basic, B., et al. (2006). 'Comparison of Collocation Extraction Measures for Document Indexing', Journal of Computing and Information Technology, 14(4), pp. 321-327. https://doi.org/10.2498/cit.2006.04.08
Vancouver
Dalbelo Basic B, Kolar M, Snajder J, Petrovic S. Comparison of Collocation Extraction Measures for Document Indexing. Journal of Computing and Information Technology [Internet]. 2006 [cited 2020 Feb 24];14(4):321-327. https://doi.org/10.2498/cit.2006.04.08
IEEE
B. Dalbelo Basic, M. Kolar, J. Snajder, and S. Petrovic, "Comparison of Collocation Extraction Measures for Document Indexing", Journal of Computing and Information Technology, vol. 14, no. 4, pp. 321-327, 2006. [Online]. https://doi.org/10.2498/cit.2006.04.08

Abstract
Automatic extraction of collocations from a corpus is a well-known problem in natural language processing. It is typically carried out using a statistical measure that indicates whether two words occur together more often than would be expected by chance. Because many such measures have been proposed by various authors, we compare several of them on the task of extracting collocations from a corpus of Croatian legal documents for the purpose of document indexing. We also propose and evaluate extensions of these measures to collocations consisting of three words.
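
To illustrate what a measure of this kind computes, the following minimal sketch scores word pairs by pointwise mutual information (PMI), a commonly used association measure. The abstract does not name the measures that were compared, so the choice of PMI, the function name pmi_scores, and the toy sentence are assumptions made for illustration only. The three-word case shown here simply divides the joint probability by the product of the three single-word probabilities; it is one naive generalization and is not claimed to be the extension the authors propose.

```python
import math
from collections import Counter


def pmi_scores(tokens, n=2):
    """Score each n-gram by log2(P(w1..wn) / (P(w1) * ... * P(wn))).

    A positive score means the words co-occur more often than they would
    if they were independent. Hypothetical helper, not the paper's code.
    """
    unigrams = Counter(tokens)
    # Build n-grams by zipping n shifted views of the token list.
    ngrams = Counter(zip(*(tokens[i:] for i in range(n))))
    total_words = len(tokens)
    total_ngrams = sum(ngrams.values())

    scores = {}
    for gram, count in ngrams.items():
        p_joint = count / total_ngrams
        p_independent = math.prod(unigrams[w] / total_words for w in gram)
        scores[gram] = math.log2(p_joint / p_independent)
    return scores


# Toy input (assumed); a real corpus would be far larger.
tokens = "the supreme court ruled that the supreme court may review the ruling".split()
bigram_scores = pmi_scores(tokens, n=2)
trigram_scores = pmi_scores(tokens, n=3)

# Rank candidate two-word collocations from strongest to weakest association.
ranked_bigrams = sorted(bigram_scores, key=bigram_scores.get, reverse=True)
```

On a corpus this small, PMI strongly favors pairs of words that each occur only once, which is a known weakness of the measure on rare events; practical use typically applies a minimum frequency threshold before ranking.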

Hrčak ID: 44648

URI
https://hrcak.srce.hr/44648

Visits: 724*