Original scientific paper
https://doi.org/10.22210/suvlin.2022.093.03
How we color the world with words
Kristina Kocijan
orcid.org/0000-0001-9467-5313
; Faculty of Humanities and Social Sciences, University of Zagreb
Abstract
Th is paper presents a computational approach to the automatic detection of language patterns,
specifi cally those dealing with expressing colors in the Croatian language. It investigates diff erent
lexicalization patterns of color terms, mainly compounds and multiword units, in order to classify
them and prepare them for usage in the design of an algorithm that will automatically recognize
and annotate these expressions in Croatian text. Th e paper also presents a comparative analysis of
diff erent classes of color terms found in a corpus built from books intended for younger (CLC) and
older (ALC) populations. Finally, the research data is presented through a dictionary of three types
of color terms categorized as multiword expressions
Keywords
color terms; lexicalization patterns; multiword expressions; natural language processing; digital humanities; Croatian; NooJ
Hrčak ID:
280907
URI
Publication date:
25.7.2022.
Visits: 1.429 *