Skip to the main content

Original scientific paper

https://doi.org/10.22210/suvlin.2022.093.03

How we color the world with words

Kristina Kocijan orcid id orcid.org/0000-0001-9467-5313 ; Faculty of Humanities and Social Sciences, University of Zagreb


Full text: english pdf 1.536 Kb

page 41-83

downloads: 546

cite


Abstract

Th is paper presents a computational approach to the automatic detection of language patterns,
specifi cally those dealing with expressing colors in the Croatian language. It investigates diff erent
lexicalization patterns of color terms, mainly compounds and multiword units, in order to classify
them and prepare them for usage in the design of an algorithm that will automatically recognize
and annotate these expressions in Croatian text. Th e paper also presents a comparative analysis of
diff erent classes of color terms found in a corpus built from books intended for younger (CLC) and
older (ALC) populations. Finally, the research data is presented through a dictionary of three types
of color terms categorized as multiword expressions

Keywords

color terms; lexicalization patterns; multiword expressions; natural language processing; digital humanities; Croatian; NooJ

Hrčak ID:

280907

URI

https://hrcak.srce.hr/280907

Publication date:

25.7.2022.

Article data in other languages: croatian

Visits: 1.429 *