Croatian Frequency Dictionary of Child Language

Hržica, Gordana; Kuvač Kraljević, Jelena; Šnajder, Jan

LAHOR : journal for Croatian as mother, second and foreign language, Vol. 2 No. 16, 2013.

Review article

Gordana Hržica orcid.org/0000-0001-6067-9148 ; University of Zagreb, Laboratory for Psycholinguistic Research - POLIN
Jelena Kuvač Kraljević orcid.org/0000-0003-1452-0851 ; University of Zagreb, Laboratory for Psycholinguistic Research - POLIN
Jan Šnajder orcid.org/0000-0001-8942-5301 ; University of Zagreb, Faculty of Electrical Engineering and Computing (FER)

Full text: Croatian, PDF, 201 KB

Pages 189-205

Citation

APA 6th Edition

Hržica, G., Kuvač Kraljević, J. & Šnajder, J. (2013). Croatian Frequency Dictionary of Child Language. Lahor, 2 (16), 189-205. Retrieved from https://hrcak.srce.hr/130044

MLA 8th Edition

Hržica, Gordana, et al. "Croatian Frequency Dictionary of Child Language." Lahor, vol. 2, no. 16, 2013, pp. 189-205. https://hrcak.srce.hr/130044. Accessed 25 Jul. 2026.

Chicago 17th Edition

Hržica, Gordana, Jelena Kuvač Kraljević and Jan Šnajder. "Croatian Frequency Dictionary of Child Language." Lahor 2, no. 16 (2013): 189-205. https://hrcak.srce.hr/130044

Harvard

Hržica, G., Kuvač Kraljević, J., and Šnajder, J. (2013). 'Croatian Frequency Dictionary of Child Language', Lahor, 2(16), pp. 189-205. Available at: https://hrcak.srce.hr/130044 (Accessed 25 July 2026)

Vancouver

Hržica G, Kuvač Kraljević J, Šnajder J. Croatian Frequency Dictionary of Child Language. Lahor [Internet]. 2013 [cited 2026 July 25];2(16):189-205. Available from: https://hrcak.srce.hr/130044

IEEE

G. Hržica, J. Kuvač Kraljević and J. Šnajder, "Croatian Frequency Dictionary of Child Language", Lahor, vol.2, no. 16, pp. 189-205, 2013. [Online]. Available: https://hrcak.srce.hr/130044. [Accessed: 25 July 2026]

Abstract

Language corpora are now recognised as valuable sources of linguistic information. However, retrieving the available data can be demanding and complex, and is therefore sometimes unsuitable for users who could benefit from it. The only existing Croatian corpus of spoken language is the Croatian Corpus of Child Language (CCCL; Kovačević, 2002). Speech samples were taken from three children at regular intervals, from the onset of speech to three years of age. The samples were transcribed according to the rules of CHAT, using the computer program CLAN. CCCL is available online in CHILDES (Child Language Data Exchange System). It is designed to provide data about lexical and grammatical development in language acquisition. A Croatian frequency dictionary of child language (CFDCL) has therefore been designed to enable easier data retrieval from CCCL. It allows analysis of the most frequent lemmas in all three sub-corpora according to frequency, alphabetical order, time of appearance, and part of speech. Furthermore, it preserves the morphological encoding of types, as well as the number of types and tokens. It therefore incorporates more information than traditional corpora of written language, enabling users to extract relevant information about child language development such as type/token ratio, lexical diversity, and morphological diversity.
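To make the measures named in the abstract concrete, the following is a minimal sketch of how lemma frequencies, a type/token ratio, and a simple count of word forms per lemma could be computed from lemmatized tokens. It does not reproduce the CFDCL's actual structure or the CHAT/CLAN tooling; the `SAMPLE` data, the tuple layout, and all function names are hypothetical and chosen only for illustration.

```python
from collections import Counter

# Hypothetical toy sample, not taken from CCCL:
# each entry is (word form, lemma, part-of-speech tag).
SAMPLE = [
    ("mama", "mama", "N"),
    ("ide", "ići", "V"),
    ("mame", "mama", "N"),
    ("idem", "ići", "V"),
    ("mama", "mama", "N"),
    ("lopta", "lopta", "N"),
]


def lemma_frequencies(tokens):
    """Count how often each lemma occurs, regardless of inflected form."""
    return Counter(lemma for _form, lemma, _pos in tokens)


def type_token_ratio(tokens):
    """Number of distinct word forms (types) divided by number of tokens."""
    forms = [form.lower() for form, _lemma, _pos in tokens]
    return len(set(forms)) / len(forms) if forms else 0.0


def forms_per_lemma(tokens):
    """Map each lemma to the set of distinct word forms attested for it,
    a rough proxy for morphological diversity."""
    attested = {}
    for form, lemma, _pos in tokens:
        attested.setdefault(lemma, set()).add(form.lower())
    return attested


if __name__ == "__main__":
    print(lemma_frequencies(SAMPLE).most_common(2))
    print(round(type_token_ratio(SAMPLE), 3))
    print(forms_per_lemma(SAMPLE)["mama"])
```

In this sketch, types are counted over word forms rather than lemmas; counting over lemmas instead would give a measure closer to lexical diversity, and which definition suits a given analysis is a design choice left to the user.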

Keywords

Croatian Corpus of Child Language; Croatian Frequency Dictionary of Child Language; CHILDES; lemmatization; corpus tagging; structure of CFDCL

Hrčak ID: 130044

URI: https://hrcak.srce.hr/130044

Publication date: 23.12.2013.

Article data in other languages: Croatian
