CROATIAN ADULT SPOKEN LANGUAGE CORPUS (HrAL)

Kuvač Kraljević, Jelena; Hržica, Gordana

FLUMINENSIA : časopis za filološka istraživanja, Vol. 28 No. 2, 2016.

Review article

Jelena Kuvač Kraljević orcid.org/0000-0003-1452-0851 ; Edukacijsko-rehabilitacijski fakultet, Zagreb
Gordana Hržica orcid.org/0000-0001-6067-9148 ; Edukacijsko-rehabilitacijski fakultet, Zagreb

Full text: English, PDF, 623 KB

pp. 87-102

Downloads: 808

Cite

APA 6th Edition

Kuvač Kraljević, J., & Hržica, G. (2016). CROATIAN ADULT SPOKEN LANGUAGE CORPUS (HrAL). FLUMINENSIA, 28 (2), 87-102. Retrieved from https://hrcak.srce.hr/174013

MLA 8th Edition

Kuvač Kraljević, Jelena, and Gordana Hržica. "CROATIAN ADULT SPOKEN LANGUAGE CORPUS (HrAL)." FLUMINENSIA, vol. 28, no. 2, 2016, pp. 87-102. https://hrcak.srce.hr/174013. Accessed 24 Apr. 2024.

Chicago 17th Edition

Kuvač Kraljević, Jelena, and Gordana Hržica. "CROATIAN ADULT SPOKEN LANGUAGE CORPUS (HrAL)." FLUMINENSIA 28, no. 2 (2016): 87-102. https://hrcak.srce.hr/174013

Harvard

Kuvač Kraljević, J., and Hržica, G. (2016). 'CROATIAN ADULT SPOKEN LANGUAGE CORPUS (HrAL)', FLUMINENSIA, 28(2), pp. 87-102. Available at: https://hrcak.srce.hr/174013 (Accessed: 24 April 2024)

Vancouver

Kuvač Kraljević J, Hržica G. CROATIAN ADULT SPOKEN LANGUAGE CORPUS (HrAL). FLUMINENSIA [Internet]. 2016 [cited 2024 Apr 24];28(2):87-102. Available from: https://hrcak.srce.hr/174013

IEEE

J. Kuvač Kraljević and G. Hržica, "CROATIAN ADULT SPOKEN LANGUAGE CORPUS (HrAL)", FLUMINENSIA, vol. 28, no. 2, pp. 87-102, 2016. [Online]. Available: https://hrcak.srce.hr/174013. [Accessed: 24 April 2024]

Abstract

Interest in spoken-language corpora has increased over the past two decades, leading to the development of new corpora and the discovery of new facets of spoken language. These corpora represent the most comprehensive source of data about the language of ordinary speakers. They are based on spontaneous, unscripted speech characterised by a variety of styles, registers and dialects.
The aim of this paper is to present the Croatian Adult Spoken Language Corpus (HrAL), its structure and its possible applications in different linguistic subfields. HrAL was built by sampling spontaneous conversations among 617 speakers from all Croatian counties, and it comprises more than 250,000 tokens and more than 100,000 types. Data were collected in three periods: from 2010 to 2012, from 2014 to 2015, and during 2016.
HrAL is now available in TalkBank, a large database of spoken-language corpora covering different languages (https://talkbank.org), in the Conversational Analyses corpora within the subsection titled Conversational Banks. Data were transcribed, coded and segmented using the Codes for Human Analysis of Transcripts (CHAT) transcription format and the Computerised Language Analysis (CLAN) suite of programmes within the TalkBank toolkit. Speech streams were segmented into communication units (C-units) based on syntactic criteria. Most transcripts were linked to their source audio recordings. TalkBank is free and open to the public, i.e. all data stored in it can be shared by the wider community in accordance with the basic rules of TalkBank.
HrAL provides information about spoken grammar and lexicon, discourse skills, error production and productivity in general. It may be useful for sociolinguistic research and studies of synchronic language changes in Croatian.

Keywords

Croatian Adult Spoken Language Corpus (HrAL); language sampling; spontaneous speech corpora

Hrčak ID:

174013

URI

https://hrcak.srce.hr/174013

Publication date:

26.1.2017.

Data in other languages: Croatian

Visits: 2,361
