Review article

https://doi.org/10.31820/f.37.1.4

The Corpus of Spoken Istrovenetian/Fiuman and Croatian (C-ORAL-IC)

Nada Poropat Jeletić (ORCID: orcid.org/0000-0001-8787-9748) ; Filozofski fakultet, Sveučilište Jurja Dobrile u Puli
Gordana Hržica ; Edukacijsko-rehabilitacijski fakultet, Sveučilište u Zagrebu
Eliana Moscarda Mirković ; Filozofski fakultet, Sveučilište Jurja Dobrile u Puli


Pages: 89-109

Abstract

Bilingual conversational corpora are invaluable for studying genuine contact phenomena in spontaneous bilingual speech. This paper presents the Corpus of Spoken Istrovenetian/Fiuman and Croatian (C-ORAL-IC), the first corpus documenting unscripted Istrovenetian and Fiuman dialects spoken among bilinguals in the Istrian and Kvarner areas of Croatia. The region has a long history of Croatian and Italian cultural and linguistic interaction, shaping a complex sociolinguistic system with diglossic and polyglossic relations. C-ORAL-IC includes data from 87 bilingual/multilingual speakers and features over 85,000 tokens and 27,000 types. Available on TalkBank (BilingBank subsection) [https://talkbank.org, https://biling.talkbank.org/access/C-ORAL-IC.html], it includes transcribed, phonologically adapted, coded, segmented and morphologically tagged recordings. Additional participant data on language history and usage are available. C-ORAL-IC provides a rich resource for exploring spontaneous bilingual speech, offering insights into conversational features, structure, and synchronic changes in Istrovenetian/Fiuman.

Keywords

language sampling; spoken speech corpora; codeswitching; bilingual speech

Hrčak ID:

334183

URI

https://hrcak.srce.hr/334183

Publication date:

31.7.2025.
