When news sites “catch” the coronavirus: Development and comparative analysis of the 2019 and 2020 articles published on the Index.hr news portal

Bago, Petra

doi:10.20901/ms.13.25.2

Media studies, Vol. 13 No. 25, 2022.

Original scientific paper

https://doi.org/10.20901/ms.13.25.2

Petra Bago orcid.org/0000-0002-4994-6417 ; Faculty of Humanities and Social Sciences, University of Zagreb, Croatia

Full text: Croatian (PDF, 726 KB)

page 27-49

cite

APA 6th Edition

Bago, P. (2022). When news sites “catch” the coronavirus: Development and comparative analysis of the 2019 and 2020 articles published on the Index.hr news portal. Medijske studije, 13 (25), 27-49. https://doi.org/10.20901/ms.13.25.2

Abstract

This paper presents the methodology, tools and results of a comparative computational analysis of online newspaper articles: from collecting the documents and cleaning the language data in order to build specialized corpora of newspaper articles, to describing the tools used and the comparative statistical analysis of the corpora. The research was conducted on two specialized corpora developed specifically for this study, based on 500 newspaper articles from the “News” category of the Index.hr news portal. One corpus consists of articles published in the pre-pandemic year 2019, and the other of articles published in the pandemic year 2020. The analysis showed that the vocabulary of the pandemic corpus is significantly poorer than that of the pre-pandemic corpus, that less was written about the neighboring states of the Republic of Croatia in 2020 than in 2019, and that the pre-pandemic corpus mentions domestic cities more often than foreign ones, while the opposite holds for the pandemic corpus. Finally, we also investigated how well automatic term extraction identifies the specific topics covered in the two corpora.
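
As a rough illustration of the kind of comparison the abstract describes, the sketch below computes a type-token ratio (one common proxy for vocabulary richness) and counts mentions of a few place names in two toy texts. The abstract does not say which richness measure or which city lists the paper uses, so the measure, the function names (`tokenize`, `type_token_ratio`, `mention_counts`) and the sample sentences are all assumptions made up for this example, not the author's method or data.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and return runs of letters (Unicode-aware, digits excluded)."""
    return re.findall(r"[^\W\d_]+", text.lower())


def type_token_ratio(tokens):
    """Distinct word forms divided by total tokens; 0.0 for an empty token list."""
    return len(set(tokens)) / len(tokens) if tokens else 0.0


def mention_counts(tokens, names):
    """Count exact (case-insensitive) matches of each name among the tokens."""
    counts = Counter(tokens)
    return {name: counts[name.lower()] for name in names}


# Invented toy texts standing in for the two corpora (NOT data from the paper).
toy_2019 = "Zagreb and Split prepare for summer. Zagreb hosts a festival."
toy_2020 = "Wuhan reports cases. Zagreb reports cases. Bergamo reports cases."

for label, text in (("2019", toy_2019), ("2020", toy_2020)):
    tokens = tokenize(text)
    print(label, round(type_token_ratio(tokens), 3),
          mention_counts(tokens, ["Zagreb", "Wuhan"]))
```

Two caveats apply to a sketch like this. The type-token ratio falls as a text gets longer, so it is only comparable between corpora of similar size. Exact string matching also misses inflected forms, which matters for Croatian (for example "Zagrebu"), so a real analysis would count lemmas rather than surface forms.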

Keywords

statistical corpus analysis; specialized corpus; journal articles; Sketch Engine; Python; Index.hr

Hrčak ID:

281477

URI

https://hrcak.srce.hr/281477

Publication date:

4.8.2022.
