Skip to the main content

Original scientific paper

https://doi.org/10.31724/rihjj.49.1.6

Building an Owl-Ontology for Representing, Linking and Querying SemAF Discourse Annotations

Christian Chiarcos orcid id orcid.org/0000-0002-4428-029X ; University of Augsburg, Augsburg
Purificação Silvano orcid id orcid.org/0000-0001-8057-5338 ; Faculty of Arts and Humanities of the University of Porto, Porto
Mariana Damova ; Mozaika, Ltd., Sofia *
Giedre Valunaite Oleškeviciene orcid id orcid.org/0000-0001-5688-2469 ; Mykolas Romeris University, Vilnius
Chaya Liebeskind ; Jerusalem College of Technology, Jerusalem
Dimitar Trajanov orcid id orcid.org/0000-0002-3105-6010 ; Department for Information Systems and Network Technologies, Faculty of Computer Science and Engineering Ss. Cyril and Methodius University, Skopje
Ciprian-Octavian Truică ; Department of Information Technology, Uppsala University, Sweden, Uppsala
Elena-Simona Apostol ; Faculty of Automatic Control and Computers, University Politehnica of Bucharest, Bucharest
Anna Bączkowska orcid id orcid.org/0000-0002-0147-2718 ; Institute of English and American Studies, University of Gdansk, Gdansk

* Corresponding author.


Full text: english pdf 2.783 Kb

page 117-136

downloads: 510

cite


Abstract

Linguistic Linked Open Data (LLOD) are technologies that provide a powerful instrument for representing and interpreting language phenomena on a web-scale. The main objective of this paper is to demonstrate how LLOD technologies can be applied to represent and annotate a corpus composed of multiword discourse markers, and what the effects of this are. In particular, it is our aim to apply semantic web standards such as RDF and OWL for publishing and integrating data. We present a novel scheme for discourse annotation that combines ISO standards describing discourse relations and dialogue acts – ISO DR-Core (ISO 24617-8) and ISO-Dialogue Acts (ISO 24617-2) in 9 languages (cf. Silvano and Damova 2022; Silvano, et al. 2022). We develop an OWL ontology to formalize that scheme, provide a newly annotated dataset and link its RDF edition with the ontology. Consequently, we describe the conjoint querying of the ontology and the annotations by means of SPARQL, the standard query language for the web of data. The ultimate result is that we are able to perform queries over multiple, interlinked datasets with complex internal structure. This is a first, but essential step, in developing novel, powerful, and groundbreaking means for the corpus-based study of multilingual discourse, communication analysis, or attitudes discovery.

Keywords

LLOD, OWL ontology; RDF; discourse markers; ISO standard; annotations; parallel corpus

Hrčak ID:

308936

URI

https://hrcak.srce.hr/308936

Publication date:

16.10.2023.

Article data in other languages: croatian

Visits: 1.319 *