Skip to the main content

Original scientific paper

https://doi.org/10.31341/jios.41.1.7

Detecting Source Code Plagiarism on .NET Programming Languages using Low-level Representation and Adaptive Local Alignment

Faqih Salban Rabbani orcid id orcid.org/0000-0003-3488-4599 ; Faculty of Information Technology, Maranatha Christian University, Indonesia
Oscar Karnalim orcid id orcid.org/0000-0003-4930-6249 ; Faculty of Information Technology, Maranatha Christian University, Indonesia


Full text: english pdf 1.543 Kb

page 105-123

downloads: 785

cite


Abstract

Even though there are various source code plagiarism detection approaches, only a few works which are focused on low-level representation for deducting similarity. Most of them are only focused on lexical token sequence extracted from source code. In our point of view, low-level representation is more beneficial than lexical token since its form is more compact than the source code itself. It only considers semantic-preserving instructions and ignores many source code delimiter tokens. This paper proposes a source code plagiarism detection which rely on low-level representation. For a case study, we focus our work on .NET programming languages with Common Intermediate Language as its low-level representation. In addition, we also incorporate Adaptive Local Alignment for detecting similarity. According to Lim et al, this algorithm outperforms code similarity state-of-the-art algorithm (i.e. Greedy String Tiling) in term of effectiveness. According to our evaluation which involves various plagiarism attacks, our approach is more effective and efficient when compared with standard lexical-token approach.

Keywords

source code plagiarism detection; source code similarity; low-level language; .NET programming language; adaptive local alignment

Hrčak ID:

183091

URI

https://hrcak.srce.hr/183091

Publication date:

16.6.2017.

Visits: 1.973 *