Skip to the main content

Original scientific paper

https://doi.org/10.17559/TV-20200801053744

Text Adversarial Examples Generation and Defense Based on Reinforcement Learning

Yue Li* orcid id orcid.org/0000-0003-4539-4018 ; College of Computer Science and Technology, Donghua University, Shanghai 201600, China
Pengjian Xu ; College of Computer Science and Technology, Donghua University, Shanghai 201600, China
Qing Ruan ; College of Computer Science and Technology, Donghua University, Shanghai 201600, China
Wusheng Xu ; College of Computer Science and Technology, Donghua University, Shanghai 201600, China


Full text: english pdf 589 Kb

page 1306-1314

downloads: 565

cite


Abstract

In recent years, the neural networks are widely used in image processing, natural language processing and other fields. But there are new security issues-the adversarial examples. Crafted adversarial examples can make a trouble for the neural network, which leads to the mis-classification. Text classification is one of the basic tasks of the natural language processing. This paper is concerned about the generation and defense of text adversarial examples. The main contributions of this research are as follows: This paper explores a new type of adversarial example and applies reinforcement learning to generate the adversarial examples; a training set composed of adversarial examples is constructed. To build a more robust classifier, a new defense framework is established. In order to eliminate the influence of noise, well-designed predetector and reformer were implemented, which helps the neural networks to resist adversarial examples and reduce coupling.

Keywords

adversarial examples; defense; neural networks; text classification

Hrčak ID:

260854

URI

https://hrcak.srce.hr/260854

Publication date:

22.7.2021.

Visits: 1.209 *