Skip to the main content

Original scientific paper

https://doi.org/10.17559/TV-20231023001052

A Text Recognition Algorithm Based on a Dual-Attention Mechanism in Complex Driving Environment

Ling Ding ; School of Computer Science, Hubei University of Education, Wuhan, China
Liyuan Wang ; CCCC Second Highway Consultants Co., Ltd, Wuhan, China *
Yuanfang Wang ; School of Electronic Information, Wuhan University, Wuhan, China
Shaohuai Yu ; CCCC Second Highway Consultants Co., Ltd, Wuhan, China
Jinsheng Xiao ; School of Electronic Information, Wuhan University, Wuhan, China

* Corresponding author.


Full text: english pdf 2.123 Kb

page 247-253

downloads: 156

cite


Abstract

In response to many problems such as complex background of text recognition environment, perspective distortion, shallow handwriting, and mixed Chinese and English characters, we have designed an OCR algorithm framework with features such as landmark extraction and correction, image enhancement, text detection, and text recognition. We have designed a DBNet based on dual attention mechanism and content-aware upsampling. We have also designed a text recognition module incorporating the central loss CRNN + CTC to improve content awareness. Experimental results show that the improved text detection network in this paper has increased accuracy by 5.09%, recall by 2.12%, and F-score by 3.46% on the ICDAR2015 dataset. The text recognition network has improved the accuracy of recognizing Chinese and English characters by 1.2%.

Keywords

double attention mechanism; landmark extraction; text detection; text recognition

Hrčak ID:

312908

URI

https://hrcak.srce.hr/312908

Publication date:

31.12.2023.

Visits: 358 *