Tehnički vjesnik, Vol. 31 No. 1, 2024.
Izvorni znanstveni članak
https://doi.org/10.17559/TV-20231023001052
A Text Recognition Algorithm Based on a Dual-Attention Mechanism in Complex Driving Environment
Ling Ding
; School of Computer Science, Hubei University of Education, Wuhan, China
Liyuan Wang
; CCCC Second Highway Consultants Co., Ltd, Wuhan, China
*
Yuanfang Wang
; School of Electronic Information, Wuhan University, Wuhan, China
Shaohuai Yu
; CCCC Second Highway Consultants Co., Ltd, Wuhan, China
Jinsheng Xiao
; School of Electronic Information, Wuhan University, Wuhan, China
* Dopisni autor.
Sažetak
In response to many problems such as complex background of text recognition environment, perspective distortion, shallow handwriting, and mixed Chinese and English characters, we have designed an OCR algorithm framework with features such as landmark extraction and correction, image enhancement, text detection, and text recognition. We have designed a DBNet based on dual attention mechanism and content-aware upsampling. We have also designed a text recognition module incorporating the central loss CRNN + CTC to improve content awareness. Experimental results show that the improved text detection network in this paper has increased accuracy by 5.09%, recall by 2.12%, and F-score by 3.46% on the ICDAR2015 dataset. The text recognition network has improved the accuracy of recognizing Chinese and English characters by 1.2%.
Ključne riječi
double attention mechanism; landmark extraction; text detection; text recognition
Hrčak ID:
312908
URI
Datum izdavanja:
31.12.2023.
Posjeta: 937 *