Skoči na glavni sadržaj

Izvorni znanstveni članak

https://doi.org/10.20532/cit.2022.1005718

Attention Mechanism and Detection Box Information Based Real-time Multi-Object Vehicle Detection

Hao Wu ; School of Computer Science and Technology, Hefei Normal University, China
Wei Wu orcid id orcid.org/0009-0000-6219-4072 ; School of Economics and Trade, Anhui Business and Technology College, Hefei, China
Xiaoyan Sun ; School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
Jin Zhong ; School of Computer Science and Technology, Hefei Normal University, China
Fengyun Cao ; School of Computer Science and Technology, Hefei Normal University, China


Puni tekst: engleski pdf 2.147 Kb

str. 239-256

preuzimanja: 45

citiraj


Sažetak

Ensuring both the accuracy of vehicle target detection and meeting real-time requirements is crucial in traffic videos. The YOLOv5s target detection framework, known for its accuracy and efficiency, has attracted attention in academic circles. However, there are still some features that can be optimized. First of all, the detection subnet of the YOLOv5s framework cannot smoothly convert complex feature maps into relatively sparse target prediction boxes. To solve this, we integrate a self-attention-based gating mechanism into the detection subnet, forming the YOLOv5s-SAG network. Secondly, the loss function of CIoU used by YOLOv5s pays insufficient attention to the overlapping area of the detection frame, which can be used as metric for measuring target detection effectiveness. We add the loss term of area ratio to CIoU to further improve the modeling ability. Finally, the current multi-class Non-Maximum Suppression algorithm can cause high overlap of multi-class detection frames. To improve it, we propose a multi-class CS-NMS algorithm based on category suppression. Experimental results show an approximately 8% improvement in the mAP50 index on the UA-DETRAC dataset compared with YOLOv5s. The proposed algorithm also achieves better detection results compared to mainstream target detection algorithms and meets the real-time requirements of traffic video analysis.

Ključne riječi

YOLOv5s, AIoU Loss, multi-object detection, attention mechanism, CS-NMS

Hrčak ID:

313196

URI

https://hrcak.srce.hr/313196

Datum izdavanja:

29.11.2023.

Posjeta: 97 *