Advancing UAV Image Semantic Segmentation with an Improved Multiscale Diffusion Model

Wang, Wang; Zhou, Chen; He, Hua; Ma, Changsong

doi:10.17559/TV-20231023001051

Technical gazette, Vol. 31 No. 6, 2024.

Original scientific paper

https://doi.org/10.17559/TV-20231023001051

Advancing UAV Image Semantic Segmentation with an Improved Multiscale Diffusion Model

Wang Wang ; Geely University of China, Chengdu, Sichuan, 641423, China *
Chen Zhou ; Wuhan University, Wuhan, Hubei, 430072, China
Hua He ; Chongqing Technology and Business University, Chongqing, 400067, China
Changsong Ma ; Krirk University, Bangkok, 10220, Thailand

* Corresponding author.

Full text: english pdf 988 Kb

page 1859-1865

downloads: 372

cite

APA 6th Edition

Wang, W., Zhou, C., He, H. & Ma, C. (2024). Advancing UAV Image Semantic Segmentation with an Improved Multiscale Diffusion Model. Tehnički vjesnik, 31 (6), 1859-1865. https://doi.org/10.17559/TV-20231023001051

MLA 8th Edition

Wang, Wang, et al. "Advancing UAV Image Semantic Segmentation with an Improved Multiscale Diffusion Model." Tehnički vjesnik, vol. 31, no. 6, 2024, pp. 1859-1865. https://doi.org/10.17559/TV-20231023001051. Accessed 26 May 2026.

Chicago 17th Edition

Wang, Wang, Chen Zhou, Hua He and Changsong Ma. "Advancing UAV Image Semantic Segmentation with an Improved Multiscale Diffusion Model." Tehnički vjesnik 31, no. 6 (2024): 1859-1865. https://doi.org/10.17559/TV-20231023001051

Harvard

Wang, W., et al. (2024). 'Advancing UAV Image Semantic Segmentation with an Improved Multiscale Diffusion Model', Tehnički vjesnik, 31(6), pp. 1859-1865. https://doi.org/10.17559/TV-20231023001051

Vancouver

Wang W, Zhou C, He H, Ma C. Advancing UAV Image Semantic Segmentation with an Improved Multiscale Diffusion Model. Tehnički vjesnik [Internet]. 2024 [cited 2026 May 26];31(6):1859-1865. https://doi.org/10.17559/TV-20231023001051

IEEE

W. Wang, C. Zhou, H. He and C. Ma, "Advancing UAV Image Semantic Segmentation with an Improved Multiscale Diffusion Model", Tehnički vjesnik, vol.31, no. 6, pp. 1859-1865, 2024. [Online]. https://doi.org/10.17559/TV-20231023001051

Abstract

This study explores the challenges of image semantic segmentation in autonomous driving across varied campus environments. We introduce a specialized dataset consisting of 400 drone-captured images from different campuses. These images have been meticulously labelled into five categories: buildings, vegetation, ground, playgrounds, and lakes. These categories are essential for precise semantic segmentation tasks which are crucial to autonomous driving applications. To address the segmentation challenges presented by the unique and diverse features of campus environments, we propose an innovative algorithm. This algorithm is based on an enhanced diffusion model that is adept at handling multi-scale features inherent in campus environments. By incorporating scalable jump-connection layers in the denoising probability diffusion model, the proposed algorithm not only achieves superior accuracy but also demonstrates a significant improvement in recognition precision within the dataset, resulting in an average mIoU of 85%. The results underscore the algorithm's effectiveness and its potential as a robust solution for semantic segmentation tasks in autonomous driving within campus settings, paving the way for further research and application in real-world scenarios.

Keywords

image semantic segmentation; multiscale diffusion model style; reviewing; UVA

Hrčak ID:

321906

URI

https://hrcak.srce.hr/321906

Publication date:

31.10.2024.

Visits: 827 *

Login and registration

Technical gazette, Vol. 31 No. 6, 2024.

Abstract

Keywords

Hrčak ID:

URI

Publication date: