Skoči na glavni sadržaj

Izvorni znanstveni članak

https://doi.org/10.20532/cit.2026.1006291

A Lightweight Spatiotemporal Saliency Detection Framework for VR Panoramic Dynamic Scenes

Dezhi Kong ; Hebei University of Water Resources and Electric Engineering, Cang Zhou, Hebei, China *
Huijuan Hao ; Hebei University of Water Resources and Electric Engineering, Cang Zhou, Hebei, China
Bo Gao ; Hebei University of Water Resources and Electric Engineering, Cang Zhou, Hebei, China

* Dopisni autor.


Puni tekst: engleski pdf 1.537 Kb

str. 65-79

preuzimanja: 11

citiraj


Sažetak

Saliency detection in virtual reality (VR) panoramic dynamic scenes faces two major challenges: geometric distortion caused by equirectangular projection (ERP) and the high computational cost of modeling long-term temporal dependencies. To address these issues, we propose TAD-Net, a lightweight spatiotemporal saliency detection framework that integrates cubemap projection (CMP), temporal attention, knowledge distillation, and adversarial training. CMP efficiently reduces panoramic distortion while enabling standard 2D convolutional processing. A dual-stream network extracts spatial appearance and temporal motion features, and a temporal attention module enhances dynamic saliency discrimination. To reconcile the accuracy–latency trade-off, a heavy teacher model transfers long-range temporal knowledge to a lightweight student model via distillation, while adversarial training improves boundary sharpness. Extensive experiments on Salient360-Dynamic and VR-EyeDynamic demonstrate that TAD-Net achieves state-of-the-art performance, improving AUC-Judd by up to 5.2% while maintaining real-time inference at 35.1 FPS on an RTX 3080 GPU. Cross-dataset evaluation confirms robust generalization under domain shifts. The results indicate that the proposed projection–perception-distillation pipeline effectively balances geometric correction, temporal reasoning, and real-time constraints in VR applications.

Ključne riječi

VR panoramic images; dynamic scenes; saliency detection; spatiotemporal features; attention mechanism; lightweight optimization

Hrčak ID:

346537

URI

https://hrcak.srce.hr/346537

Datum izdavanja:

7.4.2026.

Posjeta: 30 *