Open Access iconOpen Access

ARTICLE

Efficient Spatiotemporal Information Utilization for Video Camouflaged Object Detection

Dongdong Zhang, Chunping Wang, Huiying Wang, Qiang Fu*

Army Engineering University of PLA, Shijiazhuang, 050003, China

* Corresponding Author: Qiang Fu. Email: email

Computers, Materials & Continua 2025, 82(3), 4319-4338. https://doi.org/10.32604/cmc.2025.060653

Abstract

Video camouflaged object detection (VCOD) has become a fundamental task in computer vision that has attracted significant attention in recent years. Unlike image camouflaged object detection (ICOD), VCOD not only requires spatial cues but also needs motion cues. Thus, effectively utilizing spatiotemporal information is crucial for generating accurate segmentation results. Current VCOD methods, which typically focus on exploring motion representation, often ineffectively integrate spatial and motion features, leading to poor performance in diverse scenarios. To address these issues, we design a novel spatiotemporal network with an encoder-decoder structure. During the encoding stage, an adjacent space-time memory module (ASTM) is employed to extract high-level temporal features (i.e., motion cues) from the current frame and its adjacent frames. In the decoding stage, a selective space-time aggregation module is introduced to efficiently integrate spatial and temporal features. Additionally, a multi-feature fusion module is developed to progressively refine the rough prediction by utilizing the information provided by multiple types of features. Furthermore, we incorporate multi-task learning into the proposed network to obtain more accurate predictions. Experimental results show that the proposed method outperforms existing cutting-edge baselines on VCOD benchmarks.

Keywords


Cite This Article

APA Style
Zhang, D., Wang, C., Wang, H., Fu, Q. (2025). Efficient spatiotemporal information utilization for video camouflaged object detection. Computers, Materials & Continua, 82(3), 4319–4338. https://doi.org/10.32604/cmc.2025.060653
Vancouver Style
Zhang D, Wang C, Wang H, Fu Q. Efficient spatiotemporal information utilization for video camouflaged object detection. Comput Mater Contin. 2025;82(3):4319–4338. https://doi.org/10.32604/cmc.2025.060653
IEEE Style
D. Zhang, C. Wang, H. Wang, and Q. Fu, “Efficient Spatiotemporal Information Utilization for Video Camouflaged Object Detection,” Comput. Mater. Contin., vol. 82, no. 3, pp. 4319–4338, 2025. https://doi.org/10.32604/cmc.2025.060653



cc Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 265

    View

  • 97

    Download

  • 0

    Like

Share Link