Dongdong Zhang, Chunping Wang, Huiying Wang, Qiang Fu*
CMC-Computers, Materials & Continua, Vol.82, No.3, pp. 4319-4338, 2025, DOI:10.32604/cmc.2025.060653
- 06 March 2025
Abstract: Video camouflaged object detection (VCOD) has become a fundamental task in computer vision and has attracted significant attention in recent years. Unlike image camouflaged object detection (ICOD), VCOD requires motion cues in addition to spatial cues, so effective use of spatiotemporal information is crucial for generating accurate segmentation results. Current VCOD methods typically focus on exploring motion representation and often integrate spatial and motion features ineffectively, leading to poor performance in diverse scenarios. To address these issues, we design a novel spatiotemporal network with an encoder-decoder structure. During the encoding stage, an adjacent space-time …