Dan Xu*, Jiale Ru, Jinlong Shi
CMC-Computers, Materials & Continua, Vol.78, No.1, pp. 85-104, 2024, DOI:10.32604/cmc.2023.045258
- 30 January 2024
Abstract Video salient object detection (VSOD) aims at locating the most attractive objects in a video by exploring the spatial and temporal features. VSOD poses a challenging task in computer vision, as it involves processing complex spatial data that is also influenced by temporal dynamics. Despite the progress made in existing VSOD models, they still struggle in scenes of great background diversity within and between frames. Additionally, they encounter difficulties related to accumulated noise and high time consumption during the extraction of temporal features over a long-term duration. We propose a multi-stream temporal enhanced network (MSTENet)… More >