Open Access
ARTICLE
Visual Object Tracking via Cascaded RPN Fusion and Coordinate Attention
1 School of Computer and Communication Engineering, Changsha University of Science and Technology, Changsha, 410114, China
2 Hunan Provincial Key Laboratory of Intelligent Processing of Big Data on Transportation, Changsha University of Science and Technology, Changsha, 410114, China
* Corresponding Author: Jianming Zhang. Email:
(This article belongs to the Special Issue: Enabled and Human-centric Computational Intelligence Solutions for Visual Understanding and Application)
Computer Modeling in Engineering & Sciences 2022, 132(3), 909-927. https://doi.org/10.32604/cmes.2022.020471
Received 26 November 2021; Accepted 20 January 2022; Issue published 27 June 2022
Abstract
Recently, Siamese-based trackers have achieved excellent performance in object tracking. However, the high speed and deformation of objects during movement make tracking difficult. Therefore, we incorporate cascaded region-proposal-network (RPN) fusion and coordinate attention into Siamese trackers. The proposed network framework consists of three parts: a feature-extraction sub-network, a coordinate attention block, and a cascaded RPN block. We exploit the coordinate attention block, which can embed location information into channel attention, to establish long-range spatial location dependence while maintaining channel associations. Thus, the features of different layers are enhanced by the coordinate attention block. We then send these features separately into the cascaded RPN for classification and regression. From the two classification and regression results, the final position of the target is obtained. To verify the effectiveness of the proposed method, we conducted comprehensive experiments on the OTB100, VOT2016, UAV123, and GOT-10k datasets. Compared with other state-of-the-art trackers, the proposed tracker achieves good performance and runs at real-time speed.
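To make the coordinate attention idea concrete, the following PyTorch sketch shows a minimal coordinate attention block that pools features along each spatial direction and re-weights the input in both height and width. The layer names, reduction ratio, pooling choice, and activation are assumptions for illustration, not the authors' implementation.

```python
# Minimal sketch of a coordinate attention block (assumed structure, not the
# authors' exact implementation): directional pooling embeds positional
# information into channel attention.
import torch
import torch.nn as nn

class CoordinateAttention(nn.Module):
    def __init__(self, channels, reduction=32):
        super().__init__()
        mid = max(8, channels // reduction)
        # Shared 1x1 conv applied to the concatenated directional descriptors.
        self.conv1 = nn.Conv2d(channels, mid, kernel_size=1)
        self.bn1 = nn.BatchNorm2d(mid)
        self.act = nn.ReLU(inplace=True)
        # Separate 1x1 convs generate height-wise and width-wise attention maps.
        self.conv_h = nn.Conv2d(mid, channels, kernel_size=1)
        self.conv_w = nn.Conv2d(mid, channels, kernel_size=1)

    def forward(self, x):
        b, c, h, w = x.size()
        # Pool along each spatial direction to keep positional information.
        x_h = x.mean(dim=3, keepdim=True)                       # (b, c, h, 1)
        x_w = x.mean(dim=2, keepdim=True).permute(0, 1, 3, 2)   # (b, c, w, 1)
        y = self.act(self.bn1(self.conv1(torch.cat([x_h, x_w], dim=2))))
        y_h, y_w = torch.split(y, [h, w], dim=2)
        a_h = torch.sigmoid(self.conv_h(y_h))                        # (b, c, h, 1)
        a_w = torch.sigmoid(self.conv_w(y_w.permute(0, 1, 3, 2)))    # (b, c, 1, w)
        # Re-weight the input feature map in both spatial directions.
        return x * a_h * a_w
```

In a Siamese pipeline, a block like this would be applied to the backbone features of the template and search branches before they enter the RPN heads, enhancing features of different layers as described in the abstract.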
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.