Open Access iconOpen Access

ARTICLE

crossmark

SMSTracker: A Self-Calibration Multi-Head Self-Attention Transformer for Visual Object Tracking

by Zhongyang Wang, Hu Zhu, Feng Liu*

School of Communications and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing, 210003, China

* Corresponding Author: Feng Liu. Email: email

(This article belongs to the Special Issue: Recognition Tasks with Transformers)

Computers, Materials & Continua 2024, 80(1), 605-623. https://doi.org/10.32604/cmc.2024.050959

Abstract

Visual object tracking plays a crucial role in computer vision. In recent years, researchers have proposed various methods to achieve high-performance object tracking. Among these, methods based on Transformers have become a research hotspot due to their ability to globally model and contextualize information. However, current Transformer-based object tracking methods still face challenges such as low tracking accuracy and the presence of redundant feature information. In this paper, we introduce self-calibration multi-head self-attention Transformer (SMSTracker) as a solution to these challenges. It employs a hybrid tensor decomposition self-organizing multi-head self-attention transformer mechanism, which not only compresses and accelerates Transformer operations but also significantly reduces redundant data, thereby enhancing the accuracy and efficiency of tracking. Additionally, we introduce a self-calibration attention fusion block to resolve common issues of attention ambiguities and inconsistencies found in traditional tracking methods, ensuring the stability and reliability of tracking performance across various scenarios. By integrating a hybrid tensor decomposition approach with a self-organizing multi-head self-attentive transformer mechanism, SMSTracker enhances the efficiency and accuracy of the tracking process. Experimental results show that SMSTracker achieves competitive performance in visual object tracking, promising more robust and efficient tracking systems, demonstrating its potential to provide more robust and efficient tracking solutions in real-world applications.

Keywords


Cite This Article

APA Style
Wang, Z., Zhu, H., Liu, F. (2024). Smstracker: A self-calibration multi-head self-attention transformer for visual object tracking. Computers, Materials & Continua, 80(1), 605-623. https://doi.org/10.32604/cmc.2024.050959
Vancouver Style
Wang Z, Zhu H, Liu F. Smstracker: A self-calibration multi-head self-attention transformer for visual object tracking. Comput Mater Contin. 2024;80(1):605-623 https://doi.org/10.32604/cmc.2024.050959
IEEE Style
Z. Wang, H. Zhu, and F. Liu, “SMSTracker: A Self-Calibration Multi-Head Self-Attention Transformer for Visual Object Tracking,” Comput. Mater. Contin., vol. 80, no. 1, pp. 605-623, 2024. https://doi.org/10.32604/cmc.2024.050959



cc Copyright © 2024 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 435

    View

  • 202

    Download

  • 0

    Like

Share Link