SMSTracker: A Self-Calibration Multi-Head Self-Attention Transformer for Visual Object Tracking

Zhongyang Wang, Hu Zhu, Feng Liu^*
School of Communications and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing, 210003, China
* Corresponding Author: Feng Liu.
(This article belongs to the Special Issue: Recognition Tasks with Transformers)

Computers, Materials & Continua https://doi.org/10.32604/cmc.2024.050959

Received 23 February 2024; Accepted 23 April 2024; Published online 08 July 2024

Abstract

Visual object tracking plays a crucial role in computer vision. In recent years, researchers have proposed various methods to achieve high-performance object tracking. Among these, Transformer-based methods have become a research hotspot because they model global context. However, current Transformer-based trackers still suffer from limited tracking accuracy and redundant feature information. In this paper, we introduce a self-calibration multi-head self-attention Transformer (SMSTracker) to address these challenges. SMSTracker combines hybrid tensor decomposition with a self-organizing multi-head self-attention Transformer mechanism, which compresses and accelerates Transformer operations and significantly reduces redundant data, thereby improving both the accuracy and the efficiency of tracking. Additionally, we introduce a self-calibration attention fusion block to resolve the attention ambiguities and inconsistencies common in traditional tracking methods, which stabilizes tracking performance across various scenarios. Experimental results show that SMSTracker achieves competitive performance in visual object tracking, demonstrating its potential to provide more robust and efficient tracking solutions in real-world applications.
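As a rough illustration of why factorizing attention weights can compress a Transformer, the sketch below counts the projection parameters of one multi-head self-attention layer before and after a low-rank factorization. It uses plain low-rank matrix factorization as a stand-in; it is not the hybrid tensor decomposition used in SMSTracker, whose details the abstract does not give. The function names and the `d_model` and `rank` values are illustrative assumptions.

```python
def dense_params(d_model: int) -> int:
    """Weights in the four d_model x d_model projections (Q, K, V, output)
    of one attention layer, ignoring biases."""
    return 4 * d_model * d_model


def low_rank_params(d_model: int, rank: int) -> int:
    """Same four projections, each replaced by the product of a
    d_model x rank factor and a rank x d_model factor (assumed scheme)."""
    return 4 * 2 * d_model * rank


def compression_ratio(d_model: int, rank: int) -> float:
    """How many times fewer weights the factorized layer stores."""
    return dense_params(d_model) / low_rank_params(d_model, rank)


if __name__ == "__main__":
    d_model, rank = 256, 32  # hypothetical sizes, not taken from the paper
    print(dense_params(d_model))             # 262144
    print(low_rank_params(d_model, rank))    # 65536
    print(compression_ratio(d_model, rank))  # 4.0
```

Under this assumed scheme the saving only appears when `rank` is below `d_model / 2`; at larger ranks the two factors together hold more weights than the dense matrix.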

Keywords

Visual object tracking; tensor decomposition; transformer; self-attention
