Open Access
ARTICLE
DTHN: Dual-Transformer Head End-to-End Person Search Network
School of Information Engineering, Shanghai Maritime University, Shanghai, 201306, China
* Corresponding Author: Cheng Feng. Email:
Computers, Materials & Continua 2023, 77(1), 245-261. https://doi.org/10.32604/cmc.2023.042765
Received 11 June 2023; Accepted 13 September 2023; Issue published 31 October 2023
Abstract
Person search mainly consists of two sub-tasks, namely person detection and person re-identification (re-ID). Existing approaches are primarily built on Faster R-CNN with a Convolutional Neural Network (CNN) backbone (e.g., ResNet). While these structures can detect high-quality bounding boxes, they tend to degrade re-ID performance. To address this issue, this paper proposes a Dual-Transformer Head Network (DTHN) for end-to-end person search, which contains two independent Transformer heads: a box head that detects bounding boxes and extracts efficient bounding-box features, and a re-ID head that captures high-quality re-ID features for the re-ID task. Specifically, after an image passes through the ResNet backbone to extract features, the Region Proposal Network (RPN) proposes candidate bounding boxes. The box head then extracts more efficient features within these boxes for detection. The re-ID head subsequently computes the occluded attention of the features in these boxes and distinguishes each person from other persons and the background. Extensive experiments on two widely used benchmark datasets, CUHK-SYSU and PRW, show that the method achieves state-of-the-art performance: 94.9 mAP and 95.3 top-1 on CUHK-SYSU, and 51.6 mAP and 87.6 top-1 on PRW, demonstrating the advantages of this paper's approach. An efficiency comparison further shows that the method is efficient in both time and space.
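To make the described pipeline concrete, the following is a minimal PyTorch sketch of the forward pass outlined in the abstract: a ResNet backbone produces feature maps, RoI-aligned features for RPN proposals are fed to two independent Transformer heads, one for detection outputs and one for a normalized re-ID embedding. All module names, dimensions, and layer choices here are illustrative assumptions, not the authors' released code, and the paper's occluded-attention mechanism is replaced by plain self-attention for brevity.

```python
import torch
import torch.nn as nn
import torchvision


class TransformerHead(nn.Module):
    """Small Transformer encoder applied to RoI-aligned features (illustrative)."""
    def __init__(self, dim=256, depth=2, heads=8):
        super().__init__()
        layer = nn.TransformerEncoderLayer(d_model=dim, nhead=heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=depth)

    def forward(self, roi_feats):                       # (N, C, H, W) RoI features
        n, c, h, w = roi_feats.shape
        tokens = roi_feats.flatten(2).transpose(1, 2)   # (N, H*W, C) spatial tokens
        tokens = self.encoder(tokens)                   # self-attention over tokens
        return tokens.mean(dim=1)                       # (N, C) pooled descriptor


class DTHNSketch(nn.Module):
    """Hypothetical dual-head sketch: one head for detection, one for re-ID."""
    def __init__(self, dim=256, reid_dim=256, num_classes=2):
        super().__init__()
        resnet = torchvision.models.resnet50(weights=None)
        # Keep layers up to conv4 (stride 16, 1024 channels) as the shared backbone.
        self.backbone = nn.Sequential(*list(resnet.children())[:-3])
        self.reduce = nn.Conv2d(1024, dim, kernel_size=1)
        self.box_head = TransformerHead(dim)            # detection branch
        self.reid_head = TransformerHead(dim)           # re-ID branch
        self.cls_score = nn.Linear(dim, num_classes)
        self.bbox_delta = nn.Linear(dim, 4)
        self.reid_embed = nn.Linear(dim, reid_dim)

    def forward(self, images, proposals):
        # proposals: list of (K_i, 4) boxes per image, e.g., produced by an RPN (not shown).
        feats = self.reduce(self.backbone(images))
        rois = torchvision.ops.roi_align(
            feats, proposals, output_size=(14, 14), spatial_scale=1 / 16)
        det_feat = self.box_head(rois)                  # features for classification/regression
        cls_logits = self.cls_score(det_feat)
        box_deltas = self.bbox_delta(det_feat)
        reid_feat = nn.functional.normalize(
            self.reid_embed(self.reid_head(rois)), dim=1)  # L2-normalized re-ID embedding
        return cls_logits, box_deltas, reid_feat
```

The point of the sketch is the separation of concerns the abstract describes: the two heads share backbone and RoI features but are otherwise independent, so optimizing detection features does not directly interfere with the re-ID embedding.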
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.