Open Access

ARTICLE

VPM-Net: Person Re-ID Network Based on Visual Prompt Technology and Multi-Instance Negative Pooling

Haitao Xie, Yuliang Chen, Yunjie Zeng, Lingyu Yan, Zhizhi Wang, Zhiwei Ye*

School of Computer Science, Hubei University of Technology, Wuhan, 430068, China

* Corresponding Author: Zhiwei Ye

(This article belongs to the Special Issue: Machine Vision Detection and Intelligent Recognition, 2nd Edition)

Computers, Materials & Continua 2025, 83(2), 3389-3410. https://doi.org/10.32604/cmc.2025.060783

Abstract

With the rapid development of intelligent video surveillance technology, pedestrian re-identification has become increasingly important in multi-camera surveillance systems, where it plays a critical role in enhancing public safety. However, traditional methods typically process images and text separately and apply upstream models directly to downstream tasks, which significantly increases training complexity and computational cost. Furthermore, the class imbalance common in existing training datasets limits further performance gains. To address these challenges, we propose a framework named Person Re-ID Network Based on Visual Prompt Technology and Multi-Instance Negative Pooling (VPM-Net). First, we incorporate the pre-trained Contrastive Language-Image Pre-training (CLIP) model to map visual and textual features into a unified embedding space, mitigating inconsistencies in data distribution and in the training process. To enhance adaptability and generalization, we introduce an efficient, task-specific Visual Prompt Tuning (VPT) technique, which improves the model’s relevance to specific tasks. Additionally, we design two key modules: the Knowledge-Aware Network (KAN) and the Multi-Instance Negative Pooling (MINP) module. The KAN module enhances the model’s understanding of complex scenarios through deep contextual semantic modeling. The MINP module handles samples, improving the model’s ability to distinguish fine-grained features. Experimental results across diverse datasets demonstrate the strong performance of VPM-Net, as well as its advantages and reliability in fine-grained retrieval tasks.
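
As a rough illustration of what matching in a unified embedding space involves, the sketch below ranks gallery embeddings against a query embedding by cosine similarity, the comparison CLIP-style models use between normalized image and text features. It is a minimal sketch under stated assumptions, not the VPM-Net implementation: the function names (`l2_normalize`, `cosine_similarity`, `rank_gallery`), the identity labels, and the three-dimensional toy vectors are all hypothetical, and a real system would obtain embeddings from trained encoders.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    a_n, b_n = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a_n, b_n))


def rank_gallery(query, gallery):
    """Return gallery ids ordered from most to least similar to the query."""
    scores = {pid: cosine_similarity(query, emb) for pid, emb in gallery.items()}
    return sorted(scores, key=scores.get, reverse=True)


# Toy embeddings (made up for illustration; assumed to live in one shared space).
text_query = [0.9, 0.1, 0.0]
gallery = {
    "person_a": [0.8, 0.2, 0.1],
    "person_b": [0.0, 1.0, 0.0],
    "person_c": [-0.5, 0.1, 0.9],
}

ranking = rank_gallery(text_query, gallery)
print(ranking)
```

Because both modalities are assumed to share one space, the same ranking function serves text-to-image and image-to-image queries; only the encoder that produces the query vector would differ.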

Keywords

Person re-identification; multi-instance negative pooling; visual prompt tuning

Cite This Article

APA Style
Xie, H., Chen, Y., Zeng, Y., Yan, L., Wang, Z. et al. (2025). VPM-Net: Person Re-ID Network Based on Visual Prompt Technology and Multi-Instance Negative Pooling. Computers, Materials & Continua, 83(2), 3389–3410. https://doi.org/10.32604/cmc.2025.060783
Vancouver Style
Xie H, Chen Y, Zeng Y, Yan L, Wang Z, Ye Z. VPM-Net: Person Re-ID Network Based on Visual Prompt Technology and Multi-Instance Negative Pooling. Comput Mater Contin. 2025;83(2):3389–3410. https://doi.org/10.32604/cmc.2025.060783
IEEE Style
H. Xie, Y. Chen, Y. Zeng, L. Yan, Z. Wang, and Z. Ye, “VPM-Net: Person Re-ID Network Based on Visual Prompt Technology and Multi-Instance Negative Pooling,” Comput. Mater. Contin., vol. 83, no. 2, pp. 3389–3410, 2025. https://doi.org/10.32604/cmc.2025.060783



Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.