Open Access

ARTICLE

Multi-Modal Scene Matching Location Algorithm Based on M2Det

by Jiwei Fan, Xiaogang Yang*, Ruitao Lu, Qingge Li, Siyu Wang

Department of Automation, PLA Rocket Force University of Engineering, Xi’an, 710025, China

* Corresponding Author: Xiaogang Yang.

Computers, Materials & Continua 2023, 77(1), 1031-1052. https://doi.org/10.32604/cmc.2023.039582

Abstract

In recent years, many visual positioning algorithms based on computer vision have been proposed and have achieved good results. However, these algorithms serve a single function, cannot perceive the environment, generalize poorly, and produce some mismatches, which degrades positioning accuracy. This paper therefore proposes a location algorithm that combines target recognition with deep feature matching to address unmanned aerial vehicle (UAV) environment perception and multi-modal image-matching fusion location. The algorithm builds on the single-shot object detector based on a multi-level feature pyramid network (M2Det) and replaces the original visual geometry group (VGG) feature extraction network with ResNet-101 to improve the feature extraction capability of the model. The deep feature matching algorithm shares neural network weights with the detector, so that UAV target recognition and multi-modal image-matching fusion positioning are realized in one design. When the reference image and the real-time image are mismatched, a dynamic adaptive proportional constraint combined with the random sample consensus algorithm (DAPC-RANSAC) optimizes the matching results and improves the efficiency of correct target matching. The proposed algorithm is compared and analyzed on a multi-modal registration data set to verify its superiority and feasibility. The results show that it can effectively match multi-modal images (visible image–infrared image, infrared image–satellite image, visible image–satellite image) and remains stable and robust under changes in contrast, scale, brightness, blur, deformation, and other conditions. Finally, the effectiveness and practicability of the algorithm are verified in an aerial test scene with an S1000 six-rotor UAV.
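The mismatch-rejection step described in the abstract can be illustrated with a minimal sketch. The abstract does not specify how the dynamic adaptive proportional constraint is computed, so the code below substitutes a fixed nearest/second-nearest ratio test for it, and it uses a 2D translation as the RANSAC model purely for brevity. The function names, the `ratio` and `threshold` parameters, and the translation model are all assumptions for illustration, not the authors' implementation.

```python
import numpy as np


def ratio_test_matches(desc_a, desc_b, ratio=0.8):
    """Return (i, j) index pairs passing a fixed ratio test.

    Assumption: a fixed ratio stands in for the paper's dynamic
    adaptive proportional constraint, whose details are not given.
    """
    # Pairwise Euclidean distances between every descriptor pair.
    dists = np.linalg.norm(desc_a[:, None, :] - desc_b[None, :, :], axis=2)
    order = np.argsort(dists, axis=1)
    best, second = order[:, 0], order[:, 1]
    rows = np.arange(len(desc_a))
    # Keep a match only if it is clearly better than the runner-up.
    keep = dists[rows, best] < ratio * dists[rows, second]
    return np.stack([rows[keep], best[keep]], axis=1)


def ransac_translation(pts_a, pts_b, threshold=3.0, iterations=200, seed=0):
    """Estimate a translation t with pts_b ~ pts_a + t and flag inliers.

    Assumption: a pure translation is the simplest possible model; a
    real scene-matching pipeline would more likely fit a homography.
    """
    rng = np.random.default_rng(seed)
    best_inliers = np.zeros(len(pts_a), dtype=bool)
    for _ in range(iterations):
        # One correspondence is a minimal sample for a translation.
        k = rng.integers(len(pts_a))
        t = pts_b[k] - pts_a[k]
        residuals = np.linalg.norm(pts_a + t - pts_b, axis=1)
        inliers = residuals < threshold
        if inliers.sum() > best_inliers.sum():
            best_inliers = inliers
    # Refine the estimate using all inliers of the best hypothesis.
    t = (pts_b[best_inliers] - pts_a[best_inliers]).mean(axis=0)
    return t, best_inliers
```

In use, the index pairs returned by `ratio_test_matches` would select the keypoint coordinates passed to `ransac_translation`, and the resulting inlier mask would discard the remaining mismatches before the position is computed.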

Cite This Article

APA Style
Fan, J., Yang, X., Lu, R., Li, Q., & Wang, S. (2023). Multi-modal scene matching location algorithm based on M2Det. Computers, Materials & Continua, 77(1), 1031-1052. https://doi.org/10.32604/cmc.2023.039582
Vancouver Style
Fan J, Yang X, Lu R, Li Q, Wang S. Multi-modal scene matching location algorithm based on M2Det. Comput Mater Contin. 2023;77(1):1031-1052. https://doi.org/10.32604/cmc.2023.039582
IEEE Style
J. Fan, X. Yang, R. Lu, Q. Li, and S. Wang, “Multi-Modal Scene Matching Location Algorithm Based on M2Det,” Comput. Mater. Contin., vol. 77, no. 1, pp. 1031-1052, 2023. https://doi.org/10.32604/cmc.2023.039582



Copyright © 2023 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.