Multi-Modal Scene Matching Location Algorithm Based on M2Det

Jiwei Fan; Xiaogang Yang; Ruitao Lu; Qingge Li; Siyu Wang

doi:10.32604/cmc.2023.039582

Open Access icon Open Access

ARTICLE

Multi-Modal Scene Matching Location Algorithm Based on M2Det

Jiwei Fan, Xiaogang Yang^*, Ruitao Lu, Qingge Li, Siyu Wang

Department of Automation, PLA Rocket Force University of Engineering, Xi’an, 710025, China

* Corresponding Author: Xiaogang Yang. Email: email

Computers, Materials & Continua 2023, 77(1), 1031-1052. https://doi.org/10.32604/cmc.2023.039582

Received 06 February 2023; Accepted 14 August 2023; Issue published 31 October 2023

Abstract

In recent years, many visual positioning algorithms have been proposed based on computer vision and they have achieved good results. However, these algorithms have a single function, cannot perceive the environment, and have poor versatility, and there is a certain mismatch phenomenon, which affects the positioning accuracy. Therefore, this paper proposes a location algorithm that combines a target recognition algorithm with a depth feature matching algorithm to solve the problem of unmanned aerial vehicle (UAV) environment perception and multi-modal image-matching fusion location. This algorithm was based on the single-shot object detector based on multi-level feature pyramid network (M2Det) algorithm and replaced the original visual geometry group (VGG) feature extraction network with the ResNet-101 network to improve the feature extraction capability of the network model. By introducing a depth feature matching algorithm, the algorithm shares neural network weights and realizes the design of UAV target recognition and a multi-modal image-matching fusion positioning algorithm. When the reference image and the real-time image were mismatched, the dynamic adaptive proportional constraint and the random sample consensus consistency algorithm (DAPC-RANSAC) were used to optimize the matching results to improve the correct matching efficiency of the target. Using the multi-modal registration data set, the proposed algorithm was compared and analyzed to verify its superiority and feasibility. The results show that the algorithm proposed in this paper can effectively deal with the matching between multi-modal images (visible image–infrared image, infrared image–satellite image, visible image–satellite image), and the contrast, scale, brightness, ambiguity deformation, and other changes had good stability and robustness. Finally, the effectiveness and practicability of the algorithm proposed in this paper were verified in an aerial test scene of an S1000 six-rotor UAV.

Keywords

Visual positioning; multi-modal scene matching; unmanned aerial vehicle

Cite This Article

APA Style

Fan, J., Yang, X., Lu, R., Li, Q., Wang, S. (2023). Multi-modal scene matching location algorithm based on m2det. Computers, Materials & Continua, 77(1), 1031–1052. https://doi.org/10.32604/cmc.2023.039582

Vancouver Style

Fan J, Yang X, Lu R, Li Q, Wang S. Multi-modal scene matching location algorithm based on m2det. Comput Mater Contin. 2023;77(1):1031–1052. https://doi.org/10.32604/cmc.2023.039582

IEEE Style

J. Fan, X. Yang, R. Lu, Q. Li, and S. Wang, “Multi-Modal Scene Matching Location Algorithm Based on M2Det,” Comput. Mater. Contin., vol. 77, no. 1, pp. 1031–1052, 2023. https://doi.org/10.32604/cmc.2023.039582

BibTex EndNote RIS

Copyright © 2023 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Multi-Modal Scene Matching Location Algorithm Based on M2Det

Abstract

Keywords

Cite This Article

739

458

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Share Link