Home / Journals / CMC / Online First / doi:10.32604/cmc.2025.063205
Special Issues
Table of Content

Open Access

REVIEW

Research Progress on Multi-Modal Fusion Object Detection Algorithms for Autonomous Driving: A Review

Peicheng Shi1,*, Li Yang1, Xinlong Dong1, Heng Qi2, Aixi Yang3
1 School of Mechanical and Automotive Engineering, Anhui Polytechnic University, Wuhu, 241000, China
2 State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan, 430072, China
3 Polytechnic Institute, Zhejiang University, Hangzhou, 310015, China
* Corresponding Author: Peicheng Shi. Email: email
(This article belongs to the Special Issue: Advances in Object Detection: Methods and Applications)

Computers, Materials & Continua https://doi.org/10.32604/cmc.2025.063205

Received 08 January 2025; Accepted 11 March 2025; Published online 01 April 2025

Abstract

As the number and complexity of sensors in autonomous vehicles continue to rise, multimodal fusion-based object detection algorithms are increasingly being used to detect 3D environmental information, significantly advancing the development of perception technology in autonomous driving. To further promote the development of fusion algorithms and improve detection performance, this paper discusses the advantages and recent advancements of multimodal fusion-based object detection algorithms. Starting from single-modal sensor detection, the paper provides a detailed overview of typical sensors used in autonomous driving and introduces object detection methods based on images and point clouds. For image-based detection methods, they are categorized into monocular detection and binocular detection based on different input types. For point cloud-based detection methods, they are classified into projection-based, voxel-based, point cluster-based, pillar-based, and graph structure-based approaches based on the technical pathways for processing point cloud features. Additionally, multimodal fusion algorithms are divided into Camera-LiDAR fusion, Camera-Radar fusion, Camera-LiDAR-Radar fusion, and other sensor fusion methods based on the types of sensors involved. Furthermore, the paper identifies five key future research directions in this field, aiming to provide insights for researchers engaged in multimodal fusion-based object detection algorithms and to encourage broader attention to the research and application of multimodal fusion-based object detection.

Keywords

Multi-modal fusion; 3D object detection; deep learning; autonomous driving
  • 188

    View

  • 100

    Download

  • 0

    Like

Share Link