Research Progress on Multi-Modal Fusion Object Detection Algorithms for Autonomous Driving: A Review

Peicheng Shi¹,*, Li Yang¹, Xinlong Dong¹, Heng Qi², Aixi Yang³
1 School of Mechanical and Automotive Engineering, Anhui Polytechnic University, Wuhu, 241000, China
2 State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan, 430072, China
3 Polytechnic Institute, Zhejiang University, Hangzhou, 310015, China
* Corresponding Author: Peicheng Shi. Email: email
(This article belongs to the Special Issue: Advances in Object Detection: Methods and Applications)

Computers, Materials & Continua https://doi.org/10.32604/cmc.2025.063205

Received 08 January 2025; Accepted 11 March 2025; Published online 01 April 2025

Abstract

As the number and complexity of sensors in autonomous vehicles continue to rise, multimodal fusion-based object detection algorithms are increasingly used to perceive the 3D environment, significantly advancing perception technology for autonomous driving. To further promote the development of fusion algorithms and improve detection performance, this paper discusses the advantages and recent advancements of multimodal fusion-based object detection algorithms. Starting from single-modal sensor detection, the paper provides a detailed overview of the typical sensors used in autonomous driving and introduces object detection methods based on images and point clouds. Image-based detection methods are categorized by input type into monocular and binocular detection. Point cloud-based detection methods are classified, according to how they process point cloud features, into projection-based, voxel-based, point cluster-based, pillar-based, and graph structure-based approaches. Multimodal fusion algorithms are divided, according to the sensors involved, into Camera-LiDAR fusion, Camera-Radar fusion, Camera-LiDAR-Radar fusion, and other sensor fusion methods. Finally, the paper identifies five key future research directions in this field, aiming to provide insights for researchers working on multimodal fusion-based object detection and to encourage broader attention to its research and application.

Keywords

Multi-modal fusion; 3D object detection; deep learning; autonomous driving


