ConvNeXt-UperNet-Based Deep Learning Model for Road Extraction from High-Resolution Remote Sensing Images

Jing Wang^1,2,*, Chen Zhang¹, Tianwen Lin¹
1 Geographic Information and Tourism College, Chuzhou University, Chuzhou, 239000, China
2 Anhui Province Key Laboratory of Physical Geographic Environment, Chuzhou University, Chuzhou, 239000, China
* Corresponding Author: Jing Wang. Email: email

Computers, Materials & Continua https://doi.org/10.32604/cmc.2024.052597

Received 08 April 2024; Accepted 12 June 2024; Published online 19 July 2024

Download PDF

Abstract

When existing deep learning models are used for road extraction tasks from high-resolution images, they are easily affected by noise factors such as tree and building occlusion and complex backgrounds, resulting in incomplete road extraction and low accuracy. We propose the introduction of spatial and channel attention modules to the convolutional neural network ConvNeXt. Then, ConvNeXt is used as the backbone network, which cooperates with the perceptual analysis network UPerNet, retains the detection head of the semantic segmentation, and builds a new model ConvNeXt-UPerNet to suppress noise interference. Training on the open-source DeepGlobe and CHN6-CUG datasets and introducing the DiceLoss on the basis of CrossEntropyLoss solves the problem of positive and negative sample imbalance. Experimental results show that the new network model can achieve the following performance on the DeepGlobe dataset: 79.40% for precision (Pre), 97.93% for accuracy (Acc), 69.28% for intersection over union (IoU), and 83.56% for mean intersection over union (MIoU). On the CHN6-CUG dataset, the model achieves the respective values of 78.17% for Pre, 97.63% for Acc, 65.4% for IoU, and 81.46% for MIoU. Compared with other network models, the fused ConvNeXt-UPerNet model can extract road information better when faced with the influence of noise contained in high-resolution remote sensing images. It also achieves multiscale image feature information with unified perception, ultimately improving the generalization ability of deep learning technology in extracting complex roads from high-resolution remote sensing images.

Keywords

Deep learning; semantic segmentation; remote sensing imagery; road extraction

Downloads
- Full-Text PDF
Citation Tools
- BibTex
- EndNote
- RIS

93

View
9

Download
0

Like

Improved VGG Model for Road Traffic Sign Recognition
Shuren Zhou, Wenlong Liang, Junguo...
Snow Cover Mapping for Mountainous Areas by Fusion of MODIS L1B and Geographic Data Based on Stacked Denoising Auto-Encoders
Xi Kan, Yonghong Zhang, Linglong...
Rare Bird Sparse Recognition via Part-Based Gist Feature Fusion and Regularized Intraclass Dictionary Learning
Jixin Liu, Ning Sun, Xiaofei Li,...
Real-Time Visual Tracking with Compact Shape and Color Feature
Zhenguo Gao, Shixiong Xia, Yikun...
Paragraph Vector Representation Based on Word to Vector and CNN Learning
Zeyu Xiong, Qiangqiang Shen, Yijie...

All issues

Online First

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

ConvNeXt-UperNet-Based Deep Learning Model for Road Extraction from High-Resolution Remote Sensing Images

Abstract

Keywords

93

9

0

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link