Open Access iconOpen Access

ARTICLE

crossmark

Multi-Feature Fusion-Guided Multiscale Bidirectional Attention Networks for Logistics Pallet Segmentation

by Weiwei Cai1,2, Yaping Song1, Huan Duan1, Zhenwei Xia1, Zhanguo Wei1,*

1 School of Logistics and Transportation, Central South University of Forestry and Technology, Changsha, 410004, China
2 Graduate School, Northern Arizona University, Flagstaff, AZ 86011, USA

* Corresponding Author: Zhanguo Wei. Email: email

(This article belongs to the Special Issue: Computer Modeling for Smart Cities Applications)

Computer Modeling in Engineering & Sciences 2022, 131(3), 1539-1555. https://doi.org/10.32604/cmes.2022.019785

Abstract

In the smart logistics industry, unmanned forklifts that intelligently identify logistics pallets can improve work efficiency in warehousing and transportation and are better than traditional manual forklifts driven by humans. Therefore, they play a critical role in smart warehousing, and semantics segmentation is an effective method to realize the intelligent identification of logistics pallets. However, most current recognition algorithms are ineffective due to the diverse types of pallets, their complex shapes, frequent blockades in production environments, and changing lighting conditions. This paper proposes a novel multi-feature fusion-guided multiscale bidirectional attention (MFMBA) neural network for logistics pallet segmentation. To better predict the foreground category (the pallet) and the background category (the cargo) of a pallet image, our approach extracts three types of features (grayscale, texture, and Hue, Saturation, Value features) and fuses them. The multiscale architecture deals with the problem that the size and shape of the pallet may appear different in the image in the actual, complex environment, which usually makes feature extraction difficult. Our study proposes a multiscale architecture that can extract additional semantic features. Also, since a traditional attention mechanism only assigns attention rights from a single direction, we designed a bidirectional attention mechanism that assigns cross-attention weights to each feature from two directions, horizontally and vertically, significantly improving segmentation. Finally, comparative experimental results show that the precision of the proposed algorithm is 0.53%–8.77% better than that of other methods we compared.

Keywords


Cite This Article

APA Style
Cai, W., Song, Y., Duan, H., Xia, Z., Wei, Z. (2022). Multi-feature fusion-guided multiscale bidirectional attention networks for logistics pallet segmentation. Computer Modeling in Engineering & Sciences, 131(3), 1539-1555. https://doi.org/10.32604/cmes.2022.019785
Vancouver Style
Cai W, Song Y, Duan H, Xia Z, Wei Z. Multi-feature fusion-guided multiscale bidirectional attention networks for logistics pallet segmentation. Comput Model Eng Sci. 2022;131(3):1539-1555 https://doi.org/10.32604/cmes.2022.019785
IEEE Style
W. Cai, Y. Song, H. Duan, Z. Xia, and Z. Wei, “Multi-Feature Fusion-Guided Multiscale Bidirectional Attention Networks for Logistics Pallet Segmentation,” Comput. Model. Eng. Sci., vol. 131, no. 3, pp. 1539-1555, 2022. https://doi.org/10.32604/cmes.2022.019785

Citations




cc Copyright © 2022 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 13251

    View

  • 4800

    Download

  • 4

    Like

Share Link