Samah Ibrahim Alshathri1,*, Desiree Juby Vincent2, V. S. Hari2
CMC-Computers, Materials & Continua, Vol.71, No.1, pp. 1371-1386, 2022, DOI:10.32604/cmc.2022.022458
- 03 November 2021
Abstract Invoice document digitization is crucial for efficient management in industries. The scanned invoice image is often noisy due to various reasons. This affects the OCR (optical character recognition) detection accuracy. In this paper, letter data obtained from images of invoices are denoised using a modified autoencoder based deep learning method. A stacked denoising autoencoder (SDAE) is implemented with two hidden layers each in encoder network and decoder network. In order to capture the most salient features of training samples, a undercomplete autoencoder is designed with non-linear encoder and decoder function. This autoencoder is regularized for… More >