Open Access iconOpen Access

ARTICLE

Leveraging Edge Optimize Vision Transformer for Monkeypox Lesion Diagnosis on Mobile Devices

Poonam Sharma1, Bhisham Sharma2,*, Dhirendra Prasad Yadav3, Surbhi Bhatia Khan4,5,6,*, Ahlam Almusharraf7

1 Chitkara University Institute of Engineering and Technology, Chitkara University, Rajpura, 140401, Punjab, India
2 Centre for Research Impact and Outcome, Chitkara University, Rajpura, 140401, Punjab, India
3 Department of Computer Engineering & Applications, G.L.A. University, Mathura, 281406, India
4 Department of Data Science, University of Salford, Manchester, M54WT, UK
5 University Centre for Research and Development, Chandigarh University, Mohali, 140413, Punjab, India
6 Division of Research and Development, Lovely Professional University, Phagwara, 144411, India
7 Department of Management, College of Business Administration, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh, 11671, Saudi Arabia

* Corresponding Authors: Bhisham Sharma. Email: email; Surbhi Bhatia Khan. Email: email

(This article belongs to the Special Issue: Medical Imaging Based Disease Diagnosis Using AI)

Computers, Materials & Continua 2025, 83(2), 3227-3245. https://doi.org/10.32604/cmc.2025.062376

Abstract

Rapid and precise diagnostic tools for Monkeypox (Mpox) lesions are crucial for effective treatment because their symptoms are similar to those of other pox-related illnesses, like smallpox and chickenpox. The morphological similarities between smallpox, chickenpox, and monkeypox, particularly in how they appear as rashes and skin lesions, which can sometimes make diagnosis challenging. Chickenpox lesions appear in many simultaneous phases and are more diffuse, often beginning on the trunk. In contrast, monkeypox lesions emerge progressively and are typically centralized on the face, palms, and soles. To provide accessible diagnostics, this study introduces a novel method for automated monkeypox lesion classification using the HMTNet (Hybrid Mobile Transformer Network). The convolutional layers and Vision Transformers (ViT) are combined to enhance the spatial features. In addition, we replace the classical MHSA (Multi-head self-attention) with the WMHSA (Window-based Multi-Head Self-Attention) to effectively capture long-range dependencies within image patches and depth-wise separable convolutions for local feature extraction. We trained and validated HMTNet on the two datasets for binary and multiclass classification. The model achieved 98.38% accuracy for multiclass classification using cross-validation and 99.25% accuracy for binary classification. These findings show that the model has the potential to be a useful diagnostic tool for monkeypox, especially in environments with limited resources.

Keywords

Monkeypox; disease; classification; local; global; transformer

Cite This Article

APA Style
Sharma, P., Sharma, B., Yadav, D.P., Khan, S.B., Almusharraf, A. (2025). Leveraging Edge Optimize Vision Transformer for Monkeypox Lesion Diagnosis on Mobile Devices. Computers, Materials & Continua, 83(2), 3227–3245. https://doi.org/10.32604/cmc.2025.062376
Vancouver Style
Sharma P, Sharma B, Yadav DP, Khan SB, Almusharraf A. Leveraging Edge Optimize Vision Transformer for Monkeypox Lesion Diagnosis on Mobile Devices. Comput Mater Contin. 2025;83(2):3227–3245. https://doi.org/10.32604/cmc.2025.062376
IEEE Style
P. Sharma, B. Sharma, D. P. Yadav, S. B. Khan, and A. Almusharraf, “Leveraging Edge Optimize Vision Transformer for Monkeypox Lesion Diagnosis on Mobile Devices,” Comput. Mater. Contin., vol. 83, no. 2, pp. 3227–3245, 2025. https://doi.org/10.32604/cmc.2025.062376



cc Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 203

    View

  • 67

    Download

  • 0

    Like

Share Link