Malicious Document Detection Based on GGE Visualization

Youhe Wang; Yi Sun; Yujie Li; Chuanqi Zhou

doi:10.32604/cmc.2024.057710

Open Access icon Open Access

ARTICLE

Malicious Document Detection Based on GGE Visualization

Youhe Wang, Yi Sun^*, Yujie Li, Chuanqi Zhou

Henan Province Key Laboratory of Information Security, Information Engineering University, Zhengzhou, 450000, China

* Corresponding Author: Yi Sun. Email: email

Computers, Materials & Continua 2025, 82(1), 1233-1254. https://doi.org/10.32604/cmc.2024.057710

Received 26 August 2024; Accepted 30 October 2024; Issue published 03 January 2025

Abstract

With the development of anti-virus technology, malicious documents have gradually become the main pathway of Advanced Persistent Threat (APT) attacks, therefore, the development of effective malicious document classifiers has become particularly urgent. Currently, detection methods based on document structure and behavioral features encounter challenges in feature engineering, these methods not only have limited accuracy, but also consume large resources, and usually can only detect documents in specific formats, which lacks versatility and adaptability. To address such problems, this paper proposes a novel malicious document detection method-visualizing documents as GGE images (Grayscale, Grayscale matrix, Entropy). The GGE method visualizes the original byte sequence of the malicious document as a grayscale image, the information entropy sequence of the document as an entropy image, and at the same time, the grayscale level co-occurrence matrix and the texture and spatial information stored in it are converted into grayscale matrix image, and fuses the three types of images to get the GGE color image. The Convolutional Block Attention Module-EfficientNet-B0 (CBAM-EfficientNet-B0) model is then used for classification, combining transfer learning and applying the pre-trained model on the ImageNet dataset to the feature extraction process of GGE images. As shown in the experimental results, the GGE method has superior performance compared with other methods, which is suitable for detecting malicious documents in different formats, and achieves an accuracy of 99.44% and 97.39% on Portable Document Format (PDF) and office datasets, respectively, and consumes less time during the detection process, which can be effectively applied to the task of detecting malicious documents in real-time.

Keywords

Malicious document; visualization; EfficientNet-B0; convolutional block attention module; GGE image

Cite This Article

APA Style

Wang, Y., Sun, Y., Li, Y., Zhou, C. (2025). Malicious Document Detection Based on GGE Visualization. Computers, Materials & Continua, 82(1), 1233–1254. https://doi.org/10.32604/cmc.2024.057710

Vancouver Style

Wang Y, Sun Y, Li Y, Zhou C. Malicious Document Detection Based on GGE Visualization. Comput Mater Contin. 2025;82(1):1233–1254. https://doi.org/10.32604/cmc.2024.057710

IEEE Style

Y. Wang, Y. Sun, Y. Li, and C. Zhou, “Malicious Document Detection Based on GGE Visualization,” Comput. Mater. Contin., vol. 82, no. 1, pp. 1233–1254, 2025. https://doi.org/10.32604/cmc.2024.057710

BibTex EndNote RIS

Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Malicious Document Detection Based on GGE Visualization

Abstract

Keywords

Cite This Article

972

700

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link