Using Imbalanced Triangle Synthetic Data for Machine Learning Anomaly Detection

Menghua Luo; Ke Wang; Zhiping Cai; Anfeng Liu; Yangyang Li; Chak Fong Cheang

doi:10.32604/cmc.2019.03708

Open Access

ARTICLE

Menghua Luo¹,², Ke Wang¹, Zhiping Cai¹,*, Anfeng Liu³, Yangyang Li⁴, Chak Fong Cheang⁵

1 College of Computer, National University of Defense Technology, Changsha, 410073, China.
2 Normal College of Jishou University, Jishou, 416000, China.
3 School of Information Science and Engineering, Central South University, Changsha, 410083, China.
4 Innovation Center, China Academy of Electronics and Information Technology, Beijing, 100041, China.
5 Faculty of Information Technology, Macau University of Science and Technology, 519020, Macau.

* Corresponding Author: Zhiping Cai.

Computers, Materials & Continua 2019, 58(1), 15-26. https://doi.org/10.32604/cmc.2019.03708

Abstract

Extreme data imbalance is the core issue in anomaly detection. Abnormal data are so scarce that they do not provide enough information for analysis. Mainstream methods focus on taking full advantage of the normal data: any sample that does not belong to the normal data distribution is judged to be an anomaly. From a data science perspective, we instead concentrate on the abnormal data and generate artificial abnormal samples with machine learning methods. Among such techniques, the Synthetic Minority Over-sampling Technique (SMOTE) and its improved variants are representative milestones; they generate synthetic examples at random along selected line segments. In our work, we break the line-segment limitation and propose an Imbalanced Triangle Synthetic Data method. In theory, our method covers a wider range. In experiments with real-world data, our method performs better than SMOTE and its improved variants.
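The contrast between sampling on a line segment and sampling in a triangle can be illustrated with a minimal sketch. The abstract does not say how the triangles are chosen, so the code below assumes, purely for illustration, that three sample points are already given; the helper names `segment_sample` and `triangle_sample` are hypothetical and are not taken from the paper.

```python
import numpy as np

def segment_sample(a, b, rng):
    """SMOTE-style interpolation: a random point on the segment from a to b."""
    t = rng.random()
    return a + t * (b - a)

def triangle_sample(a, b, c, rng):
    """Uniform random point inside triangle (a, b, c).

    Illustrative assumption only: the paper's actual triangle
    construction and sampling rule are not given in the abstract.
    """
    r1, r2 = rng.random(2)
    s = np.sqrt(r1)
    # Barycentric weights; they are non-negative and sum to 1.
    w = np.array([1 - s, s * (1 - r2), s * r2])
    return w[0] * a + w[1] * b + w[2] * c

rng = np.random.default_rng(0)
a = np.array([0.0, 0.0])
b = np.array([1.0, 0.0])
c = np.array([0.0, 1.0])

# Segment samples stay on the edge a-b; triangle samples fill the interior.
seg_points = np.array([segment_sample(a, b, rng) for _ in range(200)])
tri_points = np.array([triangle_sample(a, b, c, rng) for _ in range(200)])
```

With these three points, every segment sample has a second coordinate of zero, while the triangle samples spread over a two-dimensional region, which is the sense in which a triangle covers a wider range than a line segment.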

Keywords

Anomaly detection, imbalanced data, synthetic data, machine learning.

Cite This Article

APA Style

Luo, M., Wang, K., Cai, Z., Liu, A., Li, Y. et al. (2019). Using Imbalanced Triangle Synthetic Data for Machine Learning Anomaly Detection. Computers, Materials & Continua, 58(1), 15–26. https://doi.org/10.32604/cmc.2019.03708

Copyright © 2019 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
