Performances of K-Means Clustering Algorithm with Different Distance Metrics

Taher Ghazal; Muhammad Hussain; Raed Said; Afrozah Nadeem; Mohammad Hasan; Munir Ahmad; Muhammad Khan; Muhammad Naseem

doi:10.32604/iasc.2021.019067

Open Access icon Open Access

ARTICLE

Performances of K-Means Clustering Algorithm with Different Distance Metrics

Taher M. Ghazal^1,2, Muhammad Zahid Hussain³, Raed A. Said⁵, Afrozah Nadeem⁶, Mohammad Kamrul Hasan¹, Munir Ahmad⁷, Muhammad Adnan Khan^3,4,*, Muhammad Tahir Naseem³

1 Center for Cyber Security, Faculty of Information Science and Technology, Universiti Kebansaan Malaysia (UKM), 43600, Bangi, Selangor, Malaysia
2 School of Information Technology, Skyline University College, University City Sharjah, 1797, Sharjah, UAE
3 Riphah School of Computing & Innovation, Faculty of Computing, Riphah International University, Lahore Campus, Lahore, 54000, Pakistan
4 Pattern Recognition and Machine Learning Lab, Department of Software Engineering, Gachon University, Seongnam, 13557, South Korea
5 Canadian University Dubai, Dubai, UAE
6 Department of Computer Science, Lahore Garrison University, Lahore, 54000, Pakistan
7 School of Computer Science, National College of Business Administration & Economics, Lahore, 54000, Pakistan

* Corresponding Author: Muhammad Adnan Khan. Email: email

Intelligent Automation & Soft Computing 2021, 30(2), 735-742. https://doi.org/10.32604/iasc.2021.019067

Received 31 March 2021; Accepted 07 May 2021; Issue published 11 August 2021

Abstract

Clustering is the process of grouping the data based on their similar properties. Meanwhile, it is the categorization of a set of data into similar groups (clusters), and the elements in each cluster share similarities, where the similarity between elements in the same cluster must be smaller enough to the similarity between elements of different clusters. Hence, this similarity can be considered as a distance measure. One of the most popular clustering algorithms is K-means, where distance is measured between every point of the dataset and centroids of clusters to find similar data objects and assign them to the nearest cluster. Further, there are a series of distance metrics that can be applied to calculate point-to-point distances. In this research, the K-means clustering algorithm is evaluated with three different mathematical metrics in terms of execution time with different datasets and different numbers of clusters. The results indicate that the implementation of Manhattan distance measure metrics achieves the best results in most cases. These results also demonstrate that distance metrics can affect the execution time and the number of clusters created by the K-means algorithm.

Keywords

K-means clustering; distance metrics; Euclidean distance; Manhattan distance; Minkowski distance

Cite This Article

APA Style

Ghazal, T.M., Hussain, M.Z., Said, R.A., Nadeem, A., Hasan, M.K. et al. (2021). Performances of K-Means Clustering Algorithm with Different Distance Metrics. Intelligent Automation & Soft Computing, 30(2), 735–742. https://doi.org/10.32604/iasc.2021.019067

Vancouver Style

Ghazal TM, Hussain MZ, Said RA, Nadeem A, Hasan MK, Ahmad M, et al. Performances of K-Means Clustering Algorithm with Different Distance Metrics. Intell Automat Soft Comput. 2021;30(2):735–742. https://doi.org/10.32604/iasc.2021.019067

IEEE Style

T. M. Ghazal et al., “Performances of K-Means Clustering Algorithm with Different Distance Metrics,” Intell. Automat. Soft Comput., vol. 30, no. 2, pp. 735–742, 2021. https://doi.org/10.32604/iasc.2021.019067

BibTex EndNote RIS

Citations

4

[click to view]

Copyright © 2021 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Performances of K-Means Clustering Algorithm with Different Distance Metrics

Abstract

Keywords

Cite This Article

Citations

5659

3788

2

Related articles

Further Information

Guidelines

Follow Us

Join Us

Share Link