Internal Validity Index for Fuzzy Clustering Based on Relative Uncertainty

Refik Sirmen; Burak Üstündağ

doi:10.32604/cmc.2022.023947

Open Access icon Open Access

ARTICLE

Internal Validity Index for Fuzzy Clustering Based on Relative Uncertainty

Refik Tanju Sirmen^1,*, Burak Berk Üstündağ²

1 Graduate School of Science Engineering & Technology, Istanbul Technical University, Istanbul, 34469, Turkey
2 Faculty of Computer & Informatics, Istanbul Technical University, Istanbul, 34469, Turkey

* Corresponding Author: Refik Tanju Sirmen. Email: email

Computers, Materials & Continua 2022, 72(2), 2909-2926. https://doi.org/10.32604/cmc.2022.023947

Received 27 September 2021; Accepted 17 January 2022; Issue published 29 March 2022

Abstract

Unsupervised clustering and clustering validity are used as essential instruments of data analytics. Despite clustering being realized under uncertainty, validity indices do not deliver any quantitative evaluation of the uncertainties in the suggested partitionings. Also, validity measures may be biased towards the underlying clustering method. Moreover, neglecting a confidence requirement may result in over-partitioning. In the absence of an error estimate or a confidence parameter, probable clustering errors are forwarded to the later stages of the system. Whereas, having an uncertainty margin of the projected labeling can be very fruitful for many applications such as machine learning. Herein, the validity issue was approached through estimation of the uncertainty and a novel low complexity index proposed for fuzzy clustering. It involves only uni-dimensional membership weights, regardless of the data dimension, stipulates no specific distribution, and is independent of the underlying similarity measure. Inclusive tests and comparisons returned that it can reliably estimate the optimum number of partitions under different data distributions, besides behaving more robust to over partitioning. Also, in the comparative correlation analysis between true clustering error rates and some known internal validity indices, the suggested index exhibited the highest strong correlations. This relationship has been also proven stable through additional statistical acceptance tests. Thus the provided relative uncertainty measure can be used as a probable error estimate in the clustering as well. Besides, it is the only method known that can exclusively identify data points in dubiety and is adjustable according to the required confidence level.

Keywords

Machine learning; data science; clustering validity; fuzzy clustering; uncertainty; intelligent systems; data analytics

Cite This Article

APA Style

Sirmen, R.T., Üstündağ, B.B. (2022). Internal Validity Index for Fuzzy Clustering Based on Relative Uncertainty. Computers, Materials & Continua, 72(2), 2909–2926. https://doi.org/10.32604/cmc.2022.023947

Vancouver Style

Sirmen RT, Üstündağ BB. Internal Validity Index for Fuzzy Clustering Based on Relative Uncertainty. Comput Mater Contin. 2022;72(2):2909–2926. https://doi.org/10.32604/cmc.2022.023947

IEEE Style

R. T. Sirmen and B. B. Üstündağ, “Internal Validity Index for Fuzzy Clustering Based on Relative Uncertainty,” Comput. Mater. Contin., vol. 72, no. 2, pp. 2909–2926, 2022. https://doi.org/10.32604/cmc.2022.023947

BibTex EndNote RIS

Copyright © 2022 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Internal Validity Index for Fuzzy Clustering Based on Relative Uncertainty

Abstract

Keywords

Cite This Article

2147

1350

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link