K<i>-</i>Hyperparameter Tuning in High-Dimensional Space Clustering: Solving Smooth Elbow Challenges Using an Ensemble Based Technique of a Self-Adapting Autoencoder and Internal Validation Indexes

Gikera, Rufus; Mwaura, Jonathan; Muuro, Elizaphan; Mambo, Shadrack

doi:10.32604/jai.2023.043229

Open Access icon Open Access

ARTICLE

K-Hyperparameter Tuning in High-Dimensional Space Clustering: Solving Smooth Elbow Challenges Using an Ensemble Based Technique of a Self-Adapting Autoencoder and Internal Validation Indexes

by Rufus Gikera^1,*, Jonathan Mwaura², Elizaphan Muuro³, Shadrack Mambo³

1 Department of Computer Science, Riara University, Nairobi, 00200, Kenya
2 Department of Computing, Khoury College of Computer Sciences, Boston, 02115, USA
3 Department of Engineering, Kenyatta University, Nairobi, 00200, Kenya

* Corresponding Author: Rufus Gikera. Email: email

Journal on Artificial Intelligence 2023, 5, 75-112. https://doi.org/10.32604/jai.2023.043229

Received 26 June 2023; Accepted 01 September 2023; Issue published 26 October 2023

Abstract

k-means is a popular clustering algorithm because of its simplicity and scalability to handle large datasets. However, one of its setbacks is the challenge of identifying the correct k-hyperparameter value. Tuning this value correctly is critical for building effective k-means models. The use of the traditional elbow method to help identify this value has a long-standing literature. However, when using this method with certain datasets, smooth curves may appear, making it challenging to identify the k-value due to its unclear nature. On the other hand, various internal validation indexes, which are proposed as a solution to this issue, may be inconsistent. Although various techniques for solving smooth elbow challenges exist, k-hyperparameter tuning in high-dimensional spaces still remains intractable and an open research issue. In this paper, we have first reviewed the existing techniques for solving smooth elbow challenges. The identified research gaps are then utilized in the development of the new technique. The new technique, referred to as the ensemble-based technique of a self-adapting autoencoder and internal validation indexes, is then validated in high-dimensional space clustering. The optimal k-value, tuned by this technique using a voting scheme, is a trade-off between the number of clusters visualized in the autoencoder’s latent space, k-value from the ensemble internal validation index score and one that generates a value of 0 or close to 0 on the derivative , at the elbow. Experimental results based on the Cochran’s Q test, ANOVA, and McNemar’s score indicate a relatively good performance of the newly developed technique in k-hyperparameter tuning.

Keywords

k-hyperparameter tuning; high-dimensional; smooth elbow

Cite This Article

APA Style

Gikera, R., Mwaura, J., Muuro, E., Mambo, S. (2023). K-hyperparameter tuning in high-dimensional space clustering: solving smooth elbow challenges using an ensemble based technique of a self-adapting autoencoder and internal validation indexes. Journal on Artificial Intelligence, 5(1), 75-112. https://doi.org/10.32604/jai.2023.043229

Vancouver Style

Gikera R, Mwaura J, Muuro E, Mambo S. K-hyperparameter tuning in high-dimensional space clustering: solving smooth elbow challenges using an ensemble based technique of a self-adapting autoencoder and internal validation indexes. J Artif Intell . 2023;5(1):75-112 https://doi.org/10.32604/jai.2023.043229

IEEE Style

R. Gikera, J. Mwaura, E. Muuro, and S. Mambo, “K-Hyperparameter Tuning in High-Dimensional Space Clustering: Solving Smooth Elbow Challenges Using an Ensemble Based Technique of a Self-Adapting Autoencoder and Internal Validation Indexes,” J. Artif. Intell. , vol. 5, no. 1, pp. 75-112, 2023. https://doi.org/10.32604/jai.2023.043229

BibTex EndNote RIS

Copyright © 2023 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

K-Hyperparameter Tuning in High-Dimensional Space Clustering: Solving Smooth Elbow Challenges Using an Ensemble Based Technique of a Self-Adapting Autoencoder and Internal Validation Indexes

Abstract

Keywords

Cite This Article

675

472

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Share Link