Clustering Gene Expression Data Through Modified Agglomerative M-CURE Hierarchical Algorithm

doi:10.32604/csse.2022.020634

ARTICLE

E. Kavitha¹,*, R. Tamilarasan², N. Poonguzhali³, M. K. Jayanthi Kannan⁴

1 A Constituent College of Anna University, University College of Engineering, Villupuram, 605103, India
2 A Constituent College of Anna University, University College of Engineering, Pattukkottai, 614701, India
3 Department of Computer Science and Engineering, Manakula Vinayagar Institute of Technology, Puducherry, 605107, India
4 Department of Computer Science Engineering, Faculty of Engineering and Technology, JAIN (Deemed-To-Be University), Bangalore, 562112, India

* Corresponding Author: E. Kavitha. Email: email

Computer Systems Science and Engineering 2022, 41(3), 1027-141. https://doi.org/10.32604/csse.2022.020634

Received 01 June 2021; Accepted 11 July 2021; Issue published 10 November 2021

Abstract

Gene expression is the process by which the information in a gene is used to synthesize a functional gene product. Genes mainly encode proteins, which in turn dictate the functionality of the cell. Clustering is typically the first step in a gene expression study, because biological networks are very complex and the sheer volume of genes makes the data harder to comprehend and interpret, all the more so since the data themselves exhibit vagueness, noise and imprecision. For a biological system to function, the essential cellular molecules must interact with their surroundings, including RNA, DNA, metabolites and proteins. Clustering methods help expose the structures and patterns in the original data so that further decisions can be made. Traditional clustering techniques include hierarchical, model-based, partitioning, density-based, grid-based and soft clustering methods. Although many of these methods produce reliable clusterings, they fail to accommodate very large volumes of gene expression data. There are also statistical issues in choosing the right method and the dissimilarity matrix when dealing with gene expression data. In this work we propose a modified clustering using representatives algorithm (M-CURE), which is more robust to outliers than K-means clustering and is also able to find clusters of varying sizes.

Keywords

Clustering; gene identifiers; representatives; dimension reduction

Copyright © 2022 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
