A novel data clustering algorithm based on gravity center methodology

Kuwil, Farag Hamed; Atila, Umit; Abu-Issa, Radwan; Murtagh, Fionn

A novel data clustering algorithm based on gravity center methodology

dc.authorid	kuwil, Farag/0000-0001-6630-3918
dc.authorid	ATILA, UMIT/0000-0002-1576-9977
dc.contributor.author	Kuwil, Farag Hamed
dc.contributor.author	Atila, Umit
dc.contributor.author	Abu-Issa, Radwan
dc.contributor.author	Murtagh, Fionn
dc.date.accessioned	2024-09-29T15:57:10Z
dc.date.available	2024-09-29T15:57:10Z
dc.date.issued	2020
dc.department	Karabük Üniversitesi	en_US
dc.description.abstract	The concept of clustering is to separate clusters based on the similarity which is greater within cluster than among clusters. The similarity consists of two principles, namely, connectivity and cohesion. However, in partitional clustering, while some algorithms such as K-means and K-medians divides the dataset points according to the first principle (connectivity) based on centroid clusters without any regard to the second principle (cohesion), some others like K-medoids partially consider cohesion in addition to connectivity. This prevents to discover clusters with convex shape and results are affected negatively by outliers. In this paper a new Gravity Center Clustering (GCC) algorithm is proposed which depends on critical distance (lambda) to define threshold among clusters. The algorithm falls under partition clustering and is based on gravity center which is a point within cluster that verifies both the connectivity and cohesion in determining the similarity of each point in the dataset. Therefore, the proposed algorithm deals with any shape of data better than K-means, K-medians and K-medoids. Furthermore, GCC algorithm does not need any parameters beforehand to perform clustering but can help user improving the control over clustering results and deal with overlapping and outliers providing two coefficients and an indicator. In this study, 22 experiments are conducted using different types of synthetic, and real healthcare datasets. The results show that the proposed algorithm satisfies the concept of clustering and provides great flexibility to get the optimal solution especially since clustering is considered as an optimization problem. (C) 2020 Elsevier Ltd. All rights reserved.	en_US
dc.description.sponsorship	Department of Computer Engineering at Karabuk University	en_US
dc.description.sponsorship	We would like to express our gratitude to the management of the Department of Computer Engineering at Karabuk University for supporting our research by providing the use of Big Data laboratory. Also special thanks to Hamed Atia and Ali Belal for their support.	en_US
dc.identifier.doi	10.1016/j.eswa.2020.113435
dc.identifier.issn	0957-4174
dc.identifier.issn	1873-6793
dc.identifier.scopus	2-s2.0-85084338374	en_US
dc.identifier.scopusquality	Q1	en_US
dc.identifier.uri	https://doi.org/10.1016/j.eswa.2020.113435
dc.identifier.uri	https://hdl.handle.net/20.500.14619/4629
dc.identifier.volume	156	en_US
dc.identifier.wos	WOS:000542130000002	en_US
dc.identifier.wosquality	Q1	en_US
dc.indekslendigikaynak	Web of Science	en_US
dc.indekslendigikaynak	Scopus	en_US
dc.language.iso	en	en_US
dc.publisher	Pergamon-Elsevier Science Ltd	en_US
dc.relation.ispartof	Expert Systems With Applications	en_US
dc.relation.publicationcategory	Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.subject	Algorithm	en_US
dc.subject	Cluster analysis	en_US
dc.subject	Euclidean distance	en_US
dc.subject	Gravity center	en_US
dc.subject	Partitional clustering	en_US
dc.title	A novel data clustering algorithm based on gravity center methodology	en_US
dc.type	Article	en_US

Koleksiyon

WoS İndeksli Yayınlar Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu

A novel data clustering algorithm based on gravity center methodology

Dosyalar

Koleksiyon