A Novel Grid-Based Clustering Algorithm

Data clustering is an important method used to discover naturally occurring structures in datasets. One of the most popular approaches is the grid-based concept of clustering algorithms. This kind of method is characterized by a fast processing time and it can also discover clusters of arbitrary shapes in datasets. These properties allow these methods to be used in many different applications. Researchers have created many versions of the clustering method using the grid-based approach. However, the key issue is the right choice of the number of grid cells. This paper proposes a novel grid-based algorithm which uses a method for an automatic determining of the number of grid cells. This method is based on the k_dist function which computes the distance between each element of a dataset and its kth nearest neighbor. Experimental results have been obtained for several different datasets and they confirm a very good performance of the newly proposed method.

eISSN:: 2449-6499
Language:: English

Publication timeframe:: 4 times per year
Journal Subjects:: Computer Sciences, Databases and Data Mining, Artificial Intelligence

Journal RSS Feed

A Novel Grid-Based Clustering Algorithm

Published Online: Oct 08, 2021

Page range: 319 - 330

Received: Jan 24, 2021

Accepted: Sep 23, 2021

DOI: https://doi.org/10.2478/jaiscr-2021-0019

Keywords
data mining, grid-based clustering, grid structure

© 2021 Artur Starczewski et al., published by Sciendo

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

A Novel Grid-Based Clustering Algorithm

Published Online: Oct 08, 2021

Page range: 319 - 330

Received: Jan 24, 2021

Accepted: Sep 23, 2021

DOI: https://doi.org/10.2478/jaiscr-2021-0019

Keywordsdata mining, grid-based clustering, grid structure

© 2021 Artur Starczewski et al., published by Sciendo

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Keywords
data mining, grid-based clustering, grid structure