[arXiv] BigDataFr recommends: Distributed Spatial Data Clustering as a New Approach for Big Data Analysis

data clusteringBigDataFr recommends: Distributed Spatial Data Clustering as a New Approach for Big Data Analysis

[…] Subjects: Distributed, Parallel, and Cluster Computing (cs.DC)

In this paper we propose a new approach for Big Data mining and analysis. This new approach works well on distributed datasets and deals with data clustering task of the analysis. The approach consists of two main phases, the first phase executes a clustering algorithm on local data, assuming that the datasets was already distributed among the system processing nodes. The second phase deals with the local clusters aggregation to generate global clusters. This approach not only generates local clusters on each processing node in parallel, but also facilitates the formation of global clusters without prior knowledge of the number of the clusters, which many partitioning clustering algorithm require. In this study, this approach was applied on spatial datasets […]

Read more
By Malika Bendechache, Nhien-An Le-Khac, M-Tahar Kechadi
Source: arxiv.org

Laisser un commentaire