Skip to content
This repository has been archived by the owner on Sep 24, 2020. It is now read-only.

K means clustering

Arupjyoti Nath edited this page Aug 13, 2018 · 9 revisions

.It is the mostly used clustering methods. This algorithm assumes a geometric interpretation of the data sets and the datasets can be represented as some points in an n-dimensional data space. It aims to partition the n observations into k clusters in which each observation belongs to the cluster with the nearest mean, serving as a prototype of the cluster. This results in a partitioning of the data space into Voronoi cells. The steps for K-means of clustering are as follows:

  1. Construct a partition of a database D of n different objects into a set of k cluster centre. Here, we select the k clusters randomly.

1

  1. Now, assign every item of the datasets to its nearest cluster centroid.

2

  1. Recalculate the centre of the clusters and move items nearby to the clusters.

3

  1. Repeat steps 2 and 3 until the clusters stops changing their positions and move cluster centre to cluster

4

Clone this wiki locally