This repository has been archived by the owner on Sep 24, 2020. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 0
K means clustering
Arupjyoti Nath edited this page Aug 13, 2018
·
9 revisions
.It is the mostly used clustering methods. This algorithm assumes a geometric interpretation of the data sets and the datasets can be represented as some points in an n-dimensional data space. It aims to partition the n observations into k clusters in which each observation belongs to the cluster with the nearest mean, serving as a prototype of the cluster. This results in a partitioning of the data space into Voronoi cells. The steps for K-means of clustering are as follows:
- Construct a partition of a database D of n different objects into a set of k cluster centre. Here, we select the k clusters randomly.
- Now, assign every item of the datasets to its nearest cluster centroid.
- Recalculate the centre of the clusters and move items nearby to the clusters.
- Repeat steps 2 and 3 until the clusters stops changing their positions and move cluster centre to cluster
Overview
Basic Concepts
- What is Data Mining?
- Data mining goals
- Data objects and statistical concepts
- Machine Learning techniques
- Applications
- Related Technologies
Machine Learning Algorithms
- Association rules
- Classification
- Prediction
- Clustering
Machine Learning tool tutorials
Assignment
Advanced Topics
Data warehouse and OLAP