Please use this identifier to cite or link to this item:
http://hdl.handle.net/10263/7308
Title: | Center-based Robust Clustering |
Authors: | Das, Pranta |
Keywords: | Robust center-based clustering k-Means clustering Outliers Robust k-Means++ TMK++ LSO |
Issue Date: | Jul-2021 |
Publisher: | Indian Statistical Institute, Kolkata. |
Citation: | 35p. |
Series/Report no.: | Dissertation;CS-1923 |
Abstract: | We consider the problem of clustering observations xi ∈ R d , i = 1, ..., n into k possible clusters. We are mainly interested in clustering in the presence of outliers, where classical clustering algorithms face challenges. In the framework of center-based clustering that uses seeding method to initialize centroid and update the centroid in each iterations, we proposed the method of Modified k-Means clustering. In Modified k-Means method, we introduce a new sampling method for initialize the centroids where the Robust k-Means++ method [1] has been tweaked in a straightforward and understandable way and a new centroid update strategy for avoiding the effect of outlier during centroid update stage. Now use this Modified k-Means algorithm as building blocks we proposed Robust center-based clustering algorithm that provides outlier detection and data clustering simultaneously. The proposed algorithm consists of two stages. The first stage consists of Modified k-Means process, while the second stage iteratively remove the points which are far away from their cluster center. The experimental results suggest that our method has out performed this Robust k-Means++ [1] and also TMK++ [2] and local search (LSO) [3] on real world and synthetic data. |
Description: | Dissertation under the supervision of Dr. Swagatam Das |
URI: | http://hdl.handle.net/10263/7308 |
Appears in Collections: | Dissertations - M Tech (CS) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Pranta Das-cs-19-21.pdf | 525.5 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.