Improved normalization and standardization techniques for higher purity in K-means clustering

Clustering is basically one of the major sources of primary data mining tools, which make researchers understand the natural grouping of attributes in datasets. Clustering is an unsupervised classification method with aim of partitioning, where objects in the same cluster are similar, and objects be...

詳細記述

保存先:
書誌詳細
主要な著者: Dalatu, Paul Inuwa, Fitrianto, Anwar, Mustapha, Aida
フォーマット: 論文
言語:English
出版事項: Pushpa Publishing House 2016
オンライン・アクセス:http://psasir.upm.edu.my/id/eprint/54519/1/Improved%20normalization%20and%20standardization%20techniques%20for%20higher%20purity%20in%20K-means%20clustering.pdf
http://psasir.upm.edu.my/id/eprint/54519/
http://www.pphmj.com/abstract/10134.htm
タグ: タグ追加
タグなし, このレコードへの初めてのタグを付けませんか!
その他の書誌記述
要約:Clustering is basically one of the major sources of primary data mining tools, which make researchers understand the natural grouping of attributes in datasets. Clustering is an unsupervised classification method with aim of partitioning, where objects in the same cluster are similar, and objects belong to different clusters vary significantly, with respect to their attributes. The K-means algorithm is a famous and fast technique in non-hierarchical cluster algorithms. Based on its simplicity, the K-means algorithm has been used in many fields. This paper proposes improved normalization and standardization techniques for higher purity in K-means clustering experimented with benchmark datasets from UCI machine learning repository and it was found that all the proposed techniques’ performance was much higher compared to the conventional K-means and the three classic transformations, and it is evidently shown by purity and Rand index accuracy results.