The Multiple Outliers Detection using Agglomerative Hierarchical Methods in Circular Regression Model

Bibliographic Details
Main Authors: Siti Zanariah Satari; Nur Faraidah Muhammad Di; Roslinazairimah Zakaria
Format: Article
Language: English
Published: IOP Publishing 2017
Online Access: http://umpir.ump.edu.my/id/eprint/18916/1/The%20Multiple%20Outliers%20Detection%20using%20Agglomerative%20Hierarchical%20Methods%20in%20Circular%20Regression%20Model.pdf
http://umpir.ump.edu.my/id/eprint/18916/
http://dx.doi.org/10.1088/1742-6596/890/1/012152
Description
Summary: Two agglomerative hierarchical clustering algorithms for identifying multiple outliers in a circular regression model have been developed in this study. An agglomerative hierarchical clustering algorithm starts with each observation in its own cluster and repeatedly merges the closest pair of clusters, according to some similarity criterion, until all the data are grouped in one cluster. The single-linkage method is one of the simplest agglomerative hierarchical methods and is commonly used to detect outliers. In this study, we compared the performance of the single-linkage method with that of another agglomerative hierarchical method, namely average linkage, for detecting outliers in a circular regression model. The performance of both methods was examined via simulation studies by measuring their "success" probability, masking effect, and swamping effect for different sample sizes and levels of contamination. The results show that the single-linkage method performs very well in detecting multiple outliers, with lower masking and swamping effects.
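The clustering idea in the summary can be illustrated with a minimal Python sketch. It is not the authors' implementation: this record does not state the similarity measure or the stopping rule used in the paper, so the sketch assumes that circular residuals (in radians) from an already fitted circular regression are available, uses the shortest arc length as the distance, and cuts the tree at a hypothetical height `cut_height`. The helper names `circular_distance` and `flag_outliers` are illustrative only. Observations outside the largest resulting cluster are flagged as candidate outliers.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster
from scipy.spatial.distance import squareform


def circular_distance(a, b):
    """Shortest arc length between angles a and b (radians), in [0, pi]."""
    d = np.abs(a - b) % (2 * np.pi)
    return np.minimum(d, 2 * np.pi - d)


def flag_outliers(residuals, method="single", cut_height=0.5):
    """Flag observations that fall outside the largest cluster.

    residuals  : circular residuals in radians (assumed input)
    method     : "single" or "average" linkage
    cut_height : hypothetical tree-cutting height, not taken from the paper
    """
    residuals = np.asarray(residuals, dtype=float)
    # Full pairwise circular distance matrix, then SciPy's condensed form.
    dist = circular_distance(residuals[:, None], residuals[None, :])
    condensed = squareform(dist, checks=False)
    # Build the agglomerative tree and cut it at the chosen height.
    tree = linkage(condensed, method=method)
    labels = fcluster(tree, t=cut_height, criterion="distance")
    # Treat the most populated cluster as the clean bulk of the data.
    main_label = np.bincount(labels).argmax()
    return labels != main_label


# Toy data: 20 residuals close to 0 (wrapped onto [0, 2*pi)) plus 3 distant ones.
clean = np.linspace(-0.1, 0.1, 20) % (2 * np.pi)
residuals = np.concatenate([clean, [2.5, 2.55, 2.6]])
flags_single = flag_outliers(residuals, method="single")
flags_average = flag_outliers(residuals, method="average")
```

On this toy data both linkage choices separate the three distant residuals from the bulk. The differences in masking and swamping that the study reports would only show up in a simulation over many contaminated samples, which this sketch does not attempt.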