Comparative study of clustering-based outliers detection methods in circular-circular regression model

This paper is a comparative study of several algorithms for detecting multiple outliers in circular-circular regression model based on the clustering algorithms. Three measures of similarity based on the circular distance were used to obtain a cluster tree using the agglomerative hierarchical method...

Full description

Saved in:
Bibliographic Details
Main Authors: Satari, Siti Zanariah, Di, Nur Faraidah Muhammad, Zubairi, Yong Zulina, Hussin, Abdul Ghapor
Format: Article
Published: Penerbit Universiti Kebangsaan Malaysia 2021
Subjects:
Online Access:http://eprints.um.edu.my/26377/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.um.eprints.26377
record_format eprints
spelling my.um.eprints.263772022-02-24T07:01:51Z http://eprints.um.edu.my/26377/ Comparative study of clustering-based outliers detection methods in circular-circular regression model Satari, Siti Zanariah Di, Nur Faraidah Muhammad Zubairi, Yong Zulina Hussin, Abdul Ghapor QA Mathematics This paper is a comparative study of several algorithms for detecting multiple outliers in circular-circular regression model based on the clustering algorithms. Three measures of similarity based on the circular distance were used to obtain a cluster tree using the agglomerative hierarchical methods. A stopping rule for the cluster tree based on the mean direction and circular standard deviation of the tree height was used as the cutoff point and classifier to the cluster group that exceeded the stopping rule as potential outliers. The performances of the algorithms have been demonstrated using the simulation studies that consider several outlier scenarios with a certain degree of contamination. Application to real data using wind data and a simulated data set are given for illustrative purposes. Thus, it has been found that Satari's algorithm (S-SL algorithm) performs well for any values of sample size n and error concentration parameter. The algorithms are good in identifying outliers which are not limited to one or few outliers only, but the presence of multiple outliers at one time. Penerbit Universiti Kebangsaan Malaysia 2021-06 Article PeerReviewed Satari, Siti Zanariah and Di, Nur Faraidah Muhammad and Zubairi, Yong Zulina and Hussin, Abdul Ghapor (2021) Comparative study of clustering-based outliers detection methods in circular-circular regression model. Sains Malaysiana, 50 (6). pp. 1787-1798. ISSN 0126-6039, DOI https://doi.org/10.17576/jsm-2021-5006-24 <https://doi.org/10.17576/jsm-2021-5006-24>. 10.17576/jsm-2021-5006-24
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Research Repository
url_provider http://eprints.um.edu.my/
topic QA Mathematics
spellingShingle QA Mathematics
Satari, Siti Zanariah
Di, Nur Faraidah Muhammad
Zubairi, Yong Zulina
Hussin, Abdul Ghapor
Comparative study of clustering-based outliers detection methods in circular-circular regression model
description This paper is a comparative study of several algorithms for detecting multiple outliers in circular-circular regression model based on the clustering algorithms. Three measures of similarity based on the circular distance were used to obtain a cluster tree using the agglomerative hierarchical methods. A stopping rule for the cluster tree based on the mean direction and circular standard deviation of the tree height was used as the cutoff point and classifier to the cluster group that exceeded the stopping rule as potential outliers. The performances of the algorithms have been demonstrated using the simulation studies that consider several outlier scenarios with a certain degree of contamination. Application to real data using wind data and a simulated data set are given for illustrative purposes. Thus, it has been found that Satari's algorithm (S-SL algorithm) performs well for any values of sample size n and error concentration parameter. The algorithms are good in identifying outliers which are not limited to one or few outliers only, but the presence of multiple outliers at one time.
format Article
author Satari, Siti Zanariah
Di, Nur Faraidah Muhammad
Zubairi, Yong Zulina
Hussin, Abdul Ghapor
author_facet Satari, Siti Zanariah
Di, Nur Faraidah Muhammad
Zubairi, Yong Zulina
Hussin, Abdul Ghapor
author_sort Satari, Siti Zanariah
title Comparative study of clustering-based outliers detection methods in circular-circular regression model
title_short Comparative study of clustering-based outliers detection methods in circular-circular regression model
title_full Comparative study of clustering-based outliers detection methods in circular-circular regression model
title_fullStr Comparative study of clustering-based outliers detection methods in circular-circular regression model
title_full_unstemmed Comparative study of clustering-based outliers detection methods in circular-circular regression model
title_sort comparative study of clustering-based outliers detection methods in circular-circular regression model
publisher Penerbit Universiti Kebangsaan Malaysia
publishDate 2021
url http://eprints.um.edu.my/26377/
_version_ 1735409404753215488
score 13.159267