Comparative study of clustering-based outliers detection methods in circular-circular regression model
This paper is a comparative study of several algorithms for detecting multiple outliers in circular-circular regression model based on the clustering algorithms. Three measures of similarity based on the circular distance were used to obtain a cluster tree using the agglomerative hierarchical method...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Penerbit Universiti Kebangsaan Malaysia
2021
|
Subjects: | |
Online Access: | http://umpir.ump.edu.my/id/eprint/35176/1/Comparative%20study%20of%20clustering-based%20outliers%20detection%20methods%20in%20circular-circular%20regression%20model.pdf http://umpir.ump.edu.my/id/eprint/35176/ https://doi.org/10.17576/jsm-2021-5006-24 https://doi.org/10.17576/jsm-2021-5006-24 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.ump.umpir.35176 |
---|---|
record_format |
eprints |
spelling |
my.ump.umpir.351762022-11-07T06:30:43Z http://umpir.ump.edu.my/id/eprint/35176/ Comparative study of clustering-based outliers detection methods in circular-circular regression model Siti Zanariah, Satari Nur Faraidah, Muhammad Di Yong Zulina, Zubairi Abdul Ghapor, Hussin Q Science (General) QA Mathematics This paper is a comparative study of several algorithms for detecting multiple outliers in circular-circular regression model based on the clustering algorithms. Three measures of similarity based on the circular distance were used to obtain a cluster tree using the agglomerative hierarchical methods. A stopping rule for the cluster tree based on the mean direction and circular standard deviation of the tree height was used as the cutoff point and classifier to the cluster group that exceeded the stopping rule as potential outliers. The performances of the algorithms have been demonstrated using the simulation studies that consider several outlier scenarios with a certain degree of contamination. Application to real data using wind data and a simulated data set are given for illustrative purposes. Thus, it has been found that Satari's algorithm (S-SL algorithm) performs well for any values of sample size n and error concentration parameter. The algorithms are good in identifying outliers which are not limited to one or few outliers only, but the presence of multiple outliers at one time. Penerbit Universiti Kebangsaan Malaysia 2021-06 Article PeerReviewed pdf en http://umpir.ump.edu.my/id/eprint/35176/1/Comparative%20study%20of%20clustering-based%20outliers%20detection%20methods%20in%20circular-circular%20regression%20model.pdf Siti Zanariah, Satari and Nur Faraidah, Muhammad Di and Yong Zulina, Zubairi and Abdul Ghapor, Hussin (2021) Comparative study of clustering-based outliers detection methods in circular-circular regression model. Sains Malaysiana, 50 (6). pp. 1787-1798. ISSN 0126-6039 https://doi.org/10.17576/jsm-2021-5006-24 https://doi.org/10.17576/jsm-2021-5006-24 |
institution |
Universiti Malaysia Pahang |
building |
UMP Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Malaysia Pahang |
content_source |
UMP Institutional Repository |
url_provider |
http://umpir.ump.edu.my/ |
language |
English |
topic |
Q Science (General) QA Mathematics |
spellingShingle |
Q Science (General) QA Mathematics Siti Zanariah, Satari Nur Faraidah, Muhammad Di Yong Zulina, Zubairi Abdul Ghapor, Hussin Comparative study of clustering-based outliers detection methods in circular-circular regression model |
description |
This paper is a comparative study of several algorithms for detecting multiple outliers in circular-circular regression model based on the clustering algorithms. Three measures of similarity based on the circular distance were used to obtain a cluster tree using the agglomerative hierarchical methods. A stopping rule for the cluster tree based on the mean direction and circular standard deviation of the tree height was used as the cutoff point and classifier to the cluster group that exceeded the stopping rule as potential outliers. The performances of the algorithms have been demonstrated using the simulation studies that consider several outlier scenarios with a certain degree of contamination. Application to real data using wind data and a simulated data set are given for illustrative purposes. Thus, it has been found that Satari's algorithm (S-SL algorithm) performs well for any values of sample size n and error concentration parameter. The algorithms are good in identifying outliers which are not limited to one or few outliers only, but the presence of multiple outliers at one time. |
format |
Article |
author |
Siti Zanariah, Satari Nur Faraidah, Muhammad Di Yong Zulina, Zubairi Abdul Ghapor, Hussin |
author_facet |
Siti Zanariah, Satari Nur Faraidah, Muhammad Di Yong Zulina, Zubairi Abdul Ghapor, Hussin |
author_sort |
Siti Zanariah, Satari |
title |
Comparative study of clustering-based outliers detection methods in circular-circular regression model |
title_short |
Comparative study of clustering-based outliers detection methods in circular-circular regression model |
title_full |
Comparative study of clustering-based outliers detection methods in circular-circular regression model |
title_fullStr |
Comparative study of clustering-based outliers detection methods in circular-circular regression model |
title_full_unstemmed |
Comparative study of clustering-based outliers detection methods in circular-circular regression model |
title_sort |
comparative study of clustering-based outliers detection methods in circular-circular regression model |
publisher |
Penerbit Universiti Kebangsaan Malaysia |
publishDate |
2021 |
url |
http://umpir.ump.edu.my/id/eprint/35176/1/Comparative%20study%20of%20clustering-based%20outliers%20detection%20methods%20in%20circular-circular%20regression%20model.pdf http://umpir.ump.edu.my/id/eprint/35176/ https://doi.org/10.17576/jsm-2021-5006-24 https://doi.org/10.17576/jsm-2021-5006-24 |
_version_ |
1751536383315410944 |
score |
13.18916 |