Outlier labeling via circular boxplot
Boxplot is a simple and flexible graphical tool that has been widely used in exploratory data analysis. Its main application is to identify extreme values and outliers in linear univariate data sets. However, the standard boxplot for linear data set is not suitable to be used for circular data sets...
Saved in:
Main Authors: | , , |
---|---|
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2008
|
Subjects: | |
Online Access: | http://eprints.um.edu.my/10365/1/Outlier_labeling_via_circular_boxplot.pdf http://eprints.um.edu.my/10365/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.um.eprints.10365 |
---|---|
record_format |
eprints |
spelling |
my.um.eprints.103652014-12-19T03:24:21Z http://eprints.um.edu.my/10365/ Outlier labeling via circular boxplot Abuzaid, A.H. Hussin, A.G. Mohamed, I.B. QA Mathematics Boxplot is a simple and flexible graphical tool that has been widely used in exploratory data analysis. Its main application is to identify extreme values and outliers in linear univariate data sets. However, the standard boxplot for linear data set is not suitable to be used for circular data sets due to the bounded property of circular variables. In this paper, we propose and develop a boxplot for circular data sets based on five circular summary statistics which is called circular boxplot. In the process, several problems have been resolved. Firstly, we have overcome the problems of estimating the circular median, the first and second quartiles and overlapping areas between the upper and lower fences. Secondly, we resolve the problem of finding the appropriate boxplot criterion which is (νIQR=1.5IQR) in linear case, where IQR is the interquartiles range and ν is the resistance constant. Through simulation studies, we identify the appropriate values of circular boxplot criterion which depends on the concentration parameter. The power of performances of the proposed boxplot is investigated. We then develop S-Plus subroutines to display the circular boxplot and apply the plot on a real circular data set. 2008 Conference or Workshop Item PeerReviewed application/pdf en http://eprints.um.edu.my/10365/1/Outlier_labeling_via_circular_boxplot.pdf Abuzaid, A.H. and Hussin, A.G. and Mohamed, I.B. (2008) Outlier labeling via circular boxplot. In: Conference of the Asian Regional Section of the IASC on Computational Statistics and Data Analysis, 5-8 Dec 2008, Yokohama, Japan. (Submitted) |
institution |
Universiti Malaya |
building |
UM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Malaya |
content_source |
UM Research Repository |
url_provider |
http://eprints.um.edu.my/ |
language |
English |
topic |
QA Mathematics |
spellingShingle |
QA Mathematics Abuzaid, A.H. Hussin, A.G. Mohamed, I.B. Outlier labeling via circular boxplot |
description |
Boxplot is a simple and flexible graphical tool that has been widely used in exploratory data analysis. Its main application is to identify extreme values and outliers in linear univariate data sets. However, the standard boxplot for linear data set is not suitable to be used for circular data sets due to the bounded property of circular variables. In this paper, we propose and develop a boxplot for circular data sets based on five circular summary statistics which is called circular boxplot. In the process, several problems have been resolved. Firstly, we have overcome the problems of estimating the circular median, the first and second quartiles and overlapping areas between the upper and lower fences. Secondly, we resolve the problem of finding the appropriate boxplot criterion which is (νIQR=1.5IQR) in linear case, where IQR is the interquartiles range and ν is the resistance constant. Through simulation studies, we identify the appropriate values of circular boxplot criterion which depends on the concentration parameter. The
power of performances of the proposed boxplot is investigated. We then develop S-Plus subroutines to display the circular boxplot and apply the plot on a real circular data set. |
format |
Conference or Workshop Item |
author |
Abuzaid, A.H. Hussin, A.G. Mohamed, I.B. |
author_facet |
Abuzaid, A.H. Hussin, A.G. Mohamed, I.B. |
author_sort |
Abuzaid, A.H. |
title |
Outlier labeling via circular boxplot |
title_short |
Outlier labeling via circular boxplot |
title_full |
Outlier labeling via circular boxplot |
title_fullStr |
Outlier labeling via circular boxplot |
title_full_unstemmed |
Outlier labeling via circular boxplot |
title_sort |
outlier labeling via circular boxplot |
publishDate |
2008 |
url |
http://eprints.um.edu.my/10365/1/Outlier_labeling_via_circular_boxplot.pdf http://eprints.um.edu.my/10365/ |
_version_ |
1643688780846071808 |
score |
13.160551 |