The performance of robust-diagnostic F in the identification of multiple high leverage points
High leverage points have undue effects on least squares estimates. They are responsible for misleading conclusions in regression and multicollinearity problems. Hence, it is imperative to detect high leverage points and use robust estimators to estimate the parameters of a regression model, so a...
Main Authors: | Midi, Habshah; Abu Bakar, Nor Mazlina |
---|---|
Format: | Article |
Language: | English |
Published: |
Pakistan Journal of Statistics
2015
|
Online Access: | http://psasir.upm.edu.my/id/eprint/46660/1/The%20performance%20of%20robust-diagnostic%20F%20in%20the%20identification%20of%20multiple%20high%20leverage%20points.pdf http://psasir.upm.edu.my/id/eprint/46660/ http://www.pakjs.com |
id |
my.upm.eprints.46660 |
---|---|
record_format |
eprints |
spelling |
my.upm.eprints.46660 2018-03-30T07:37:32Z http://psasir.upm.edu.my/id/eprint/46660/ The performance of robust-diagnostic F in the identification of multiple high leverage points. Midi, Habshah; Abu Bakar, Nor Mazlina. Pakistan Journal of Statistics 2015 Article PeerReviewed text en http://psasir.upm.edu.my/id/eprint/46660/1/The%20performance%20of%20robust-diagnostic%20F%20in%20the%20identification%20of%20multiple%20high%20leverage%20points.pdf Midi, Habshah and Abu Bakar, Nor Mazlina (2015) The performance of robust-diagnostic F in the identification of multiple high leverage points. Pakistan Journal of Statistics, 31 (5). pp. 461-472. ISSN 1012-9367 http://www.pakjs.com |
institution |
Universiti Putra Malaysia |
building |
UPM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Putra Malaysia |
content_source |
UPM Institutional Repository |
url_provider |
http://psasir.upm.edu.my/ |
language |
English |
description |
High leverage points have undue effects on least squares estimates. They are responsible for misleading conclusions in regression and multicollinearity problems. Hence, it is imperative to detect high leverage points and to use robust estimators to estimate the parameters of a regression model, so as to arrive at valid conclusions. Several well-known methods fail to detect multiple high leverage points correctly because of swamping and/or masking effects. The Diagnostic Robust Generalized Potential (DRGP) is an appealing alternative that successfully detects high leverage points. However, for small percentages of high leverage points, it tends to identify a few low leverage points as high leverage points. In this paper, an attempt is made to correctly identify the real high leverage points by reducing swamping effects. We propose a method, which we call Robust Diagnostic-F (RDF), in which a robust approach is employed to detect the suspected high leverage points. An F statistic that relates to the change in the data covariance structure is then used to confirm the suspicion. The performance of RDF is evaluated through real data and simulations. Comparisons are also made with existing methods. |
format |
Article |
author |
Midi, Habshah; Abu Bakar, Nor Mazlina |
title |
The performance of robust-diagnostic F in the identification of multiple high leverage points |
publisher |
Pakistan Journal of Statistics |
publishDate |
2015 |
url |
http://psasir.upm.edu.my/id/eprint/46660/1/The%20performance%20of%20robust-diagnostic%20F%20in%20the%20identification%20of%20multiple%20high%20leverage%20points.pdf http://psasir.upm.edu.my/id/eprint/46660/ http://www.pakjs.com |