The performance of robust-diagnostic F in the identification of multiple high leverage points

Bibliographic Details
Main Authors: Midi, Habshah; Abu Bakar, Nor Mazlina
Format: Article
Language: English
Published: Pakistan Journal of Statistics 2015
Online Access: http://psasir.upm.edu.my/id/eprint/46660/1/The%20performance%20of%20robust-diagnostic%20F%20in%20the%20identification%20of%20multiple%20high%20leverage%20points.pdf
http://psasir.upm.edu.my/id/eprint/46660/
http://www.pakjs.com
id my.upm.eprints.46660
record_format eprints
timestamp 2018-03-30T07:37:32Z
citation Midi, Habshah and Abu Bakar, Nor Mazlina (2015) The performance of robust-diagnostic F in the identification of multiple high leverage points. Pakistan Journal of Statistics, 31 (5). pp. 461-472. ISSN 1012-9367. Peer reviewed.
institution Universiti Putra Malaysia
building UPM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Putra Malaysia
content_source UPM Institutional Repository
url_provider http://psasir.upm.edu.my/
description High leverage points have an undue effect on least squares estimates. They are responsible for misleading conclusions in regression and in multicollinearity problems. Hence, it is imperative to detect high leverage points and to use robust estimators for the parameters of a regression model, so as to arrive at valid conclusions. Several well-known methods fail to detect multiple high leverage points correctly because of swamping and/or masking effects. The Diagnostic Robust Generalized Potential (DRGP) is an appealing alternative that successfully detects high leverage points. However, for small percentages of high leverage points, it tends to flag a few low leverage points as points of high leverage. In this paper, an attempt is made to correctly identify the real high leverage points by reducing swamping effects. We propose a method called Robust Diagnostic-F (RDF), in which a robust approach is employed to detect the suspected high leverage points. Then, an F statistic that reflects the change in the data covariance structure is used to confirm the suspicion. The performance of RDF is evaluated through real data and simulations. Comparisons are also made with existing methods.
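The record does not give the DRGP or RDF formulas, so the following is only a rough sketch of the general idea of comparing a classical leverage measure with a robust one. It is not the authors' procedure. It computes the hat-matrix diagonal with the common twice-the-mean cutoff 2(p+1)/n, and a simple coordinate-wise robust score based on the median and MAD. The simulated data, the function names, the planted cluster of size k, and the robust cutoff of 3.0 are all assumptions made for illustration.

```python
import numpy as np

def hat_leverage(X):
    """Diagonal of the hat matrix H = Z (Z'Z)^{-1} Z', with an intercept column."""
    Z = np.column_stack([np.ones(len(X)), X])
    Q, _ = np.linalg.qr(Z)           # reduced QR, so H = Q Q'
    return np.sum(Q**2, axis=1)

def robust_score(X):
    """Largest coordinate-wise robust z-score, using median and scaled MAD.

    Illustrative stand-in for a robust distance; not DRGP and not RDF.
    """
    med = np.median(X, axis=0)
    mad = 1.4826 * np.median(np.abs(X - med), axis=0)
    return np.max(np.abs(X - med) / mad, axis=1)

# Simulated design matrix (assumption): n points, p regressors,
# with the first k rows shifted to form a cluster of high leverage points.
rng = np.random.default_rng(0)
n, p, k = 40, 2, 4
X = rng.normal(size=(n, p))
X[:k] += 10.0

h = hat_leverage(X)
cutoff_h = 2 * (p + 1) / n           # twice-the-mean rule for hat values
flag_h = np.flatnonzero(h > cutoff_h)

rd = robust_score(X)
cutoff_rd = 3.0                      # arbitrary illustrative cutoff (assumption)
flag_rd = np.flatnonzero(rd > cutoff_rd)

print("flagged by hat values:  ", flag_h)
print("flagged by robust score:", flag_rd)
```

A confirmation step in the spirit of the abstract would then re-examine each suspected point before declaring it a high leverage point. That step is left out here because the record does not specify the F statistic.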