Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224)
This study developed an algorithm for statistical classification that enable ones to classify a future data to one of predetermined groups based on the measured data which facing two major threats; (i) multicollinearity among the measured variables and (ii) imbalanced groups. The developed algorithm...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Monograph |
Language: | English |
Published: |
UUM
|
Subjects: | |
Online Access: | https://repo.uum.edu.my/id/eprint/31770/1/13224.pdf https://repo.uum.edu.my/id/eprint/31770/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.uum.repo.31770 |
---|---|
record_format |
eprints |
spelling |
my.uum.repo.317702024-12-12T10:47:04Z https://repo.uum.edu.my/id/eprint/31770/ Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224) Mahat, Nor Idayu Engku Abu Bakar, Engku Muhammad Nazri Zakaria, Ammar Mohd Nazir, Mohd Amril Nurman Misiran, Masnita QA Mathematics This study developed an algorithm for statistical classification that enable ones to classify a future data to one of predetermined groups based on the measured data which facing two major threats; (i) multicollinearity among the measured variables and (ii) imbalanced groups. The developed algorithm weighted the n objects contribution in explaining the separation between groups. Then, the weights are used together with either Principal Component Analysis (PCA) or Partial Least Square (PLS) to tackle the collinearity among variables. Next, the weighted and transformed features were used to train Linear Discriminant Function (LDA) and to evaluate the constructed rule. The designed algorithm was structured in k-fold cross-validation in attempt to minimise the biasness of the classification performance, measured using error rate. Both simulation on bivariate and multivariate cases show some promising results that the weighted PCA on LDA and the weighted PLS on LDA are better than the traditional LDA, kernel discriminant, and PCA+LDA methods. Whilst, critical investigation on the minority group using sensitivity value has given some evidence how the two proposed methods are competitive, but they are similar if the groups are well separated. Evidence obtained from the real data sets also providing similar results to the simulated ones. Hence, both weighted PCA on LDA and the weighted PLS on LDA can be recommended to discriminate imbalanced groups with correlated variables UUM Monograph NonPeerReviewed application/pdf en https://repo.uum.edu.my/id/eprint/31770/1/13224.pdf Mahat, Nor Idayu and Engku Abu Bakar, Engku Muhammad Nazri and Zakaria, Ammar and Mohd Nazir, Mohd Amril Nurman and Misiran, Masnita Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224). Project Report. UUM. (Submitted) |
institution |
Universiti Utara Malaysia |
building |
UUM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Utara Malaysia |
content_source |
UUM Institutional Repository |
url_provider |
http://repo.uum.edu.my/ |
language |
English |
topic |
QA Mathematics |
spellingShingle |
QA Mathematics Mahat, Nor Idayu Engku Abu Bakar, Engku Muhammad Nazri Zakaria, Ammar Mohd Nazir, Mohd Amril Nurman Misiran, Masnita Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224) |
description |
This study developed an algorithm for statistical classification that enable ones to classify a future data to one of predetermined groups based on the measured data which facing two major threats; (i) multicollinearity among the measured variables and (ii) imbalanced groups. The developed algorithm weighted the n objects contribution in explaining the separation between groups. Then, the weights are used together with either Principal Component Analysis (PCA) or Partial Least Square (PLS) to tackle
the collinearity among variables. Next, the weighted and transformed features were used to train Linear Discriminant Function (LDA) and to evaluate the constructed rule. The designed algorithm was structured in k-fold cross-validation in attempt to minimise the biasness of the classification performance, measured using error rate. Both simulation on bivariate and multivariate cases show some promising results that the weighted PCA on LDA and the weighted PLS on LDA are better than the traditional LDA, kernel discriminant, and PCA+LDA methods. Whilst, critical investigation on the minority group using sensitivity value has given some evidence how the two proposed methods are competitive, but they
are similar if the groups are well separated. Evidence obtained from the real data sets also providing similar results to the simulated ones. Hence, both weighted PCA on LDA and the weighted PLS on LDA can be recommended to discriminate imbalanced groups with correlated variables |
format |
Monograph |
author |
Mahat, Nor Idayu Engku Abu Bakar, Engku Muhammad Nazri Zakaria, Ammar Mohd Nazir, Mohd Amril Nurman Misiran, Masnita |
author_facet |
Mahat, Nor Idayu Engku Abu Bakar, Engku Muhammad Nazri Zakaria, Ammar Mohd Nazir, Mohd Amril Nurman Misiran, Masnita |
author_sort |
Mahat, Nor Idayu |
title |
Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224) |
title_short |
Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224) |
title_full |
Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224) |
title_fullStr |
Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224) |
title_full_unstemmed |
Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224) |
title_sort |
optimal weighted learning of pca and pls for multicollinearity discriminators and imbalanced groups in big data (s/o: 13224) |
publisher |
UUM |
url |
https://repo.uum.edu.my/id/eprint/31770/1/13224.pdf https://repo.uum.edu.my/id/eprint/31770/ |
_version_ |
1818837615516844032 |
score |
13.223943 |