Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224)

This study developed an algorithm for statistical classification that enable ones to classify a future data to one of predetermined groups based on the measured data which facing two major threats; (i) multicollinearity among the measured variables and (ii) imbalanced groups. The developed algorithm...

Full description

Saved in:
Bibliographic Details
Main Authors: Mahat, Nor Idayu, Engku Abu Bakar, Engku Muhammad Nazri, Zakaria, Ammar, Mohd Nazir, Mohd Amril Nurman, Misiran, Masnita
Format: Monograph
Language:English
Published: UUM
Subjects:
Online Access:https://repo.uum.edu.my/id/eprint/31770/1/13224.pdf
https://repo.uum.edu.my/id/eprint/31770/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.uum.repo.31770
record_format eprints
spelling my.uum.repo.317702024-12-12T10:47:04Z https://repo.uum.edu.my/id/eprint/31770/ Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224) Mahat, Nor Idayu Engku Abu Bakar, Engku Muhammad Nazri Zakaria, Ammar Mohd Nazir, Mohd Amril Nurman Misiran, Masnita QA Mathematics This study developed an algorithm for statistical classification that enable ones to classify a future data to one of predetermined groups based on the measured data which facing two major threats; (i) multicollinearity among the measured variables and (ii) imbalanced groups. The developed algorithm weighted the n objects contribution in explaining the separation between groups. Then, the weights are used together with either Principal Component Analysis (PCA) or Partial Least Square (PLS) to tackle the collinearity among variables. Next, the weighted and transformed features were used to train Linear Discriminant Function (LDA) and to evaluate the constructed rule. The designed algorithm was structured in k-fold cross-validation in attempt to minimise the biasness of the classification performance, measured using error rate. Both simulation on bivariate and multivariate cases show some promising results that the weighted PCA on LDA and the weighted PLS on LDA are better than the traditional LDA, kernel discriminant, and PCA+LDA methods. Whilst, critical investigation on the minority group using sensitivity value has given some evidence how the two proposed methods are competitive, but they are similar if the groups are well separated. Evidence obtained from the real data sets also providing similar results to the simulated ones. Hence, both weighted PCA on LDA and the weighted PLS on LDA can be recommended to discriminate imbalanced groups with correlated variables UUM Monograph NonPeerReviewed application/pdf en https://repo.uum.edu.my/id/eprint/31770/1/13224.pdf Mahat, Nor Idayu and Engku Abu Bakar, Engku Muhammad Nazri and Zakaria, Ammar and Mohd Nazir, Mohd Amril Nurman and Misiran, Masnita Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224). Project Report. UUM. (Submitted)
institution Universiti Utara Malaysia
building UUM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Utara Malaysia
content_source UUM Institutional Repository
url_provider http://repo.uum.edu.my/
language English
topic QA Mathematics
spellingShingle QA Mathematics
Mahat, Nor Idayu
Engku Abu Bakar, Engku Muhammad Nazri
Zakaria, Ammar
Mohd Nazir, Mohd Amril Nurman
Misiran, Masnita
Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224)
description This study developed an algorithm for statistical classification that enable ones to classify a future data to one of predetermined groups based on the measured data which facing two major threats; (i) multicollinearity among the measured variables and (ii) imbalanced groups. The developed algorithm weighted the n objects contribution in explaining the separation between groups. Then, the weights are used together with either Principal Component Analysis (PCA) or Partial Least Square (PLS) to tackle the collinearity among variables. Next, the weighted and transformed features were used to train Linear Discriminant Function (LDA) and to evaluate the constructed rule. The designed algorithm was structured in k-fold cross-validation in attempt to minimise the biasness of the classification performance, measured using error rate. Both simulation on bivariate and multivariate cases show some promising results that the weighted PCA on LDA and the weighted PLS on LDA are better than the traditional LDA, kernel discriminant, and PCA+LDA methods. Whilst, critical investigation on the minority group using sensitivity value has given some evidence how the two proposed methods are competitive, but they are similar if the groups are well separated. Evidence obtained from the real data sets also providing similar results to the simulated ones. Hence, both weighted PCA on LDA and the weighted PLS on LDA can be recommended to discriminate imbalanced groups with correlated variables
format Monograph
author Mahat, Nor Idayu
Engku Abu Bakar, Engku Muhammad Nazri
Zakaria, Ammar
Mohd Nazir, Mohd Amril Nurman
Misiran, Masnita
author_facet Mahat, Nor Idayu
Engku Abu Bakar, Engku Muhammad Nazri
Zakaria, Ammar
Mohd Nazir, Mohd Amril Nurman
Misiran, Masnita
author_sort Mahat, Nor Idayu
title Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224)
title_short Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224)
title_full Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224)
title_fullStr Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224)
title_full_unstemmed Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224)
title_sort optimal weighted learning of pca and pls for multicollinearity discriminators and imbalanced groups in big data (s/o: 13224)
publisher UUM
url https://repo.uum.edu.my/id/eprint/31770/1/13224.pdf
https://repo.uum.edu.my/id/eprint/31770/
_version_ 1818837615516844032
score 13.223943