Predicting Customer Buying Decisions for Online Shopping with Unbalanced Data Set

One of the common e-commerce problems is the low purchase conversion rate. Data mining techniques can help tackle the problem by analysing and predicting the customer purchase intention to give better service and better recommendations to customers. In this project, the realtime online shoppers purc...

Full description

Saved in:
Bibliographic Details
Main Author: Yap, Chau Tean
Format: Final Year Project / Dissertation / Thesis
Published: 2022
Subjects:
Online Access:http://eprints.utar.edu.my/4990/1/YAP_CHAU_TEAN_2000681.pdf
http://eprints.utar.edu.my/4990/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my-utar-eprints.4990
record_format eprints
spelling my-utar-eprints.49902022-12-26T09:40:53Z Predicting Customer Buying Decisions for Online Shopping with Unbalanced Data Set Yap, Chau Tean T Technology (General) One of the common e-commerce problems is the low purchase conversion rate. Data mining techniques can help tackle the problem by analysing and predicting the customer purchase intention to give better service and better recommendations to customers. In this project, the realtime online shoppers purchasing intention data set from Sakar et al. (2018) was used. The data set is unbalanced as it consists of 15.5% of the positive class and 84.5% of the negative class. Weka, a data mining tool, provides the facility to classify the data set with different machine learning algorithms. Six machine learning algorithms were applied and compared based on the classification evaluation methods. The algorithms involved were K-Nearest Neighbor (KNN), Naïve Bayers, J48, Support Vector Machine (SVM), Sequential Minimal Optimization (SMO) and Multilayer Perceptron (MLP). Data pre-processing on the data set may improve the classification results. The methods used were over-sampling, under-sampling and hybrid sampling, which modified the data set class distribution to achieve a better result. The hybrid sampling method gave comparable classification results compared with Sakar et al. (2018). Ensemble learning methods AdaBoost and Bagging were tested but showed no improvement on this online shoppers purchasing intention data set. 2022 Final Year Project / Dissertation / Thesis NonPeerReviewed application/pdf http://eprints.utar.edu.my/4990/1/YAP_CHAU_TEAN_2000681.pdf Yap, Chau Tean (2022) Predicting Customer Buying Decisions for Online Shopping with Unbalanced Data Set. Master dissertation/thesis, UTAR. http://eprints.utar.edu.my/4990/
institution Universiti Tunku Abdul Rahman
building UTAR Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Tunku Abdul Rahman
content_source UTAR Institutional Repository
url_provider http://eprints.utar.edu.my
topic T Technology (General)
spellingShingle T Technology (General)
Yap, Chau Tean
Predicting Customer Buying Decisions for Online Shopping with Unbalanced Data Set
description One of the common e-commerce problems is the low purchase conversion rate. Data mining techniques can help tackle the problem by analysing and predicting the customer purchase intention to give better service and better recommendations to customers. In this project, the realtime online shoppers purchasing intention data set from Sakar et al. (2018) was used. The data set is unbalanced as it consists of 15.5% of the positive class and 84.5% of the negative class. Weka, a data mining tool, provides the facility to classify the data set with different machine learning algorithms. Six machine learning algorithms were applied and compared based on the classification evaluation methods. The algorithms involved were K-Nearest Neighbor (KNN), Naïve Bayers, J48, Support Vector Machine (SVM), Sequential Minimal Optimization (SMO) and Multilayer Perceptron (MLP). Data pre-processing on the data set may improve the classification results. The methods used were over-sampling, under-sampling and hybrid sampling, which modified the data set class distribution to achieve a better result. The hybrid sampling method gave comparable classification results compared with Sakar et al. (2018). Ensemble learning methods AdaBoost and Bagging were tested but showed no improvement on this online shoppers purchasing intention data set.
format Final Year Project / Dissertation / Thesis
author Yap, Chau Tean
author_facet Yap, Chau Tean
author_sort Yap, Chau Tean
title Predicting Customer Buying Decisions for Online Shopping with Unbalanced Data Set
title_short Predicting Customer Buying Decisions for Online Shopping with Unbalanced Data Set
title_full Predicting Customer Buying Decisions for Online Shopping with Unbalanced Data Set
title_fullStr Predicting Customer Buying Decisions for Online Shopping with Unbalanced Data Set
title_full_unstemmed Predicting Customer Buying Decisions for Online Shopping with Unbalanced Data Set
title_sort predicting customer buying decisions for online shopping with unbalanced data set
publishDate 2022
url http://eprints.utar.edu.my/4990/1/YAP_CHAU_TEAN_2000681.pdf
http://eprints.utar.edu.my/4990/
_version_ 1753793016210915328
score 13.18916