Towards lowering computational power in IoT systems: Clustering algorithm for high-dimensional data stream using entropy window reduction
In a world of connectivity empowered by the advancement of the Internet of Things (IoT), an infinite number of data streams have emerged. Thus, data stream clustering is crucial for extracting hidden knowledge and data mining. Various data stream clustering methods have lately been introduced. Yet,...
Saved in:
Main Authors: | , , , , , |
---|---|
Other Authors: | |
Format: | Article |
Published: |
Elsevier B.V.
2024
|
Subjects: | |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.uniten.dspace-34215 |
---|---|
record_format |
dspace |
spelling |
my.uniten.dspace-342152024-10-14T11:18:28Z Towards lowering computational power in IoT systems: Clustering algorithm for high-dimensional data stream using entropy window reduction Alkawsi G. Al-amri R. Baashar Y. Ghorashi S. Alabdulkreem E. Kiong Tiong S. 57191982354 57224896623 56768090200 57219241229 55320872600 57219799117 Anomaly detection Clustering Computational power Data stream Entropy High-dimensionality IoT Anomaly detection Cluster analysis Clustering algorithms Data mining Internet of things Petroleum reservoir evaluation % reductions Anomaly detection Clustering methods Clusterings Computational power Data stream Data stream clustering High dimensionality High-dimensional data streams Reduction algorithms Entropy In a world of connectivity empowered by the advancement of the Internet of Things (IoT), an infinite number of data streams have emerged. Thus, data stream clustering is crucial for extracting hidden knowledge and data mining. Various data stream clustering methods have lately been introduced. Yet, the majority of such algorithms are affected by the curse of high dimensionality. Lately, a fully online buffer-based clustering algorithm for handling evolving data streams (BOCEDS) was developed. Similarly to other existing density-based clustering methods, BOCEDS is not capable of handling high-dimensional data and has high computational power and high memory utilization. This paper introduces an Entropy Window Reduction (EWR) algorithm, which is an improved version of the BOCEDS technique. EWR is a fully online clustering technique for handling high-dimensional data streams using feature ranking and sorting. This process is accomplished by calculating the entropy of specific features with respect to the time window. The findings of the experiments are compared to the outcomes of BOCEDS, CEDAS, and MuDi-Stream algorithms. The outcomes indicate that the EWR algorithm outperformed the baseline clustering algorithms. The results are demonstrated using the KDDCup�99 dataset in terms of quality and complexity evaluation on the average of F-Measures, Jaccard Index, Fowlkes�Mallows index, Purity, and Rand Index as well as the memory usage and computational power with 88%, 66%, 81%, 100%, and 66%, respectively. The results also show low memory usage and computing power in comparison with the baseline algorithms. � 2023 THE AUTHORS Final 2024-10-14T03:18:28Z 2024-10-14T03:18:28Z 2023 Article 10.1016/j.aej.2023.03.008 2-s2.0-85149833939 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85149833939&doi=10.1016%2fj.aej.2023.03.008&partnerID=40&md5=5c75f128e66ab093f54c6a54183098d4 https://irepository.uniten.edu.my/handle/123456789/34215 70 503 513 All Open Access Gold Open Access Elsevier B.V. Scopus |
institution |
Universiti Tenaga Nasional |
building |
UNITEN Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Tenaga Nasional |
content_source |
UNITEN Institutional Repository |
url_provider |
http://dspace.uniten.edu.my/ |
topic |
Anomaly detection Clustering Computational power Data stream Entropy High-dimensionality IoT Anomaly detection Cluster analysis Clustering algorithms Data mining Internet of things Petroleum reservoir evaluation % reductions Anomaly detection Clustering methods Clusterings Computational power Data stream Data stream clustering High dimensionality High-dimensional data streams Reduction algorithms Entropy |
spellingShingle |
Anomaly detection Clustering Computational power Data stream Entropy High-dimensionality IoT Anomaly detection Cluster analysis Clustering algorithms Data mining Internet of things Petroleum reservoir evaluation % reductions Anomaly detection Clustering methods Clusterings Computational power Data stream Data stream clustering High dimensionality High-dimensional data streams Reduction algorithms Entropy Alkawsi G. Al-amri R. Baashar Y. Ghorashi S. Alabdulkreem E. Kiong Tiong S. Towards lowering computational power in IoT systems: Clustering algorithm for high-dimensional data stream using entropy window reduction |
description |
In a world of connectivity empowered by the advancement of the Internet of Things (IoT), an infinite number of data streams have emerged. Thus, data stream clustering is crucial for extracting hidden knowledge and data mining. Various data stream clustering methods have lately been introduced. Yet, the majority of such algorithms are affected by the curse of high dimensionality. Lately, a fully online buffer-based clustering algorithm for handling evolving data streams (BOCEDS) was developed. Similarly to other existing density-based clustering methods, BOCEDS is not capable of handling high-dimensional data and has high computational power and high memory utilization. This paper introduces an Entropy Window Reduction (EWR) algorithm, which is an improved version of the BOCEDS technique. EWR is a fully online clustering technique for handling high-dimensional data streams using feature ranking and sorting. This process is accomplished by calculating the entropy of specific features with respect to the time window. The findings of the experiments are compared to the outcomes of BOCEDS, CEDAS, and MuDi-Stream algorithms. The outcomes indicate that the EWR algorithm outperformed the baseline clustering algorithms. The results are demonstrated using the KDDCup�99 dataset in terms of quality and complexity evaluation on the average of F-Measures, Jaccard Index, Fowlkes�Mallows index, Purity, and Rand Index as well as the memory usage and computational power with 88%, 66%, 81%, 100%, and 66%, respectively. The results also show low memory usage and computing power in comparison with the baseline algorithms. � 2023 THE AUTHORS |
author2 |
57191982354 |
author_facet |
57191982354 Alkawsi G. Al-amri R. Baashar Y. Ghorashi S. Alabdulkreem E. Kiong Tiong S. |
format |
Article |
author |
Alkawsi G. Al-amri R. Baashar Y. Ghorashi S. Alabdulkreem E. Kiong Tiong S. |
author_sort |
Alkawsi G. |
title |
Towards lowering computational power in IoT systems: Clustering algorithm for high-dimensional data stream using entropy window reduction |
title_short |
Towards lowering computational power in IoT systems: Clustering algorithm for high-dimensional data stream using entropy window reduction |
title_full |
Towards lowering computational power in IoT systems: Clustering algorithm for high-dimensional data stream using entropy window reduction |
title_fullStr |
Towards lowering computational power in IoT systems: Clustering algorithm for high-dimensional data stream using entropy window reduction |
title_full_unstemmed |
Towards lowering computational power in IoT systems: Clustering algorithm for high-dimensional data stream using entropy window reduction |
title_sort |
towards lowering computational power in iot systems: clustering algorithm for high-dimensional data stream using entropy window reduction |
publisher |
Elsevier B.V. |
publishDate |
2024 |
_version_ |
1814061046336323584 |
score |
13.214268 |