MuDi-Stream: A multi density clustering algorithm for evolving data stream
Density-based method has emerged as a worthwhile class for clustering data streams. Recently, a number of density-based algorithms have been developed for clustering data streams. However, existing density-based data stream clustering algorithms are not without problem. There is a dramatic decrease...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Published: |
Elsevier
2016
|
Subjects: | |
Online Access: | http://eprints.um.edu.my/18278/ https://doi.org/10.1016/j.jnca.2014.11.007 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.um.eprints.18278 |
---|---|
record_format |
eprints |
spelling |
my.um.eprints.182782017-11-16T02:46:23Z http://eprints.um.edu.my/18278/ MuDi-Stream: A multi density clustering algorithm for evolving data stream Amini, A. Saboohi, H. Herawan, T. Teh, Y.W. QA75 Electronic computers. Computer science Density-based method has emerged as a worthwhile class for clustering data streams. Recently, a number of density-based algorithms have been developed for clustering data streams. However, existing density-based data stream clustering algorithms are not without problem. There is a dramatic decrease in the quality of clustering when there is a range in density of data. In this paper, a new method, called the MuDi-Stream, is developed. It is an online-offline algorithm with four main components. In the online phase, it keeps summary information about evolving multi-density data stream in the form of core mini-clusters. The offline phase generates the final clusters using an adapted density-based clustering algorithm. The grid-based method is used as an outlier buffer to handle both noises and multi-density data and yet is used to reduce the merging time of clustering. The algorithm is evaluated on various synthetic and real-world datasets using different quality metrics and further, scalability results are compared. The experimental results show that the proposed method in this study improves clustering quality in multi-density environments. Elsevier 2016 Article PeerReviewed Amini, A. and Saboohi, H. and Herawan, T. and Teh, Y.W. (2016) MuDi-Stream: A multi density clustering algorithm for evolving data stream. Journal of Network and Computer Applications, 59. pp. 370-385. ISSN 1084-8045 https://doi.org/10.1016/j.jnca.2014.11.007 doi:10.1016/j.jnca.2014.11.007 |
institution |
Universiti Malaya |
building |
UM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Malaya |
content_source |
UM Research Repository |
url_provider |
http://eprints.um.edu.my/ |
topic |
QA75 Electronic computers. Computer science |
spellingShingle |
QA75 Electronic computers. Computer science Amini, A. Saboohi, H. Herawan, T. Teh, Y.W. MuDi-Stream: A multi density clustering algorithm for evolving data stream |
description |
Density-based method has emerged as a worthwhile class for clustering data streams. Recently, a number of density-based algorithms have been developed for clustering data streams. However, existing density-based data stream clustering algorithms are not without problem. There is a dramatic decrease in the quality of clustering when there is a range in density of data. In this paper, a new method, called the MuDi-Stream, is developed. It is an online-offline algorithm with four main components. In the online phase, it keeps summary information about evolving multi-density data stream in the form of core mini-clusters. The offline phase generates the final clusters using an adapted density-based clustering algorithm. The grid-based method is used as an outlier buffer to handle both noises and multi-density data and yet is used to reduce the merging time of clustering. The algorithm is evaluated on various synthetic and real-world datasets using different quality metrics and further, scalability results are compared. The experimental results show that the proposed method in this study improves clustering quality in multi-density environments. |
format |
Article |
author |
Amini, A. Saboohi, H. Herawan, T. Teh, Y.W. |
author_facet |
Amini, A. Saboohi, H. Herawan, T. Teh, Y.W. |
author_sort |
Amini, A. |
title |
MuDi-Stream: A multi density clustering algorithm for evolving data stream |
title_short |
MuDi-Stream: A multi density clustering algorithm for evolving data stream |
title_full |
MuDi-Stream: A multi density clustering algorithm for evolving data stream |
title_fullStr |
MuDi-Stream: A multi density clustering algorithm for evolving data stream |
title_full_unstemmed |
MuDi-Stream: A multi density clustering algorithm for evolving data stream |
title_sort |
mudi-stream: a multi density clustering algorithm for evolving data stream |
publisher |
Elsevier |
publishDate |
2016 |
url |
http://eprints.um.edu.my/18278/ https://doi.org/10.1016/j.jnca.2014.11.007 |
_version_ |
1643690660980588544 |
score |
13.209306 |