MuDi-Stream: A multi density clustering algorithm for evolving data stream

Density-based method has emerged as a worthwhile class for clustering data streams. Recently, a number of density-based algorithms have been developed for clustering data streams. However, existing density-based data stream clustering algorithms are not without problem. There is a dramatic decrease...

Full description

Saved in:
Bibliographic Details
Main Authors: Amini, A., Saboohi, H., Herawan, T., Teh, Y.W.
Format: Article
Published: Elsevier 2016
Subjects:
Online Access:http://eprints.um.edu.my/18278/
https://doi.org/10.1016/j.jnca.2014.11.007
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.um.eprints.18278
record_format eprints
spelling my.um.eprints.182782017-11-16T02:46:23Z http://eprints.um.edu.my/18278/ MuDi-Stream: A multi density clustering algorithm for evolving data stream Amini, A. Saboohi, H. Herawan, T. Teh, Y.W. QA75 Electronic computers. Computer science Density-based method has emerged as a worthwhile class for clustering data streams. Recently, a number of density-based algorithms have been developed for clustering data streams. However, existing density-based data stream clustering algorithms are not without problem. There is a dramatic decrease in the quality of clustering when there is a range in density of data. In this paper, a new method, called the MuDi-Stream, is developed. It is an online-offline algorithm with four main components. In the online phase, it keeps summary information about evolving multi-density data stream in the form of core mini-clusters. The offline phase generates the final clusters using an adapted density-based clustering algorithm. The grid-based method is used as an outlier buffer to handle both noises and multi-density data and yet is used to reduce the merging time of clustering. The algorithm is evaluated on various synthetic and real-world datasets using different quality metrics and further, scalability results are compared. The experimental results show that the proposed method in this study improves clustering quality in multi-density environments. Elsevier 2016 Article PeerReviewed Amini, A. and Saboohi, H. and Herawan, T. and Teh, Y.W. (2016) MuDi-Stream: A multi density clustering algorithm for evolving data stream. Journal of Network and Computer Applications, 59. pp. 370-385. ISSN 1084-8045 https://doi.org/10.1016/j.jnca.2014.11.007 doi:10.1016/j.jnca.2014.11.007
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Research Repository
url_provider http://eprints.um.edu.my/
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Amini, A.
Saboohi, H.
Herawan, T.
Teh, Y.W.
MuDi-Stream: A multi density clustering algorithm for evolving data stream
description Density-based method has emerged as a worthwhile class for clustering data streams. Recently, a number of density-based algorithms have been developed for clustering data streams. However, existing density-based data stream clustering algorithms are not without problem. There is a dramatic decrease in the quality of clustering when there is a range in density of data. In this paper, a new method, called the MuDi-Stream, is developed. It is an online-offline algorithm with four main components. In the online phase, it keeps summary information about evolving multi-density data stream in the form of core mini-clusters. The offline phase generates the final clusters using an adapted density-based clustering algorithm. The grid-based method is used as an outlier buffer to handle both noises and multi-density data and yet is used to reduce the merging time of clustering. The algorithm is evaluated on various synthetic and real-world datasets using different quality metrics and further, scalability results are compared. The experimental results show that the proposed method in this study improves clustering quality in multi-density environments.
format Article
author Amini, A.
Saboohi, H.
Herawan, T.
Teh, Y.W.
author_facet Amini, A.
Saboohi, H.
Herawan, T.
Teh, Y.W.
author_sort Amini, A.
title MuDi-Stream: A multi density clustering algorithm for evolving data stream
title_short MuDi-Stream: A multi density clustering algorithm for evolving data stream
title_full MuDi-Stream: A multi density clustering algorithm for evolving data stream
title_fullStr MuDi-Stream: A multi density clustering algorithm for evolving data stream
title_full_unstemmed MuDi-Stream: A multi density clustering algorithm for evolving data stream
title_sort mudi-stream: a multi density clustering algorithm for evolving data stream
publisher Elsevier
publishDate 2016
url http://eprints.um.edu.my/18278/
https://doi.org/10.1016/j.jnca.2014.11.007
_version_ 1643690660980588544
score 13.160551