Clustering uncertain data objects using jeffreys-divergence and maximum bipartite matching based similarity measure
In recent years, uncertain data clustering has become the subject of active research in many fields, for example, pattern recognition, and machine learning. Nowadays, researchers have committed themselves to substitute the traditional distance or similarity measures with new metrics in the existing...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Institute of Electrical and Electronics Engineers Inc.
2021
|
Subjects: | |
Online Access: | http://eprints.utm.my/id/eprint/95349/1/AnisYazidi2021_ClusteringUncertainDataObjects.pdf http://eprints.utm.my/id/eprint/95349/ http://dx.doi.org/10.1109/ACCESS.2021.3083969 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.utm.95349 |
---|---|
record_format |
eprints |
spelling |
my.utm.953492022-04-29T22:32:20Z http://eprints.utm.my/id/eprint/95349/ Clustering uncertain data objects using jeffreys-divergence and maximum bipartite matching based similarity measure Sharma, K. K. Seal, A. Yazidi, A. Selamat, A. Krejcar, O. T Technology (General) In recent years, uncertain data clustering has become the subject of active research in many fields, for example, pattern recognition, and machine learning. Nowadays, researchers have committed themselves to substitute the traditional distance or similarity measures with new metrics in the existing centralized clustering algorithms in order to tackle uncertainty in data. However, in order to perform uncertain data clustering, representation plays an imperative role. In this paper, a Monte-Carlo integration is adopted and modified to express uncertain data in a probabilistic form. Then three similarity measures are used to determine the closeness between two probability distributions including one novel measure. These similarity measures are derived from the notion of Kullback-Leibler divergence and Jeffreys divergence. Finally, density-based spatial clustering of applications with noise and $k$ -medoids algorithms are modified and implemented on one synthetic database and three real-world uncertain databases. The obtained outcomes confirm that the proposed clustering technique defeats some of the existing algorithms. Institute of Electrical and Electronics Engineers Inc. 2021 Article PeerReviewed application/pdf en http://eprints.utm.my/id/eprint/95349/1/AnisYazidi2021_ClusteringUncertainDataObjects.pdf Sharma, K. K. and Seal, A. and Yazidi, A. and Selamat, A. and Krejcar, O. (2021) Clustering uncertain data objects using jeffreys-divergence and maximum bipartite matching based similarity measure. IEEE Access, 9 . ISSN 2169-3536 http://dx.doi.org/10.1109/ACCESS.2021.3083969 DOI: 10.1109/ACCESS.2021.3083969 |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
language |
English |
topic |
T Technology (General) |
spellingShingle |
T Technology (General) Sharma, K. K. Seal, A. Yazidi, A. Selamat, A. Krejcar, O. Clustering uncertain data objects using jeffreys-divergence and maximum bipartite matching based similarity measure |
description |
In recent years, uncertain data clustering has become the subject of active research in many fields, for example, pattern recognition, and machine learning. Nowadays, researchers have committed themselves to substitute the traditional distance or similarity measures with new metrics in the existing centralized clustering algorithms in order to tackle uncertainty in data. However, in order to perform uncertain data clustering, representation plays an imperative role. In this paper, a Monte-Carlo integration is adopted and modified to express uncertain data in a probabilistic form. Then three similarity measures are used to determine the closeness between two probability distributions including one novel measure. These similarity measures are derived from the notion of Kullback-Leibler divergence and Jeffreys divergence. Finally, density-based spatial clustering of applications with noise and $k$ -medoids algorithms are modified and implemented on one synthetic database and three real-world uncertain databases. The obtained outcomes confirm that the proposed clustering technique defeats some of the existing algorithms. |
format |
Article |
author |
Sharma, K. K. Seal, A. Yazidi, A. Selamat, A. Krejcar, O. |
author_facet |
Sharma, K. K. Seal, A. Yazidi, A. Selamat, A. Krejcar, O. |
author_sort |
Sharma, K. K. |
title |
Clustering uncertain data objects using jeffreys-divergence and maximum bipartite matching based similarity measure |
title_short |
Clustering uncertain data objects using jeffreys-divergence and maximum bipartite matching based similarity measure |
title_full |
Clustering uncertain data objects using jeffreys-divergence and maximum bipartite matching based similarity measure |
title_fullStr |
Clustering uncertain data objects using jeffreys-divergence and maximum bipartite matching based similarity measure |
title_full_unstemmed |
Clustering uncertain data objects using jeffreys-divergence and maximum bipartite matching based similarity measure |
title_sort |
clustering uncertain data objects using jeffreys-divergence and maximum bipartite matching based similarity measure |
publisher |
Institute of Electrical and Electronics Engineers Inc. |
publishDate |
2021 |
url |
http://eprints.utm.my/id/eprint/95349/1/AnisYazidi2021_ClusteringUncertainDataObjects.pdf http://eprints.utm.my/id/eprint/95349/ http://dx.doi.org/10.1109/ACCESS.2021.3083969 |
_version_ |
1732945462223175680 |
score |
13.211869 |