Clustering uncertain data objects using jeffreys-divergence and maximum bipartite matching based similarity measure

In recent years, uncertain data clustering has become the subject of active research in many fields, for example, pattern recognition, and machine learning. Nowadays, researchers have committed themselves to substitute the traditional distance or similarity measures with new metrics in the existing...

Full description

Saved in:
Bibliographic Details
Main Authors: Sharma, K. K., Seal, A., Yazidi, A., Selamat, A., Krejcar, O.
Format: Article
Language:English
Published: Institute of Electrical and Electronics Engineers Inc. 2021
Subjects:
Online Access:http://eprints.utm.my/id/eprint/95349/1/AnisYazidi2021_ClusteringUncertainDataObjects.pdf
http://eprints.utm.my/id/eprint/95349/
http://dx.doi.org/10.1109/ACCESS.2021.3083969
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.95349
record_format eprints
spelling my.utm.953492022-04-29T22:32:20Z http://eprints.utm.my/id/eprint/95349/ Clustering uncertain data objects using jeffreys-divergence and maximum bipartite matching based similarity measure Sharma, K. K. Seal, A. Yazidi, A. Selamat, A. Krejcar, O. T Technology (General) In recent years, uncertain data clustering has become the subject of active research in many fields, for example, pattern recognition, and machine learning. Nowadays, researchers have committed themselves to substitute the traditional distance or similarity measures with new metrics in the existing centralized clustering algorithms in order to tackle uncertainty in data. However, in order to perform uncertain data clustering, representation plays an imperative role. In this paper, a Monte-Carlo integration is adopted and modified to express uncertain data in a probabilistic form. Then three similarity measures are used to determine the closeness between two probability distributions including one novel measure. These similarity measures are derived from the notion of Kullback-Leibler divergence and Jeffreys divergence. Finally, density-based spatial clustering of applications with noise and $k$ -medoids algorithms are modified and implemented on one synthetic database and three real-world uncertain databases. The obtained outcomes confirm that the proposed clustering technique defeats some of the existing algorithms. Institute of Electrical and Electronics Engineers Inc. 2021 Article PeerReviewed application/pdf en http://eprints.utm.my/id/eprint/95349/1/AnisYazidi2021_ClusteringUncertainDataObjects.pdf Sharma, K. K. and Seal, A. and Yazidi, A. and Selamat, A. and Krejcar, O. (2021) Clustering uncertain data objects using jeffreys-divergence and maximum bipartite matching based similarity measure. IEEE Access, 9 . ISSN 2169-3536 http://dx.doi.org/10.1109/ACCESS.2021.3083969 DOI: 10.1109/ACCESS.2021.3083969
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic T Technology (General)
spellingShingle T Technology (General)
Sharma, K. K.
Seal, A.
Yazidi, A.
Selamat, A.
Krejcar, O.
Clustering uncertain data objects using jeffreys-divergence and maximum bipartite matching based similarity measure
description In recent years, uncertain data clustering has become the subject of active research in many fields, for example, pattern recognition, and machine learning. Nowadays, researchers have committed themselves to substitute the traditional distance or similarity measures with new metrics in the existing centralized clustering algorithms in order to tackle uncertainty in data. However, in order to perform uncertain data clustering, representation plays an imperative role. In this paper, a Monte-Carlo integration is adopted and modified to express uncertain data in a probabilistic form. Then three similarity measures are used to determine the closeness between two probability distributions including one novel measure. These similarity measures are derived from the notion of Kullback-Leibler divergence and Jeffreys divergence. Finally, density-based spatial clustering of applications with noise and $k$ -medoids algorithms are modified and implemented on one synthetic database and three real-world uncertain databases. The obtained outcomes confirm that the proposed clustering technique defeats some of the existing algorithms.
format Article
author Sharma, K. K.
Seal, A.
Yazidi, A.
Selamat, A.
Krejcar, O.
author_facet Sharma, K. K.
Seal, A.
Yazidi, A.
Selamat, A.
Krejcar, O.
author_sort Sharma, K. K.
title Clustering uncertain data objects using jeffreys-divergence and maximum bipartite matching based similarity measure
title_short Clustering uncertain data objects using jeffreys-divergence and maximum bipartite matching based similarity measure
title_full Clustering uncertain data objects using jeffreys-divergence and maximum bipartite matching based similarity measure
title_fullStr Clustering uncertain data objects using jeffreys-divergence and maximum bipartite matching based similarity measure
title_full_unstemmed Clustering uncertain data objects using jeffreys-divergence and maximum bipartite matching based similarity measure
title_sort clustering uncertain data objects using jeffreys-divergence and maximum bipartite matching based similarity measure
publisher Institute of Electrical and Electronics Engineers Inc.
publishDate 2021
url http://eprints.utm.my/id/eprint/95349/1/AnisYazidi2021_ClusteringUncertainDataObjects.pdf
http://eprints.utm.my/id/eprint/95349/
http://dx.doi.org/10.1109/ACCESS.2021.3083969
_version_ 1732945462223175680
score 13.211869