Discrimination and source correspondence of black gel inks using Raman spectroscopy and chemometric analysis with UMAP and PLS-DA
In the examination of forged documents, ink analysis plays an important role and the forensic scientist is required to opine on the origin of ink and colorants when the physical appearance is similar. Also required is to link the ink with its source as this has an important bearing in solving cases...
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | Article |
Published: |
Elsevier B.V.
2022
|
Subjects: | |
Online Access: | http://eprints.utm.my/103120/ http://dx.doi.org/10.1016/j.chemolab.2022.104557 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.utm.103120 |
---|---|
record_format |
eprints |
spelling |
my.utm.1031202023-11-13T05:03:53Z http://eprints.utm.my/103120/ Discrimination and source correspondence of black gel inks using Raman spectroscopy and chemometric analysis with UMAP and PLS-DA Mohamad Asri, Muhammad Naeim Verma, Rajesh Mahat, Naji Arafat Mohd. Nor, Nor Azman Mat Desa, Wan Nur Syuhaila Ismail, Dzulkiflee QD Chemistry In the examination of forged documents, ink analysis plays an important role and the forensic scientist is required to opine on the origin of ink and colorants when the physical appearance is similar. Also required is to link the ink with its source as this has an important bearing in solving cases involving documents. In this study, we have tried to explore a relatively new type of writing instrument, the gel ink pen, which is commonly used by perpetrators of fraud. The favorable approach in ink analysis is the non-destructive technique such as Raman spectroscopy combined with chemometrics for objective and automated examinations. Most of the studies so far used unsupervised chemometrics for data exploration like PCA and HCA without any use of supervised methods for classification and source prediction of inks. In recent years, more complex unsupervised algorithms such as t-Distributed Stochastic Neighbor Embedding (t-SNE) and Uniform Manifold Approximation and Projection (UMAP) have emerged, which are frequently employed in big data scenarios. These strategies are also appropriate for the type of data we used in this study. Partial Least Square-Discriminant Analysis (PLS-DA) is a supervised classification technique commonly used to classify samples into known groups and predict the class of unknown samples. The performance of PLS-DA has been reported to be near perfect for a dataset of few classes, however, its applicability for large datasets remains to be explored. In this study, we report the application of PLS-DA for the classification of black gel ink samples (n ?= ?140) from 14 different brands. To demonstrate the applicability of PLS-DA in a forensic investigation involving unknown ink deposited on documents, we have tested 11 unknown samples to the trained PLS-DA model, and have achieved 91% correct classification rate. We also demonstrated misclassification due to large datasets can be mitigated by UMAP exploration and then applying PLS-DA to a reduced number of classes datasets. The procedure of using UMAP and PLS-DA may prove useful for unveiling the identity of black gel ink deposited on forged documents. Elsevier B.V. 2022 Article PeerReviewed Mohamad Asri, Muhammad Naeim and Verma, Rajesh and Mahat, Naji Arafat and Mohd. Nor, Nor Azman and Mat Desa, Wan Nur Syuhaila and Ismail, Dzulkiflee (2022) Discrimination and source correspondence of black gel inks using Raman spectroscopy and chemometric analysis with UMAP and PLS-DA. Chemometrics and Intelligent Laboratory Systems, 225 (NA). pp. 1-9. ISSN 0169-7439 http://dx.doi.org/10.1016/j.chemolab.2022.104557 DOI : 10.1016/j.chemolab.2022.104557 |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
topic |
QD Chemistry |
spellingShingle |
QD Chemistry Mohamad Asri, Muhammad Naeim Verma, Rajesh Mahat, Naji Arafat Mohd. Nor, Nor Azman Mat Desa, Wan Nur Syuhaila Ismail, Dzulkiflee Discrimination and source correspondence of black gel inks using Raman spectroscopy and chemometric analysis with UMAP and PLS-DA |
description |
In the examination of forged documents, ink analysis plays an important role and the forensic scientist is required to opine on the origin of ink and colorants when the physical appearance is similar. Also required is to link the ink with its source as this has an important bearing in solving cases involving documents. In this study, we have tried to explore a relatively new type of writing instrument, the gel ink pen, which is commonly used by perpetrators of fraud. The favorable approach in ink analysis is the non-destructive technique such as Raman spectroscopy combined with chemometrics for objective and automated examinations. Most of the studies so far used unsupervised chemometrics for data exploration like PCA and HCA without any use of supervised methods for classification and source prediction of inks. In recent years, more complex unsupervised algorithms such as t-Distributed Stochastic Neighbor Embedding (t-SNE) and Uniform Manifold Approximation and Projection (UMAP) have emerged, which are frequently employed in big data scenarios. These strategies are also appropriate for the type of data we used in this study. Partial Least Square-Discriminant Analysis (PLS-DA) is a supervised classification technique commonly used to classify samples into known groups and predict the class of unknown samples. The performance of PLS-DA has been reported to be near perfect for a dataset of few classes, however, its applicability for large datasets remains to be explored. In this study, we report the application of PLS-DA for the classification of black gel ink samples (n ?= ?140) from 14 different brands. To demonstrate the applicability of PLS-DA in a forensic investigation involving unknown ink deposited on documents, we have tested 11 unknown samples to the trained PLS-DA model, and have achieved 91% correct classification rate. We also demonstrated misclassification due to large datasets can be mitigated by UMAP exploration and then applying PLS-DA to a reduced number of classes datasets. The procedure of using UMAP and PLS-DA may prove useful for unveiling the identity of black gel ink deposited on forged documents. |
format |
Article |
author |
Mohamad Asri, Muhammad Naeim Verma, Rajesh Mahat, Naji Arafat Mohd. Nor, Nor Azman Mat Desa, Wan Nur Syuhaila Ismail, Dzulkiflee |
author_facet |
Mohamad Asri, Muhammad Naeim Verma, Rajesh Mahat, Naji Arafat Mohd. Nor, Nor Azman Mat Desa, Wan Nur Syuhaila Ismail, Dzulkiflee |
author_sort |
Mohamad Asri, Muhammad Naeim |
title |
Discrimination and source correspondence of black gel inks using Raman spectroscopy and chemometric analysis with UMAP and PLS-DA |
title_short |
Discrimination and source correspondence of black gel inks using Raman spectroscopy and chemometric analysis with UMAP and PLS-DA |
title_full |
Discrimination and source correspondence of black gel inks using Raman spectroscopy and chemometric analysis with UMAP and PLS-DA |
title_fullStr |
Discrimination and source correspondence of black gel inks using Raman spectroscopy and chemometric analysis with UMAP and PLS-DA |
title_full_unstemmed |
Discrimination and source correspondence of black gel inks using Raman spectroscopy and chemometric analysis with UMAP and PLS-DA |
title_sort |
discrimination and source correspondence of black gel inks using raman spectroscopy and chemometric analysis with umap and pls-da |
publisher |
Elsevier B.V. |
publishDate |
2022 |
url |
http://eprints.utm.my/103120/ http://dx.doi.org/10.1016/j.chemolab.2022.104557 |
_version_ |
1783876350649565184 |
score |
13.188404 |