Discrimination and source correspondence of black gel inks using Raman spectroscopy and chemometric analysis with UMAP and PLS-DA

In the examination of forged documents, ink analysis plays an important role and the forensic scientist is required to opine on the origin of ink and colorants when the physical appearance is similar. Also required is to link the ink with its source as this has an important bearing in solving cases...

Full description

Saved in:
Bibliographic Details
Main Authors: Mohamad Asri, Muhammad Naeim, Verma, Rajesh, Mahat, Naji Arafat, Mohd. Nor, Nor Azman, Mat Desa, Wan Nur Syuhaila, Ismail, Dzulkiflee
Format: Article
Published: Elsevier B.V. 2022
Subjects:
Online Access:http://eprints.utm.my/103120/
http://dx.doi.org/10.1016/j.chemolab.2022.104557
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.103120
record_format eprints
spelling my.utm.1031202023-11-13T05:03:53Z http://eprints.utm.my/103120/ Discrimination and source correspondence of black gel inks using Raman spectroscopy and chemometric analysis with UMAP and PLS-DA Mohamad Asri, Muhammad Naeim Verma, Rajesh Mahat, Naji Arafat Mohd. Nor, Nor Azman Mat Desa, Wan Nur Syuhaila Ismail, Dzulkiflee QD Chemistry In the examination of forged documents, ink analysis plays an important role and the forensic scientist is required to opine on the origin of ink and colorants when the physical appearance is similar. Also required is to link the ink with its source as this has an important bearing in solving cases involving documents. In this study, we have tried to explore a relatively new type of writing instrument, the gel ink pen, which is commonly used by perpetrators of fraud. The favorable approach in ink analysis is the non-destructive technique such as Raman spectroscopy combined with chemometrics for objective and automated examinations. Most of the studies so far used unsupervised chemometrics for data exploration like PCA and HCA without any use of supervised methods for classification and source prediction of inks. In recent years, more complex unsupervised algorithms such as t-Distributed Stochastic Neighbor Embedding (t-SNE) and Uniform Manifold Approximation and Projection (UMAP) have emerged, which are frequently employed in big data scenarios. These strategies are also appropriate for the type of data we used in this study. Partial Least Square-Discriminant Analysis (PLS-DA) is a supervised classification technique commonly used to classify samples into known groups and predict the class of unknown samples. The performance of PLS-DA has been reported to be near perfect for a dataset of few classes, however, its applicability for large datasets remains to be explored. In this study, we report the application of PLS-DA for the classification of black gel ink samples (n ?= ?140) from 14 different brands. To demonstrate the applicability of PLS-DA in a forensic investigation involving unknown ink deposited on documents, we have tested 11 unknown samples to the trained PLS-DA model, and have achieved 91% correct classification rate. We also demonstrated misclassification due to large datasets can be mitigated by UMAP exploration and then applying PLS-DA to a reduced number of classes datasets. The procedure of using UMAP and PLS-DA may prove useful for unveiling the identity of black gel ink deposited on forged documents. Elsevier B.V. 2022 Article PeerReviewed Mohamad Asri, Muhammad Naeim and Verma, Rajesh and Mahat, Naji Arafat and Mohd. Nor, Nor Azman and Mat Desa, Wan Nur Syuhaila and Ismail, Dzulkiflee (2022) Discrimination and source correspondence of black gel inks using Raman spectroscopy and chemometric analysis with UMAP and PLS-DA. Chemometrics and Intelligent Laboratory Systems, 225 (NA). pp. 1-9. ISSN 0169-7439 http://dx.doi.org/10.1016/j.chemolab.2022.104557 DOI : 10.1016/j.chemolab.2022.104557
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic QD Chemistry
spellingShingle QD Chemistry
Mohamad Asri, Muhammad Naeim
Verma, Rajesh
Mahat, Naji Arafat
Mohd. Nor, Nor Azman
Mat Desa, Wan Nur Syuhaila
Ismail, Dzulkiflee
Discrimination and source correspondence of black gel inks using Raman spectroscopy and chemometric analysis with UMAP and PLS-DA
description In the examination of forged documents, ink analysis plays an important role and the forensic scientist is required to opine on the origin of ink and colorants when the physical appearance is similar. Also required is to link the ink with its source as this has an important bearing in solving cases involving documents. In this study, we have tried to explore a relatively new type of writing instrument, the gel ink pen, which is commonly used by perpetrators of fraud. The favorable approach in ink analysis is the non-destructive technique such as Raman spectroscopy combined with chemometrics for objective and automated examinations. Most of the studies so far used unsupervised chemometrics for data exploration like PCA and HCA without any use of supervised methods for classification and source prediction of inks. In recent years, more complex unsupervised algorithms such as t-Distributed Stochastic Neighbor Embedding (t-SNE) and Uniform Manifold Approximation and Projection (UMAP) have emerged, which are frequently employed in big data scenarios. These strategies are also appropriate for the type of data we used in this study. Partial Least Square-Discriminant Analysis (PLS-DA) is a supervised classification technique commonly used to classify samples into known groups and predict the class of unknown samples. The performance of PLS-DA has been reported to be near perfect for a dataset of few classes, however, its applicability for large datasets remains to be explored. In this study, we report the application of PLS-DA for the classification of black gel ink samples (n ?= ?140) from 14 different brands. To demonstrate the applicability of PLS-DA in a forensic investigation involving unknown ink deposited on documents, we have tested 11 unknown samples to the trained PLS-DA model, and have achieved 91% correct classification rate. We also demonstrated misclassification due to large datasets can be mitigated by UMAP exploration and then applying PLS-DA to a reduced number of classes datasets. The procedure of using UMAP and PLS-DA may prove useful for unveiling the identity of black gel ink deposited on forged documents.
format Article
author Mohamad Asri, Muhammad Naeim
Verma, Rajesh
Mahat, Naji Arafat
Mohd. Nor, Nor Azman
Mat Desa, Wan Nur Syuhaila
Ismail, Dzulkiflee
author_facet Mohamad Asri, Muhammad Naeim
Verma, Rajesh
Mahat, Naji Arafat
Mohd. Nor, Nor Azman
Mat Desa, Wan Nur Syuhaila
Ismail, Dzulkiflee
author_sort Mohamad Asri, Muhammad Naeim
title Discrimination and source correspondence of black gel inks using Raman spectroscopy and chemometric analysis with UMAP and PLS-DA
title_short Discrimination and source correspondence of black gel inks using Raman spectroscopy and chemometric analysis with UMAP and PLS-DA
title_full Discrimination and source correspondence of black gel inks using Raman spectroscopy and chemometric analysis with UMAP and PLS-DA
title_fullStr Discrimination and source correspondence of black gel inks using Raman spectroscopy and chemometric analysis with UMAP and PLS-DA
title_full_unstemmed Discrimination and source correspondence of black gel inks using Raman spectroscopy and chemometric analysis with UMAP and PLS-DA
title_sort discrimination and source correspondence of black gel inks using raman spectroscopy and chemometric analysis with umap and pls-da
publisher Elsevier B.V.
publishDate 2022
url http://eprints.utm.my/103120/
http://dx.doi.org/10.1016/j.chemolab.2022.104557
_version_ 1783876350649565184
score 13.188404