PubMed text data mining automation for biological validation on lists of genes and pathways

A prognostic cancer marker is helpful in oncology to identify the abnormal cancer cells from the collected sample. This marker can be used as an indicator to determine a disease outcome, cancer treatment, and drug discovery. Identifying cancer markers is also beneficial to improve cancer patients�...

Full description

Saved in:
Bibliographic Details
Main Authors: Nies, Hui Wen, Zakaria, Zalmiyah, Chan, Weng Howe, Kamsani, Izyan Izzati, Hasan, Nor Shahida
Format: Article
Language:English
Published: Penerbit UTM Press 2022
Subjects:
Online Access:http://eprints.utm.my/108823/1/NiesHuiWen2022_PubMedTextDataMiningAutomation.pdf
http://eprints.utm.my/108823/
http://dx.doi.org/10.11113/ijic.v12n1.313
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.108823
record_format eprints
spelling my.utm.1088232024-12-09T07:46:51Z http://eprints.utm.my/108823/ PubMed text data mining automation for biological validation on lists of genes and pathways Nies, Hui Wen Zakaria, Zalmiyah Chan, Weng Howe Kamsani, Izyan Izzati Hasan, Nor Shahida QA75 Electronic computers. Computer science A prognostic cancer marker is helpful in oncology to identify the abnormal cancer cells from the collected sample. This marker can be used as an indicator to determine a disease outcome, cancer treatment, and drug discovery. Identifying cancer markers is also beneficial to improve cancer patients' survival rate in receiving the treatment decision-making. Cancer markers can be determined by manually testing every gene or pathway in the wet lab or using the text mining automation method. The use of text mining techniques effectively investigates hidden information and gathers new knowledge from many existing sources. Unfortunately, querying relevant text to excavate important information is a challenging task. PubMed text data mining is one of the applications that help explore potential cancer markers as the trend of scientific articles in PubMed is steadily increased. Besides, it can support biologists to concentrate on the identified small set of genes or pathways. PubMed identifiers (PMIDs) are then obtained as evidence to ascertain the relationship between diseases and genes (or pathways) used as biological validation. Thus, this technique can discover the biological relationship between disease and genes or pathways. The existing method is commonly manually curated for the biological validation of genes and pathways. Manual curation takes time in the process and may lead to inconsistency. This study aims to automate the process of biological validation of genes and pathways for PubMed text data mining. Therefore, the PubMed text data mining automation was invented to link to the websites for saving time instead of manually. A list of genes and pathways from breast cancer are used in this study. Using PubMed text data mining automation for biological context verification and validation, p53 signaling pathway and TP53 gene as prognostic cancer markers for breast cancer. Hence, the p53 signaling pathway and TP53 are associated with the development of tumour cells and DNA damage after irradiation in breast cancer. Penerbit UTM Press 2022 Article PeerReviewed application/pdf en http://eprints.utm.my/108823/1/NiesHuiWen2022_PubMedTextDataMiningAutomation.pdf Nies, Hui Wen and Zakaria, Zalmiyah and Chan, Weng Howe and Kamsani, Izyan Izzati and Hasan, Nor Shahida (2022) PubMed text data mining automation for biological validation on lists of genes and pathways. International Journal of Innovative Computing, 12 (1). pp. 59-64. ISSN 2180-4370 http://dx.doi.org/10.11113/ijic.v12n1.313 DOI : 10.11113/ijic.v12n1.313
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Nies, Hui Wen
Zakaria, Zalmiyah
Chan, Weng Howe
Kamsani, Izyan Izzati
Hasan, Nor Shahida
PubMed text data mining automation for biological validation on lists of genes and pathways
description A prognostic cancer marker is helpful in oncology to identify the abnormal cancer cells from the collected sample. This marker can be used as an indicator to determine a disease outcome, cancer treatment, and drug discovery. Identifying cancer markers is also beneficial to improve cancer patients' survival rate in receiving the treatment decision-making. Cancer markers can be determined by manually testing every gene or pathway in the wet lab or using the text mining automation method. The use of text mining techniques effectively investigates hidden information and gathers new knowledge from many existing sources. Unfortunately, querying relevant text to excavate important information is a challenging task. PubMed text data mining is one of the applications that help explore potential cancer markers as the trend of scientific articles in PubMed is steadily increased. Besides, it can support biologists to concentrate on the identified small set of genes or pathways. PubMed identifiers (PMIDs) are then obtained as evidence to ascertain the relationship between diseases and genes (or pathways) used as biological validation. Thus, this technique can discover the biological relationship between disease and genes or pathways. The existing method is commonly manually curated for the biological validation of genes and pathways. Manual curation takes time in the process and may lead to inconsistency. This study aims to automate the process of biological validation of genes and pathways for PubMed text data mining. Therefore, the PubMed text data mining automation was invented to link to the websites for saving time instead of manually. A list of genes and pathways from breast cancer are used in this study. Using PubMed text data mining automation for biological context verification and validation, p53 signaling pathway and TP53 gene as prognostic cancer markers for breast cancer. Hence, the p53 signaling pathway and TP53 are associated with the development of tumour cells and DNA damage after irradiation in breast cancer.
format Article
author Nies, Hui Wen
Zakaria, Zalmiyah
Chan, Weng Howe
Kamsani, Izyan Izzati
Hasan, Nor Shahida
author_facet Nies, Hui Wen
Zakaria, Zalmiyah
Chan, Weng Howe
Kamsani, Izyan Izzati
Hasan, Nor Shahida
author_sort Nies, Hui Wen
title PubMed text data mining automation for biological validation on lists of genes and pathways
title_short PubMed text data mining automation for biological validation on lists of genes and pathways
title_full PubMed text data mining automation for biological validation on lists of genes and pathways
title_fullStr PubMed text data mining automation for biological validation on lists of genes and pathways
title_full_unstemmed PubMed text data mining automation for biological validation on lists of genes and pathways
title_sort pubmed text data mining automation for biological validation on lists of genes and pathways
publisher Penerbit UTM Press
publishDate 2022
url http://eprints.utm.my/108823/1/NiesHuiWen2022_PubMedTextDataMiningAutomation.pdf
http://eprints.utm.my/108823/
http://dx.doi.org/10.11113/ijic.v12n1.313
_version_ 1818834054174212096
score 13.234133