PubMed text data mining automation for biological validation on lists of genes and pathways
A prognostic cancer marker is helpful in oncology to identify the abnormal cancer cells from the collected sample. This marker can be used as an indicator to determine a disease outcome, cancer treatment, and drug discovery. Identifying cancer markers is also beneficial to improve cancer patients...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Penerbit UTM Press
2022
|
Subjects: | |
Online Access: | http://eprints.utm.my/108823/1/NiesHuiWen2022_PubMedTextDataMiningAutomation.pdf http://eprints.utm.my/108823/ http://dx.doi.org/10.11113/ijic.v12n1.313 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.utm.108823 |
---|---|
record_format |
eprints |
spelling |
my.utm.1088232024-12-09T07:46:51Z http://eprints.utm.my/108823/ PubMed text data mining automation for biological validation on lists of genes and pathways Nies, Hui Wen Zakaria, Zalmiyah Chan, Weng Howe Kamsani, Izyan Izzati Hasan, Nor Shahida QA75 Electronic computers. Computer science A prognostic cancer marker is helpful in oncology to identify the abnormal cancer cells from the collected sample. This marker can be used as an indicator to determine a disease outcome, cancer treatment, and drug discovery. Identifying cancer markers is also beneficial to improve cancer patients' survival rate in receiving the treatment decision-making. Cancer markers can be determined by manually testing every gene or pathway in the wet lab or using the text mining automation method. The use of text mining techniques effectively investigates hidden information and gathers new knowledge from many existing sources. Unfortunately, querying relevant text to excavate important information is a challenging task. PubMed text data mining is one of the applications that help explore potential cancer markers as the trend of scientific articles in PubMed is steadily increased. Besides, it can support biologists to concentrate on the identified small set of genes or pathways. PubMed identifiers (PMIDs) are then obtained as evidence to ascertain the relationship between diseases and genes (or pathways) used as biological validation. Thus, this technique can discover the biological relationship between disease and genes or pathways. The existing method is commonly manually curated for the biological validation of genes and pathways. Manual curation takes time in the process and may lead to inconsistency. This study aims to automate the process of biological validation of genes and pathways for PubMed text data mining. Therefore, the PubMed text data mining automation was invented to link to the websites for saving time instead of manually. A list of genes and pathways from breast cancer are used in this study. Using PubMed text data mining automation for biological context verification and validation, p53 signaling pathway and TP53 gene as prognostic cancer markers for breast cancer. Hence, the p53 signaling pathway and TP53 are associated with the development of tumour cells and DNA damage after irradiation in breast cancer. Penerbit UTM Press 2022 Article PeerReviewed application/pdf en http://eprints.utm.my/108823/1/NiesHuiWen2022_PubMedTextDataMiningAutomation.pdf Nies, Hui Wen and Zakaria, Zalmiyah and Chan, Weng Howe and Kamsani, Izyan Izzati and Hasan, Nor Shahida (2022) PubMed text data mining automation for biological validation on lists of genes and pathways. International Journal of Innovative Computing, 12 (1). pp. 59-64. ISSN 2180-4370 http://dx.doi.org/10.11113/ijic.v12n1.313 DOI : 10.11113/ijic.v12n1.313 |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
language |
English |
topic |
QA75 Electronic computers. Computer science |
spellingShingle |
QA75 Electronic computers. Computer science Nies, Hui Wen Zakaria, Zalmiyah Chan, Weng Howe Kamsani, Izyan Izzati Hasan, Nor Shahida PubMed text data mining automation for biological validation on lists of genes and pathways |
description |
A prognostic cancer marker is helpful in oncology to identify the abnormal cancer cells from the collected sample. This marker can be used as an indicator to determine a disease outcome, cancer treatment, and drug discovery. Identifying cancer markers is also beneficial to improve cancer patients' survival rate in receiving the treatment decision-making. Cancer markers can be determined by manually testing every gene or pathway in the wet lab or using the text mining automation method. The use of text mining techniques effectively investigates hidden information and gathers new knowledge from many existing sources. Unfortunately, querying relevant text to excavate important information is a challenging task. PubMed text data mining is one of the applications that help explore potential cancer markers as the trend of scientific articles in PubMed is steadily increased. Besides, it can support biologists to concentrate on the identified small set of genes or pathways. PubMed identifiers (PMIDs) are then obtained as evidence to ascertain the relationship between diseases and genes (or pathways) used as biological validation. Thus, this technique can discover the biological relationship between disease and genes or pathways. The existing method is commonly manually curated for the biological validation of genes and pathways. Manual curation takes time in the process and may lead to inconsistency. This study aims to automate the process of biological validation of genes and pathways for PubMed text data mining. Therefore, the PubMed text data mining automation was invented to link to the websites for saving time instead of manually. A list of genes and pathways from breast cancer are used in this study. Using PubMed text data mining automation for biological context verification and validation, p53 signaling pathway and TP53 gene as prognostic cancer markers for breast cancer. Hence, the p53 signaling pathway and TP53 are associated with the development of tumour cells and DNA damage after irradiation in breast cancer. |
format |
Article |
author |
Nies, Hui Wen Zakaria, Zalmiyah Chan, Weng Howe Kamsani, Izyan Izzati Hasan, Nor Shahida |
author_facet |
Nies, Hui Wen Zakaria, Zalmiyah Chan, Weng Howe Kamsani, Izyan Izzati Hasan, Nor Shahida |
author_sort |
Nies, Hui Wen |
title |
PubMed text data mining automation for biological validation on lists of genes and pathways |
title_short |
PubMed text data mining automation for biological validation on lists of genes and pathways |
title_full |
PubMed text data mining automation for biological validation on lists of genes and pathways |
title_fullStr |
PubMed text data mining automation for biological validation on lists of genes and pathways |
title_full_unstemmed |
PubMed text data mining automation for biological validation on lists of genes and pathways |
title_sort |
pubmed text data mining automation for biological validation on lists of genes and pathways |
publisher |
Penerbit UTM Press |
publishDate |
2022 |
url |
http://eprints.utm.my/108823/1/NiesHuiWen2022_PubMedTextDataMiningAutomation.pdf http://eprints.utm.my/108823/ http://dx.doi.org/10.11113/ijic.v12n1.313 |
_version_ |
1818834054174212096 |
score |
13.234133 |