Comparisons of Enhancers Associated Marks Prediction Using K-mer Feature

Bibliographic Details
Main Authors: Nazeri, Sina; Lee, Nung Kion; Norwati, Mustapha
Format: Conference or Workshop Item
Language: English
Published: 2015
Online Access: http://ir.unimas.my/id/eprint/11940/1/Comparisons%20of%20Enhancers_abstract.pdf
http://ir.unimas.my/id/eprint/11940/
http://www.cita.my/cita2015/docs/shortpaper/69.pdf
Description
Summary: Epigenetic signatures such as chromatin and histone modification marks are prominent indicators of enhancer motif regions. While many works have used k-mers as features of epigenetic sequences, no comprehensive study has compared how different choices of the k-mer feature parameter affect the performance of machine learning algorithms. Furthermore, it is not known how effective the k-mer feature is for representing the different epigenetic marks H3K4me1, DHS and p300. In this paper, a comparative study is performed to determine the accuracy, sensitivity and specificity of using the k-mer feature to predict these marks. Our results show that classifiers perform better when the k-mer length is between 4 and 6. Short k-mers give poor accuracy, sensitivity and specificity. The k-mer feature works best for DHS sequences and has low accuracy for H3K4me1 sequence prediction. The k-mer feature also performs poorly on the specificity of DHS sequences. It can be concluded that there is still much room for improvement in identifying better features for representing epigenetic marks for enhancer prediction.
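
For readers unfamiliar with the terms in the summary, the following is a minimal sketch of how a k-mer feature vector and the three reported measures (accuracy, sensitivity, specificity) might be computed. It is not the authors' implementation: the record does not describe their preprocessing or classifiers, so the overlapping-window counting, the frequency normalization, and the function names `kmer_counts`, `kmer_vector` and `evaluate` are all assumptions made for illustration.

```python
from itertools import product

ALPHABET = "ACGT"  # assumption: plain DNA alphabet, ambiguous bases ignored


def kmer_counts(sequence, k):
    """Count every possible DNA k-mer using overlapping windows (hypothetical helper)."""
    counts = {"".join(p): 0 for p in product(ALPHABET, repeat=k)}
    seq = sequence.upper()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if kmer in counts:  # skip windows containing N or other symbols
            counts[kmer] += 1
    return counts


def kmer_vector(sequence, k):
    """Return normalized k-mer frequencies in lexicographic k-mer order.

    The vector has 4**k entries, so its size grows quickly with k.
    """
    counts = kmer_counts(sequence, k)
    total = sum(counts.values())
    return [counts[key] / total if total else 0.0 for key in sorted(counts)]


def evaluate(y_true, y_pred):
    """Return (accuracy, sensitivity, specificity) for binary labels 0/1."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0  # true positive rate
    specificity = tn / (tn + fp) if (tn + fp) else 0.0  # true negative rate
    return accuracy, sensitivity, specificity
```

Under these assumptions, a sequence is turned into a fixed-length vector with `kmer_vector(seq, k)` for a chosen k, the vectors are passed to any classifier, and the predictions are scored with `evaluate`. Reporting sensitivity and specificity separately matters because a classifier can reach high accuracy while performing poorly on one class, which is the pattern the summary describes for DHS sequences.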