i4mC-GRU: Identifying DNA N4-Methylcytosine sites in mouse genomes using bidirectional gated recurrent unit and sequence-embedded features
journal contribution
posted on 2023-09-12, 16:06 authored by TH Nguyen-Vo, QH Trinh, Hoang Nguyen, PU Nguyen-Hoang, S Rahardja, Binh Nguyen
N4-methylcytosine (4mC) is one of the most common DNA methylation modifications found in both prokaryotic and eukaryotic genomes. Because 4mC has various essential biological roles, determining its location helps reveal unexplored physiological and pathological pathways. In this study, we propose an effective computational method called i4mC-GRU, which uses a gated recurrent unit and duplet sequence-embedded features to predict potential 4mC sites in mouse (Mus musculus) genomes. To assess the model's performance fairly, we compared our method with several state-of-the-art methods on two different benchmark datasets. i4mC-GRU achieved area under the receiver operating characteristic curve values of 0.97 and 0.89 and area under the precision-recall curve values of 0.98 and 0.90 on the first and second benchmark datasets, respectively. In brief, our method outperformed existing methods in predicting 4mC sites in mouse genomes. We also deployed i4mC-GRU as an online web server to support users in genomics studies.
History
Preferred citation
Nguyen-Vo, T. H., Trinh, Q. H., Nguyen, L., Nguyen-Hoang, P. U., Rahardja, S. & Nguyen, B. P. (2023). i4mC-GRU: Identifying DNA N4-Methylcytosine sites in mouse genomes using bidirectional gated recurrent unit and sequence-embedded features. Computational and Structural Biotechnology Journal, 21, 3045-3053. https://doi.org/10.1016/j.csbj.2023.05.014
Publisher DOI
https://doi.org/10.1016/j.csbj.2023.05.014
Journal title
Computational and Structural Biotechnology Journal
Volume
21
Publication date
2023-01-01
Pagination
3045-3053
Publisher
Elsevier BV
Publication status
Published
Online publication date
2023-05-16
ISSN
2001-0370
eISSN
2001-0370
Language
en
Categories
Keywords
DNA; N4-methylcytosine; Epigenetics; Deep learning; Bidirectional gated recurrent unit; Sequence-embedded features; 46 Information and Computing Sciences; 31 Biological Sciences; 3102 Bioinformatics and Computational Biology; Human Genome; Genetics; Biotechnology; 4903 Numerical and computational mathematics; 4613 Theory of computation; 3101 Biochemistry and cell biology; 4601 Applied computing