Open Access Te Herenga Waka-Victoria University of Wellington

i4mC-GRU: Identifying DNA N4-Methylcytosine sites in mouse genomes using bidirectional gated recurrent unit and sequence-embedded features

journal contribution
posted on 2023-09-12, 16:06 authored by TH Nguyen-Vo, QH Trinh, Hoang Nguyen, PU Nguyen-Hoang, S Rahardja, Binh Nguyen
N4-methylcytosine (4mC) is one of the most common DNA methylation modifications found in both prokaryotic and eukaryotic genomes. Because 4mC plays several essential biological roles, determining its location helps reveal unexplored physiological and pathological pathways. In this study, we propose an effective computational method, i4mC-GRU, that uses a gated recurrent unit and duplet sequence-embedded features to predict potential 4mC sites in mouse (Mus musculus) genomes. To assess the model fairly, we compared our method with several state-of-the-art methods on two benchmark datasets. i4mC-GRU achieved area under the receiver operating characteristic curve values of 0.97 and 0.89 and area under the precision-recall curve values of 0.98 and 0.90 on the first and second benchmark datasets, respectively. In short, our method outperformed existing methods in predicting 4mC sites in mouse genomes. We also deployed i4mC-GRU as an online web server to support users in genomics studies.
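
The "duplet sequence-embedded features" mentioned in the abstract can be pictured as splitting each DNA sequence into two-nucleotide tokens and mapping them to integer indices that an embedding layer could consume. The sketch below is only an illustration under stated assumptions: the overlapping window (stride of 1), the vocabulary ordering, the use of index 0 for unrecognised duplets, and the names `DUPLET_INDEX` and `encode_duplets` are all hypothetical and are not taken from the paper.

```python
from itertools import product

# Assumed vocabulary: the 16 duplets over A, C, G, T, indexed from 1.
# Index 0 is reserved here for duplets containing any other symbol (e.g. N).
DUPLET_INDEX = {
    "".join(pair): idx
    for idx, pair in enumerate(product("ACGT", repeat=2), start=1)
}


def encode_duplets(sequence, stride=1):
    """Map a DNA sequence to a list of duplet indices.

    With stride=1 the duplets overlap (AC, CG, GT, ...); with stride=2
    they do not. Which variant the authors used is an assumption here.
    """
    seq = sequence.upper()
    return [
        DUPLET_INDEX.get(seq[i:i + 2], 0)
        for i in range(0, len(seq) - 1, stride)
    ]


tokens = encode_duplets("ACGTN")
print(tokens)
```

A sequence of length n yields n - 1 overlapping duplets, so fixed-length input windows would give fixed-length index vectors, which is what a recurrent model with an embedding layer typically expects.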

History

Preferred citation

Nguyen-Vo, T. H., Trinh, Q. H., Nguyen, L., Nguyen-Hoang, P. U., Rahardja, S. & Nguyen, B. P. (2023). i4mC-GRU: Identifying DNA N4-Methylcytosine sites in mouse genomes using bidirectional gated recurrent unit and sequence-embedded features. Computational and Structural Biotechnology Journal, 21, 3045-3053. https://doi.org/10.1016/j.csbj.2023.05.014

Journal title

Computational and Structural Biotechnology Journal

Volume

21

Publication date

2023-01-01

Pagination

3045-3053

Publisher

Elsevier BV

Publication status

Published

Online publication date

2023-05-16

ISSN

2001-0370

eISSN

2001-0370

Language

en