Open Access Te Herenga Waka-Victoria University of Wellington
Browse

IProDNA-CapsNet: Identifying protein-DNA binding residues using capsule neural networks

journal contribution
posted on 2021-03-26, 10:13 authored by Binh NguyenBinh Nguyen, QH Nguyen, GN Doan-Ngoc, TH Nguyen-Vo, S Rahardja
© 2019 The Author(s). Background: Since protein-DNA interactions are highly essential to diverse biological events, accurately positioning the location of the DNA-binding residues is necessary. This biological issue, however, is currently a challenging task in the age of post-genomic where data on protein sequences have expanded very fast. In this study, we propose iProDNA-CapsNet - a new prediction model identifying protein-DNA binding residues using an ensemble of capsule neural networks (CapsNets) on position specific scoring matrix (PSMM) profiles. The use of CapsNets promises an innovative approach to determine the location of DNA-binding residues. In this study, the benchmark datasets introduced by Hu et al. (2017), i.e., PDNA-543 and PDNA-TEST, were used to train and evaluate the model, respectively. To fairly assess the model performance, comparative analysis between iProDNA-CapsNet and existing state-of-the-art methods was done. Results: Under the decision threshold corresponding to false positive rate (FPR) ≈ 5%, the accuracy, sensitivity, precision, and Matthews's correlation coefficient (MCC) of our model is increased by about 2.0%, 2.0%, 14.0%, and 5.0% with respect to TargetDNA (Hu et al., 2017) and 1.0%, 75.0%, 45.0%, and 77.0% with respect to BindN+ (Wang et al., 2010), respectively. With regards to other methods not reporting their threshold settings, iProDNA-CapsNet also shows a significant improvement in performance based on most of the evaluation metrics. Even with different patterns of change among the models, iProDNA-CapsNets remains to be the best model having top performance in most of the metrics, especially MCC which is boosted from about 8.0% to 220.0%. Conclusions: According to all evaluation metrics under various decision thresholds, iProDNA-CapsNet shows better performance compared to the two current best models (BindN and TargetDNA). Our proposed approach also shows that CapsNet can potentially be used and adopted in other biological applications.

History

Preferred citation

Nguyen, B. P., Nguyen, Q. H., Doan-Ngoc, G. N., Nguyen-Vo, T. H. & Rahardja, S. (2019). IProDNA-CapsNet: Identifying protein-DNA binding residues using capsule neural networks. BMC Bioinformatics, 20(S23), 634-. https://doi.org/10.1186/s12859-019-3295-2

Journal title

BMC Bioinformatics

Volume

20

Issue

S23

Publication date

2019-12-27

Pagination

634

Publisher

Springer Science and Business Media LLC

Publication status

Published

Online publication date

2019-12-27

ISSN

1471-2105

eISSN

1471-2105

Article number

634

Language

en

Usage metrics

    Journal articles

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC