Cited 0 times in Scipus Cited Count

Meta-i6mA: An interspecies predictor for identifying DNA N6-methyladenine sites of plant genomes by exploiting informative features in an integrative machine-learning framework

DC Field Value Language
dc.contributor.authorHasan, MM-
dc.contributor.authorBasith, S-
dc.contributor.authorKhatun, MS-
dc.contributor.authorLee, G-
dc.contributor.authorManavalan, B-
dc.contributor.authorKurata, H-
dc.date.accessioned2023-01-05T03:03:09Z-
dc.date.available2023-01-05T03:03:09Z-
dc.date.issued2021-
dc.identifier.issn1467-5463-
dc.identifier.urihttp://repository.ajou.ac.kr/handle/201003/23624-
dc.description.abstractDNA N6-methyladenine (6mA) represents important epigenetic modifications, which are responsible for various cellular processes. The accurate identification of 6mA sites is one of the challenging tasks in genome analysis, which leads to an understanding of their biological functions. To date, several species-specific machine learning (ML)-based models have been proposed, but majority of them did not test their model to other species. Hence, their practical application to other plant species is quite limited. In this study, we explored 10 different feature encoding schemes, with the goal of capturing key characteristics around 6mA sites. We selected five feature encoding schemes based on physicochemical and position-specific information that possesses high discriminative capability. The resultant feature sets were inputted to six commonly used ML methods (random forest, support vector machine, extremely randomized tree, logistic regression, naïve Bayes and AdaBoost). The Rosaceae genome was employed to train the above classifiers, which generated 30 baseline models. To integrate their individual strength, Meta-i6mA was proposed that combined the baseline models using the meta-predictor approach. In extensive independent test, Meta-i6mA showed high Matthews correlation coefficient values of 0.918, 0.827 and 0.635 on Rosaceae, rice and Arabidopsis thaliana, respectively and outperformed the existing predictors. We anticipate that the Meta-i6mA can be applied across different plant species. Furthermore, we developed an online user-friendly web server, which is available at http://kurata14.bio.kyutech.ac.jp/Meta-i6mA/.-
dc.language.isoen-
dc.subject.MESHAdenosine-
dc.subject.MESHAlgorithms-
dc.subject.MESHArabidopsis-
dc.subject.MESHBase Sequence-
dc.subject.MESHComputational Biology-
dc.subject.MESHDNA, Plant-
dc.subject.MESHEpigenesis, Genetic-
dc.subject.MESHGenome, Plant-
dc.subject.MESHInternet-
dc.subject.MESHMachine Learning-
dc.subject.MESHModels, Genetic-
dc.subject.MESHOryza-
dc.subject.MESHRosaceae-
dc.subject.MESHSpecies Specificity-
dc.subject.MESHSupport Vector Machine-
dc.titleMeta-i6mA: An interspecies predictor for identifying DNA N6-methyladenine sites of plant genomes by exploiting informative features in an integrative machine-learning framework-
dc.typeArticle-
dc.identifier.pmid32910169-
dc.subject.keywordDNA N6-methyladenine modification-
dc.subject.keywordfeature extraction-
dc.subject.keywordmeta-predictor-
dc.subject.keywordprediction model-
dc.subject.keywordtwo-step feature optimization-
dc.contributor.affiliatedAuthorBasith, S-
dc.contributor.affiliatedAuthorLee, G-
dc.contributor.affiliatedAuthorManavalan, B-
dc.type.localJournal Papers-
dc.identifier.doi10.1093/bib/bbaa202-
dc.citation.titleBriefings in bioinformatics-
dc.citation.volume22-
dc.citation.number3-
dc.citation.date2021-
dc.citation.startPagebbaa202-
dc.citation.endPagebbaa202-
dc.identifier.bibliographicCitationBriefings in bioinformatics, 22(3). : bbaa202-bbaa202, 2021-
dc.embargo.liftdate9999-12-31-
dc.embargo.terms9999-12-31-
dc.identifier.eissn1477-4054-
dc.relation.journalidJ014675463-
Appears in Collections:
Journal Papers > School of Medicine / Graduate School of Medicine > Physiology
Files in This Item:
There are no files associated with this item.

qrcode

해당 아이템을 이메일로 공유하기 원하시면 인증을 거치시기 바랍니다.

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Browse