Cited 0 times in Scipus Cited Count

STALLION: A stacking-based ensemble learning framework for prokaryotic lysine acetylation site prediction

DC Field Value Language
dc.contributor.authorBasith, S-
dc.contributor.authorLee, G-
dc.contributor.authorManavalan, B-
dc.date.accessioned2023-02-13T06:22:52Z-
dc.date.available2023-02-13T06:22:52Z-
dc.date.issued2022-
dc.identifier.issn1467-5463-
dc.identifier.urihttp://repository.ajou.ac.kr/handle/201003/24432-
dc.description.abstractProtein post-translational modification (PTM) is an important regulatory mechanism that plays a key role in both normal and disease states. Acetylation on lysine residues is one of the most potent PTMs owing to its critical role in cellular metabolism and regulatory processes. Identifying protein lysine acetylation (Kace) sites is a challenging task in bioinformatics. To date, several machine learning-based methods for the in silico identification of Kace sites have been developed. Of those, a few are prokaryotic species-specific. Despite their attractive advantages and performances, these methods have certain limitations. Therefore, this study proposes a novel predictor STALLION (STacking-based Predictor for ProkAryotic Lysine AcetyLatION), containing six prokaryotic species-specific models to identify Kace sites accurately. To extract crucial patterns around Kace sites, we employed 11 different encodings representing three different characteristics. Subsequently, a systematic and rigorous feature selection approach was employed to identify the optimal feature set independently for five tree-based ensemble algorithms and built their respective baseline model for each species. Finally, the predicted values from baseline models were utilized and trained with an appropriate classifier using the stacking strategy to develop STALLION. Comparative benchmarking experiments showed that STALLION significantly outperformed existing predictor on independent tests. To expedite direct accessibility to the STALLION models, a user-friendly online predictor was implemented, which is available at: http://thegleelab.org/STALLION.-
dc.language.isoen-
dc.subject.MESHAcetylation-
dc.subject.MESHAnimals-
dc.subject.MESHComputational Biology-
dc.subject.MESHHorses-
dc.subject.MESHLysine-
dc.subject.MESHMachine Learning-
dc.subject.MESHMale-
dc.subject.MESHProkaryotic Cells-
dc.subject.MESHProtein Processing, Post-Translational-
dc.titleSTALLION: A stacking-based ensemble learning framework for prokaryotic lysine acetylation site prediction-
dc.typeArticle-
dc.identifier.pmid34532736-
dc.identifier.urlhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8769686-
dc.subject.keywordbioinformatics-
dc.subject.keywordfeature optimization-
dc.subject.keywordlysine acetylation sites-
dc.subject.keywordmachine learning-
dc.subject.keywordperformance assessment-
dc.subject.keywordstacking strategy-
dc.contributor.affiliatedAuthorBasith, S-
dc.contributor.affiliatedAuthorLee, G-
dc.contributor.affiliatedAuthorManavalan, B-
dc.type.localJournal Papers-
dc.identifier.doi10.1093/bib/bbab376-
dc.citation.titleBriefings in bioinformatics-
dc.citation.volume23-
dc.citation.number1-
dc.citation.date2022-
dc.citation.startPagebbab376-
dc.citation.endPagebbab376-
dc.identifier.bibliographicCitationBriefings in bioinformatics, 23(1). : bbab376-bbab376, 2022-
dc.identifier.eissn1477-4054-
dc.relation.journalidJ014675463-
Appears in Collections:
Journal Papers > School of Medicine / Graduate School of Medicine > Physiology
Files in This Item:
34532736.pdfDownload

qrcode

해당 아이템을 이메일로 공유하기 원하시면 인증을 거치시기 바랍니다.

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Browse