AJOU Open Repository: Computational prediction and interpretation of cell-specific replication origin sites from multiple eukaryotes by exploiting stacking framework

Computational prediction and interpretation of cell-specific replication origin sites from multiple eukaryotes by exploiting stacking framework

DC Field	Value	Language
dc.contributor.author	Wei, L	-
dc.contributor.author	He, W	-
dc.contributor.author	Malik, A	-
dc.contributor.author	Su, R	-
dc.contributor.author	Cui, L	-
dc.contributor.author	Manavalan, B	-
dc.date.accessioned	2023-01-05T03:03:12Z	-
dc.date.available	2023-01-05T03:03:12Z	-
dc.date.issued	2021	-
dc.identifier.issn	1467-5463	-
dc.identifier.uri	http://repository.ajou.ac.kr/handle/201003/23635	-
dc.description.abstract	Origins of replication sites (ORIs), the locations at which genomic DNA replication initiates, play essential roles in the DNA replication process. Detecting the genome-scale distribution of ORIs is a key step toward an in-depth understanding of their regulatory mechanisms. In this study, we present a novel machine learning-based approach called Stack-ORI, encompassing 10 cell-specific prediction models for identifying ORIs from four eukaryotic species (Homo sapiens, Mus musculus, Drosophila melanogaster and Arabidopsis thaliana). For each cell-specific model, we employed 12 feature encoding schemes that cover nucleic acid composition, position-specific and physicochemical property information. The optimal feature set was identified for each encoding individually and used to develop the respective baseline model with the eXtreme Gradient Boosting (XGBoost) classifier. Subsequently, the predicted scores of the 12 baseline models were integrated into a novel feature vector to train XGBoost and develop the final model. Extensive experimental results show that Stack-ORI achieves significantly better performance than its baseline models on both training and independent datasets. Interestingly, Stack-ORI consistently outperforms the existing predictor in all cell-specific models, on both the training and the independent test datasets. Moreover, our approach provides interpretations that help explain the model's success by leveraging the SHapley Additive exPlanation algorithm, thus highlighting the feature encoding schemes most important for predicting cell-specific ORIs.	-
dc.language.iso	en	-
dc.subject.MESH	Animals	-
dc.subject.MESH	Databases, Nucleic Acid	-
dc.subject.MESH	Drosophila melanogaster	-
dc.subject.MESH	Humans	-
dc.subject.MESH	Mice	-
dc.subject.MESH	Models, Genetic	-
dc.subject.MESH	Replication Origin	-
dc.subject.MESH	Support Vector Machine	-
dc.subject.MESH	Transcription, Genetic	-
dc.title	Computational prediction and interpretation of cell-specific replication origin sites from multiple eukaryotes by exploiting stacking framework	-
dc.type	Article	-
dc.identifier.pmid	33152766	-
dc.subject.keyword	eXtreme Gradient Boosting	-
dc.subject.keyword	feature extraction	-
dc.subject.keyword	model interpretability	-
dc.subject.keyword	origin of replication site	-
dc.subject.keyword	stacking strategy	-
dc.contributor.affiliatedAuthor	Manavalan, B	-
dc.type.local	Journal Papers	-
dc.identifier.doi	10.1093/bib/bbaa275	-
dc.citation.title	Briefings in bioinformatics	-
dc.citation.volume	22	-
dc.citation.number	4	-
dc.citation.date	2021	-
dc.citation.startPage	bbaa275	-
dc.citation.endPage	bbaa275	-
dc.identifier.bibliographicCitation	Briefings in bioinformatics, 22(4). : bbaa275-bbaa275, 2021	-
dc.embargo.liftdate	9999-12-31	-
dc.embargo.terms	9999-12-31	-
dc.identifier.eissn	1477-4054	-
dc.relation.journalid	J014675463	-
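
The stacking idea described in the abstract (one baseline model per feature encoding, whose predicted scores become the feature vector of a final model) can be sketched in a few lines. This is only an illustrative sketch under stated assumptions, not the Stack-ORI implementation: it uses two k-mer composition encodings instead of the 12 schemes, a toy nearest-centroid scorer as a stand-in for XGBoost, and made-up sequences and labels. All names (`kmer_composition`, `CentroidScorer`, `fit_stack`, `predict_stack`) are hypothetical.

```python
from itertools import product


def kmer_composition(seq, k):
    """Frequency of every DNA k-mer in seq (a nucleic acid composition encoding)."""
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    seq = seq.upper()
    for i in range(max(len(seq) - k + 1, 0)):
        window = seq[i:i + k]
        if window in counts:  # skip windows containing N or other symbols
            counts[window] += 1
    total = sum(counts.values()) or 1  # avoid division by zero on short input
    return [counts[m] / total for m in kmers]


class CentroidScorer:
    """Toy stand-in for a classifier (the abstract names XGBoost instead).

    Assumes both classes (labels 0 and 1) are present in the training data.
    """

    def fit(self, X, y):
        pos = [x for x, label in zip(X, y) if label == 1]
        neg = [x for x, label in zip(X, y) if label == 0]
        self.pos = [sum(col) / len(pos) for col in zip(*pos)]
        self.neg = [sum(col) / len(neg) for col in zip(*neg)]
        return self

    def score(self, x):
        """Score in [0, 1]; higher means closer to the positive-class centroid."""
        d_pos = sum((a - b) ** 2 for a, b in zip(x, self.pos))
        d_neg = sum((a - b) ** 2 for a, b in zip(x, self.neg))
        return d_neg / (d_pos + d_neg) if d_pos + d_neg else 0.5


def fit_stack(seqs, labels, encoders):
    """Fit one baseline model per encoding, then a final model on their scores."""
    base_models, columns = [], []
    for encode in encoders:
        X = [encode(s) for s in seqs]
        model = CentroidScorer().fit(X, labels)
        base_models.append(model)
        # A real pipeline would typically use cross-validated scores here
        # so the final model does not see scores from already-seen samples.
        columns.append([model.score(x) for x in X])
    meta_X = [list(row) for row in zip(*columns)]  # one row of scores per sequence
    final_model = CentroidScorer().fit(meta_X, labels)
    return base_models, final_model, meta_X


def predict_stack(seq, encoders, base_models, final_model):
    """Score a new sequence: baseline scores first, then the final model."""
    meta_x = [m.score(enc(seq)) for enc, m in zip(encoders, base_models)]
    return final_model.score(meta_x)


# Two illustrative encodings (the paper uses 12 schemes).
encoders = [lambda s: kmer_composition(s, 1), lambda s: kmer_composition(s, 2)]

# Made-up toy sequences and labels, for illustration only.
train_seqs = ["ATATATTA", "TTATAATA", "GCGCGGCC", "CCGGCGCG"]
train_labels = [1, 1, 0, 0]

base_models, final_model, meta_X = fit_stack(train_seqs, train_labels, encoders)
print(predict_stack("ATTATATA", encoders, base_models, final_model))
```

Each row of `meta_X` holds one score per encoding, which corresponds to the "novel feature vector" of baseline scores that the abstract describes feeding into the final model.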

Appears in Collections: Journal Papers > School of Medicine / Graduate School of Medicine > Physiology

Files in This Item: There are no files associated with this item.

Ajou University Medical Information & Media Center 164 Worldcup-ro Yeongtong-gu Suwon 16499 Korea / TEL : 031-219-5312
Copyright (c) Ajou University Medical Information & Media Center All Rights Reserved.
AJOU Open Repository was built through the National Library of Korea's OAK distribution project.