ASCARIS: positional feature annotation and protein structure-based representation of single amino acid variations

dc.contributor.authorid0000-0002-9595-3722
dc.contributor.coauthorDogan, Tunca
dc.contributor.departmentN/A
dc.contributor.kuauthorCankara, Fatma
dc.contributor.kuprofilePhD Student
dc.contributor.schoolcollegeinstituteGraduate School of Sciences and Engineering
dc.date.accessioned2025-01-19T10:29:40Z
dc.date.issued2023
dc.description.abstractBackground: Genomic variations may cause deleterious effects on protein functionality and perturb biological processes. Elucidating the effects of variations is critical for developing novel treatment strategies for diseases of genetic origin. Computational approaches have been aiding the work in this field by modeling and analyzing the mutational landscape. However, new approaches are required, especially for accurate representation and datacentric analysis of sequence variations. Method: In this study, we propose ASCARIS (Annotation and StruCture-bAsed RepresentatIon of Single amino acid variations), a method for the featurization (i.e., quantitative representation) of single amino acid variations (SAVs), which could be used for a variety of purposes, such as predicting their functional effects or building multi-omics-based integrative models. ASCARIS utilizes the direct and spatial correspondence between the location of the SAV on the sequence/structure and 30 different types of positional feature annotations (e.g., active/lipidation/glycosylation sites; calcium/metal/DNA binding, inter/transmembrane regions, etc.), along with structural features and physicochemical properties. The main novelty of this method lies in constructing reusable numerical representations of SAVs via functional annotations. Results: We statistically analyzed the relationship between these features and the consequences of variations and found that each carries information in this regard. To investigate potential applications of ASCARIS, we trained variant effect prediction models that utilize our SAV representations as input. We carried out an ablation study and a comparison against the state-of-the-art methods and observed that ASCARIS has a competing and complementary performance against widely-used predictors. ASCARIS can be used alone or in combination with other approaches to represent SAVs from a functional perspective. ASCARIS is available as a programmatic tool at https://github.com/HUBioDataLab/ASCARIS and as a web-service at https://huggingface.co/spaces/HUBioDataLab/ASCARIS.
dc.description.indexedbyWoS
dc.description.indexedbyScopus
dc.description.indexedbyPubMed
dc.description.openaccessGreen Submitted, Green Published
dc.description.publisherscopeInternational
dc.description.volume21
dc.identifier.doi10.1016/j.csbj.2023.09.017
dc.identifier.issn2001-0370
dc.identifier.quartileQ2
dc.identifier.scopus2-s2.0-85173183971
dc.identifier.urihttps://doi.org/10.1016/j.csbj.2023.09.017
dc.identifier.urihttps://hdl.handle.net/20.500.14288/25924
dc.identifier.wos1123814500001
dc.keywordsSingle amino acid variations
dc.keywordsBiomolecular representations
dc.keywordsProtein sequence annotations
dc.keywordsProtein domains
dc.keywordsBioinformatics tools/models
dc.keywordsVariant effect prediction
dc.keywordsMachine learning
dc.languageen
dc.publisherElsevier
dc.sourceComputational and Structural Biotechnology Journal
dc.subjectBiochemistry and molecular biology
dc.subjectBiotechnology and applied microbiology
dc.titleASCARIS: positional feature annotation and protein structure-based representation of single amino acid variations
dc.typeJournal Article

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
IR04417.pdf
Size:
7.74 MB
Format:
Adobe Portable Document Format