TechRxiv
AllelePred Manuscript (v6.2).pdf (879.13 kB)
Download file

AllelePred: A Simple Allele Frequencies Ensemble Predictor for Different Single Nucleotide Variants

Download (879.13 kB)
preprint
posted on 2021-08-10, 17:36 authored by Turki SobahyTurki Sobahy, Meshari Alazmi
Genomic medicine stands to be revolutionized through the understanding of single nucleotide variants (SNVs) and their expression in single-gene disorders (mendelian diseases). Computational tools can play a vital role in the exploration of such variations and their pathogenicity. Consequently, we developed the ensemble prediction tool AllelePred to identify deleterious SNVs and disease causative genes. In comparison to other tools, our classifier achieves higher accuracy, precision, F1 score, and coverage for different types of coding variants. Furthermore, this research analyzes and structures 168,945 broad spectrum genetic variants from the genomes of the Saudi population to denote the accuracy of the model. When compared, AllelePred was able to structure the unlabeled Saudi genetic variants of the dataset to mimic the data characteristics of the known labeled data. On this basis, we accumulated a list of highly probable deleterious variants that we recommend for further experimental validation prior to medical diagnostic usage.

History

Email Address of Submitting Author

tsobahy@kfshrc.edu.sa

ORCID of Submitting Author

https://orcid.org/0000-0003-1797-3098

Submitting Author's Institution

King Faisal Specialist Hospital & Research Center-Jeddah

Submitting Author's Country

Saudi Arabia