Sofia Papadimitriou, Andrea Gazzo, Nassim Versbraegen, Charlotte Nachtegael, Jan Aerts, Yves Moreau, Sonia Van Dooren, Ann Nowe, Guillaume Smits, Tom Lenaerts
Notwithstanding advances in predicting pathogenic variants in rare monogenic diseases, identifying causative variant combinations in disorders where more genes are involved still remains a challenge. As data on digenic diseases accumulates, we can now develop predictive methods that identify pathogenic variant combinations in gene pairs. We present VarCoPP, the first Variant Combination Pathogenicity Predictor, which is an innovative tool for variant assessment in oligogenic disorders. VarCoPP was trained on pathogenic digenic variant combinations from the Digenic Diseases Database (DIDA) and neutral digenic combinations from the 1000 Genomes Project. By combining 500 different trained Random Forest predictors, VarCoPP differentiates pathogenic from neutral digenic variant combinations. After cross-validation the predictor reaches a Matthews Correlation Coefficient of 0.74. VarCoPP ranks the results and provides evaluation scores for each prediction, i.e. a Classification Score (CS), which expresses the likelihood of a digenic combination being pathogenic and a Support Score (SS) that expresses how many RFs agree with that decision. Clinically relevant confidence zones, delimited by minimum CS and SS scores, are also provided that guarantee with 95% or a 99% probability that the prediction is indeed a true positive. Validation with 23 gene pairs related to digenic diseases not yet present in DIDA shows that the method identifies 20 cases correctly, all of which have at least a 95% confidence score. The results of VarCoPP show that the first steps to oligogenic pathogenicity prediction can be taken, expanding our capability to assist clinicians and geneticists in unraveling oligogenic signatures in rare diseases.
Papadimitriou, S, Gazzo, A, Versbraegen, N, Nachtegael, C, Aerts, J, Moreau, Y, Van Dooren, S, Nowe, A, Smits, G & Lenaerts, T 2018, 'Predicting disease-causing variant combinations for oligogenic diseases', Applied Bioinformatics in Life Sciences, Leuven, Belgium, 8/03/18 - 9/03/18.
Papadimitriou, S., Gazzo, A., Versbraegen, N., Nachtegael, C., Aerts, J., Moreau, Y., Van Dooren, S., Nowe, A., Smits, G., & Lenaerts, T. (2018). Predicting disease-causing variant combinations for oligogenic diseases. Poster session presented at Applied Bioinformatics in Life Sciences, Leuven, Belgium.
@conference{a666364193ca42b2a11ad4ac36bf006f,
title = "Predicting disease-causing variant combinations for oligogenic diseases",
abstract = "Notwithstanding advances in predicting pathogenic variants in rare monogenic diseases, identifying causative variant combinations in disorders where more genes are involved still remains a challenge. As data on digenic diseases accumulates, we can now develop predictive methods that identify pathogenic variant combinations in gene pairs. We present VarCoPP, the first Variant Combination Pathogenicity Predictor, which is an innovative tool for variant assessment in oligogenic disorders. VarCoPP was trained on pathogenic digenic variant combinations from the Digenic Diseases Database (DIDA) and neutral digenic combinations from the 1000 Genomes Project. By combining 500 different trained Random Forest predictors, VarCoPP differentiates pathogenic from neutral digenic variant combinations. After cross-validation the predictor reaches a Matthews Correlation Coefficient of 0.74. VarCoPP ranks the results and provides evaluation scores for each prediction, i.e. a Classification Score (CS), which expresses the likelihood of a digenic combination being pathogenic and a Support Score (SS) that expresses how many RFs agree with that decision. Clinically relevant confidence zones, delimited by minimum CS and SS scores, are also provided that guarantee with 95% or a 99% probability that the prediction is indeed a true positive. Validation with 23 gene pairs related to digenic diseases not yet present in DIDA shows that the method identifies 20 cases correctly, all of which have at least a 95% confidence score. The results of VarCoPP show that the first steps to oligogenic pathogenicity prediction can be taken, expanding our capability to assist clinicians and geneticists in unraveling oligogenic signatures in rare diseases. ",
author = "Sofia Papadimitriou and Andrea Gazzo and Nassim Versbraegen and Charlotte Nachtegael and Jan Aerts and Yves Moreau and {Van Dooren}, Sonia and Ann Nowe and Guillaume Smits and Tom Lenaerts",
year = "2018",
month = mar,
day = "8",
language = "English",
note = "Applied Bioinformatics in Life Sciences : 2nd edition, ABLS2018 ; Conference date: 08-03-2018 Through 09-03-2018",
}