{"id":65,"date":"2015-03-17T16:19:31","date_gmt":"2015-03-17T23:19:31","guid":{"rendered":"https:\/\/www.enlis.com\/blog\/?p=65"},"modified":"2015-03-17T16:25:45","modified_gmt":"2015-03-17T23:25:45","slug":"the-best-variant-prediction-method-that-no-one-is-using","status":"publish","type":"post","link":"https:\/\/www.enlis.com\/blog\/2015\/03\/17\/the-best-variant-prediction-method-that-no-one-is-using\/","title":{"rendered":"The Best Variant Prediction Method That No One Is Using"},"content":{"rendered":"<p><span style=\"font-size: 14pt;\"><em>A test of the latest functional prediction algorithms:<\/em><\/span><\/p>\n<p><span style=\"font-size: 14pt;\"><em>CADD, DANN, FATHMM<\/em><\/span><\/p>\n<p>&nbsp;<\/p>\n<hr \/>\n<p><span style=\"color: #ff0000; font-size: 12pt;\">Spoiler alert!\u00c2\u00a0 Summary for those short on time:<\/span><\/p>\n<p style=\"padding-left: 30px;\"><span style=\"font-size: 12pt;\"><span style=\"color: #808080;\"> Our conclusion:\u00c2\u00a0 Despite being released <\/span><span style=\"color: #808080;\"> to little fanfare <\/span><span style=\"color: #808080;\">last October &#8211; we found that DANN offered the best sensitivity and specificity (highest true positives and lowest false positives).\u00c2\u00a0 DANN had less &#8216;noise&#8217; compared to both CADD and FATHMM.\u00c2\u00a0 Our latest software includes DANN predictions and you can test it out on your own data!<\/span><\/span><\/p>\n<hr \/>\n<p>&nbsp;<\/p>\n<p><span style=\"font-size: 14pt;\">How do you decide which variations are important in a genome?<\/span><\/p>\n<p><span style=\"font-size: 14pt;\">When we lack good information on a variation, can we predict whether it is functional and important?\u00c2\u00a0 Why do we even want to predict?\u00c2\u00a0 Let&#8217;s get to the bottom of this.<\/span><\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-size: 14pt;\"><span style=\"font-size: 18pt;\"><strong>The Problem<\/strong>:\u00c2\u00a0<\/span> <\/span><\/p>\n<p><span style=\"font-size: 12pt;\">The human genome is vast.\u00c2\u00a0 Billions of bases.\u00c2\u00a0 Really, really quite big.\u00c2\u00a0 And we don&#8217;t know much about it.<\/span><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-68 aligncenter\" src=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/stuffWeKnowAbout.png\" alt=\"stuffWeKnowAbout\" width=\"600\" height=\"300\" srcset=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/stuffWeKnowAbout.png 800w, https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/stuffWeKnowAbout-300x150.png 300w\" sizes=\"auto, (max-width: 600px) 100vw, 600px\" \/><br \/>\n<br style=\"clear: both;\" \/><br \/>\n<span style=\"font-size: 12pt;\">The latest sequencing technology has made nearly all of those bases available to us.\u00c2\u00a0 In many cases, when we see a variation, we don&#8217;t have any information on its functional impact.\u00c2\u00a0 It could have no impact whatsoever (benign), or it could be the cause of a disease that we are studying (pathogenic).\u00c2\u00a0 If we could predict which ones were more likely to be functional, we can focus validation efforts on those variations first &#8211; potentially saving time and money.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">To this aim, researchers have developed algorithms to predict if a variation is functional, based on a number of different criteria.<\/span><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-size: 18pt;\"><strong>The algorithms:<\/strong><\/span><\/p>\n<p><span style=\"font-size: 12pt;\">For many years, the most commonly used prediction algorithms were SIFT and POLYPHEN2.\u00c2\u00a0 These are both limited to variations that change the amino acid sequence of a protein.\u00c2\u00a0 However, increasingly researchers want to look outside of protein-coding regions.\u00c2\u00a0 In addition, given all the new genome-wide data that has been released over the last few years, like ENCODE and new allele frequency data, the world was ripe for a new strategy.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">\u00c2\u00a0<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">The <a href=\"http:\/\/www.nature.com\/ng\/journal\/v46\/n3\/full\/ng.2892.html\" target=\"_blank\">CADD<\/a> algorithm was published in February 2014, and it appeared to be a significant improvement over existing methods.<\/span><\/p>\n<div id=\"attachment_69\" style=\"width: 478px\" class=\"wp-caption alignnone\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-69\" class=\"size-full wp-image-69\" src=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/CADD_suppf12b.png\" alt=\"CADD Supplementary Data Figure 12 - A curve to the furthest to the left shows improved sensitivity and specificity\" width=\"468\" height=\"499\" srcset=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/CADD_suppf12b.png 468w, https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/CADD_suppf12b-281x300.png 281w\" sizes=\"auto, (max-width: 468px) 100vw, 468px\" \/><p id=\"caption-attachment-69\" class=\"wp-caption-text\">CADD Supplementary Data Figure 12 &#8211; A curve furthest to the top left shows improved sensitivity and specificity<\/p><\/div>\n<p>&nbsp;<\/p>\n<p><span style=\"font-size: 12pt;\">CADD used machine learning to develop its algorithm.\u00c2\u00a0 In a nutshell, CADD developers gave the computer a set of training data so that it could have a list of functional vs. non-functional variations.\u00c2\u00a0 Then, they fed the machine 63 different annotations of that data, and let the computer figure out how to use those annotations to rank functional vs. non-functional variations.\u00c2\u00a0 Once they had an algorithm set, they ran it on every possible single nucleotide variation in the reference genome and gave each variation a score &#8211; higher the score, the higher the prediction that it is functional.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">\u00c2\u00a0<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">More recently, two new algorithms were published &#8211; both of which claim to be improvements on CADD:<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">\u00c2\u00a0<\/span><\/p>\n<p style=\"padding-left: 30px;\"><span style=\"font-size: 12pt;\"><a href=\"http:\/\/bioinformatics.oxfordjournals.org\/content\/31\/5\/761\" target=\"_blank\">DANN<\/a> &#8211; (October 2014)\u00c2\u00a0 DANN uses the exact same training and annotation data as CADD, but uses a different &#8216;nonlinear&#8217; machine learning approach.<\/span><\/p>\n<p style=\"padding-left: 30px;\"><span style=\"font-size: 12pt;\"><a href=\"http:\/\/bioinformatics.oxfordjournals.org\/content\/early\/2015\/02\/10\/bioinformatics.btv009\" target=\"_blank\">FATHMM<\/a> &#8211; (January 2015)\u00c2\u00a0 FATHMM uses a similar machine learning approach to CADD, but uses a different set of training and annotation data.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">\u00c2\u00a0<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">We decided to take a look at all three and compare their results.\u00c2\u00a0 Which one will prove to be the best?<\/span><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-size: 18pt;\"><strong>The test data set:<\/strong><\/span><\/p>\n<p><span style=\"font-size: 12pt;\">We need to compare a list of known functional (pathogenic) variations against a list of known non-functional (benign) variations.\u00c2\u00a0 Then, we can score all the variations with each algorithm and see how the lists compare.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">We chose a difficult data set, but one that we think accurately portrays the task that is faced by many researchers today.\u00c2\u00a0 The test data is based on ClinVar.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\"><a href=\"http:\/\/www.ncbi.nlm.nih.gov\/clinvar\/\" target=\"_blank\">ClinVar<\/a> has fast become an important resource for the analysis of sequencing data.\u00c2\u00a0 It stores information on pathogenic variations and how they are connected to human health.\u00c2\u00a0 In fact, all three of the papers describing the contenders used ClinVar to demonstrate their effectiveness.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">So what will we do differently?\u00c2\u00a0 The papers used pathogenic variations in ClinVar, and compared them to (assumed benign) variations that have a global allele frequency of &gt;5%.\u00c2\u00a0 But ClinVar also records some variations that are known to be benign, and we think this is a more interesting set of non-functional variation for these reasons:<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">\u00c2\u00a0<\/span><\/p>\n<ul>\n<li><span style=\"font-size: 12pt;\">The ClinVar benign variations are mostly in genes that are already known to be clinically relevant (have functional effects).\u00c2\u00a0 It is important to be able to differentiate pathogenic vs. benign variations in genes that are already known to affect human health<\/span><\/li>\n<li><span style=\"font-size: 12pt;\">The ClinVar benign variations are more rare.\u00c2\u00a0 Researchers today often have to decide if a rare variation is important.\u00c2\u00a0 In the benign data, 55% of the variations have an allele frequency of &lt;5%.\u00c2\u00a0 Over 31% of the variations have allele frequency &lt;1%<\/span><\/li>\n<\/ul>\n<p><span style=\"font-size: 12pt;\">In short, many of the benign variations in ClinVar are those which at some point somebody asked: &#8220;Could this be important?&#8221;<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">In the end, we chose an equal number (~6500) of benign and pathogenic variations from ClinVar, and scored them with each algorithm.<\/span><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-size: 18pt;\"><strong>What are we looking for:<\/strong><\/span><\/p>\n<p><span style=\"font-size: 12pt;\">How do we tell which algorithm is better?\u00c2\u00a0 What do we want to see?<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">Here are some criteria that we want to look at:<\/span><\/p>\n<ul>\n<li><span style=\"font-size: 12pt;\">Find as many pathogenic variations possible (True positives)<\/span><\/li>\n<li><span style=\"font-size: 12pt;\">Minimize the number of benign variations that are scored as pathogenic (False positives)<\/span><\/li>\n<li><span style=\"font-size: 12pt;\">It would be good to have a clear differentiation between pathogenic and benign variation scores<\/span><\/li>\n<li><span style=\"font-size: 12pt;\">Minimize the &#8216;noise&#8217; in the data.\u00c2\u00a0 We expect that given the size of the genome, functional variations are a relatively rare event.\u00c2\u00a0 So if we can find more true positive variations within a smaller section of the data that would be better.<\/span><\/li>\n<\/ul>\n<p><span style=\"font-size: 12pt;\">Along the way we want to try to identify the best score thresholds for each algorithm &#8211; the best dividing line between a pathogenic score and a benign score.<\/span><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-size: 18pt;\"><strong>The results:<\/strong><\/span><\/p>\n<p><span style=\"text-decoration: underline; font-size: 12pt;\">Raw scores:<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">Here are box plots showing each algorithm&#8217;s raw scores of pathogenic variations (blue) vs benign variations (orange): (click to enlarge)<\/span><\/p>\n<table style=\"width: 100%;\">\n<tbody>\n<tr>\n<td>\n<p><div id=\"attachment_70\" style=\"width: 260px\" class=\"wp-caption alignnone\"><a href=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/cadd_clinvar_pathogenic_vs_benign.png\" rel=\"lightbox\" data-rel=\"lightbox-image-0\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-70\" class=\"wp-image-70\" src=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/cadd_clinvar_pathogenic_vs_benign-300x286.png\" alt=\"cadd_clinvar_pathogenic_vs_benign\" width=\"250\" height=\"239\" srcset=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/cadd_clinvar_pathogenic_vs_benign-300x286.png 300w, https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/cadd_clinvar_pathogenic_vs_benign.png 880w\" sizes=\"auto, (max-width: 250px) 100vw, 250px\" \/><\/a><p id=\"caption-attachment-70\" class=\"wp-caption-text\"><span style=\"font-size: 10pt;\"><strong>CADD<\/strong><\/span> ClinVar Pathogenic vs. Benign scores<\/p><\/div><\/td>\n<td>\n<p><div id=\"attachment_71\" style=\"width: 260px\" class=\"wp-caption alignnone\"><a href=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/dann_clinvar_pathogenic_vs_benign.png\" rel=\"lightbox\" data-rel=\"lightbox-image-1\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-71\" class=\"wp-image-71\" src=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/dann_clinvar_pathogenic_vs_benign-300x286.png\" alt=\"dann_clinvar_pathogenic_vs_benign\" width=\"250\" height=\"239\" srcset=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/dann_clinvar_pathogenic_vs_benign-300x286.png 300w, https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/dann_clinvar_pathogenic_vs_benign.png 880w\" sizes=\"auto, (max-width: 250px) 100vw, 250px\" \/><\/a><p id=\"caption-attachment-71\" class=\"wp-caption-text\"><span style=\"font-size: 10pt;\"><strong>DANN<\/strong><\/span> ClinVar Pathogenic vs. Benign scores<\/p><\/div><\/td>\n<td>\n<p><div id=\"attachment_73\" style=\"width: 260px\" class=\"wp-caption alignnone\"><a href=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/fathmm_coding_clinvar_pathogenic_vs_benign.png\" rel=\"lightbox\" data-rel=\"lightbox-image-2\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-73\" class=\"wp-image-73\" src=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/fathmm_coding_clinvar_pathogenic_vs_benign-300x286.png\" alt=\"fathmm_coding_clinvar_pathogenic_vs_benign\" width=\"250\" height=\"239\" srcset=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/fathmm_coding_clinvar_pathogenic_vs_benign-300x286.png 300w, https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/fathmm_coding_clinvar_pathogenic_vs_benign.png 880w\" sizes=\"auto, (max-width: 250px) 100vw, 250px\" \/><\/a><p id=\"caption-attachment-73\" class=\"wp-caption-text\"><span style=\"font-size: 10pt;\"><strong>FATHMM<\/strong><\/span> ClinVar Pathogenic vs. Benign scores<\/p><\/div><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><span style=\"font-size: 12pt;\">Note that while both DANN and FATHMM use a scoring system between 0 and 1, CADD uses an open-ended scoring system that can range from around -7 up to 20.\u00c2\u00a0 The DANN pathogenic scores are tightly clustered near the top of the graph, making the box plot difficult to see, so we also created a <a href=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/dann_clinvar_pathogenic_vs_benign_zoom.png\" data-rel=\"lightbox-image-3\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\">DANN zoomed in view<\/a>.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">Looking at the raw scores, one can get a sense for how well separated the pathogenic scores are from the benign scores, but we need to put some numbers on that!<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">\u00c2\u00a0<\/span><\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"text-decoration: underline; font-size: 12pt;\">True Positives vs. False Positives:<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">To classify variations as pathogenic or benign, one needs to pick a dividing line or score threshold.\u00c2\u00a0 When a variation has a score above the threshold, it is considered pathogenic &#8211; below the threshold, it is considered benign.\u00c2\u00a0 In doing this type of classification, there is often a tradeoff.\u00c2\u00a0 Set the threshold too low, and you will have a lot of false positives <em>i.e.<\/em> truly benign variations that were considered pathogenic.\u00c2\u00a0 Set the threshold too high, and you will not find very many true positive pathogenic variations.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">This tradeoff is captured in a graph called a receiver operating characteristic curve (ROC curve).\u00c2\u00a0 First, the classification of pathogenic vs. benign is made at many different score thresholds.\u00c2\u00a0 Then, for each threshold, the percent of true positive identifications is placed on the Y-axis to correspond with the percent of false positive identifications on the X-axis.\u00c2\u00a0\u00c2\u00a0 The best classification algorithm will be the one where its ROC curve is closest to the top left corner &#8211; the place on the graph where all the true positive variations were found, and no variations were false positive.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">Below is the ROC curve for each different algorithm when considering the ClinVar pathogenic vs. benign dataset (click to enlarge):<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">\u00c2\u00a0<\/span><a href=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/ROC_clinvar_pathogenic_vs_benign.png\" rel=\"lightbox\" data-rel=\"lightbox-image-4\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-77\" src=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/ROC_clinvar_pathogenic_vs_benign.png\" alt=\"ROC_clinvar_pathogenic_vs_benign\" width=\"430\" height=\"400\" srcset=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/ROC_clinvar_pathogenic_vs_benign.png 860w, https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/ROC_clinvar_pathogenic_vs_benign-300x279.png 300w\" sizes=\"auto, (max-width: 430px) 100vw, 430px\" \/><\/a><br \/>\n<br style=\"clear: both;\" \/><br \/>\nD<span style=\"font-size: 12pt;\">ANN (orange) is clearly superior in this test compared to CADD (blue) and FATHMM (green). \u00c2\u00a0 This means that it will find more true positive variations and less false positives.\u00c2\u00a0 Given that CADD and DANN use the exact same training data, it would seem that DANN&#8217;s &#8216;non-linear&#8217; machine learning approach is the better choice.\u00c2\u00a0 CADD and FATHMM use the same type of machine learning algorithm and have very similar curves.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">To put some concrete numbers on this &#8211; lets say that you wanted a maximum of 20% false positives in your prediction data (0.2 on the X-axis).\u00c2\u00a0 At that threshold, you would find <span style=\"color: #ff6600;\">93%<\/span> of the true positives with the <span style=\"color: #ff6600;\">DANN<span style=\"color: #000000;\"> algorithm<\/span><\/span>, but you would only find around <span style=\"color: #0000ff;\">80%<\/span> of the true positives with the <span style=\"color: #0000ff;\">CADD<\/span> and <span style=\"color: #008000;\">FATHMM<\/span> algorithms.<\/span><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"text-decoration: underline; font-size: 12pt;\">Best Thresholds:<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">To find the best score threshold for each algorithm, we wanted to find a score threshold where there was the maximum difference between the number of true positives identified (pathogenic) and the number of false positives (benign).<\/span><\/p>\n<p>&nbsp;<\/p>\n<table style=\"width: 100%;\">\n<tbody>\n<tr>\n<td>\n<p><div id=\"attachment_74\" style=\"width: 260px\" class=\"wp-caption alignnone\"><a href=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/cadd_thresholds.png\" rel=\"lightbox\" data-rel=\"lightbox-image-5\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-74\" class=\"wp-image-74\" src=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/cadd_thresholds-300x240.png\" alt=\"cadd_thresholds\" width=\"250\" height=\"200\" srcset=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/cadd_thresholds-300x240.png 300w, https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/cadd_thresholds.png 1000w\" sizes=\"auto, (max-width: 250px) 100vw, 250px\" \/><\/a><p id=\"caption-attachment-74\" class=\"wp-caption-text\"><span style=\"font-size: 10pt;\"><strong>CADD<\/strong><\/span> &#8211; Percent of true positive pathogenic (blue) and false positive benign (orange) variations found at each score threshold<\/p><\/div><\/td>\n<td>\n<p><div id=\"attachment_75\" style=\"width: 260px\" class=\"wp-caption alignnone\"><a href=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/dann_thresholds1.png\" rel=\"lightbox\" data-rel=\"lightbox-image-6\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-75\" class=\"wp-image-75\" src=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/dann_thresholds1-300x240.png\" alt=\"dann_thresholds(1)\" width=\"250\" height=\"200\" srcset=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/dann_thresholds1-300x240.png 300w, https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/dann_thresholds1.png 1000w\" sizes=\"auto, (max-width: 250px) 100vw, 250px\" \/><\/a><p id=\"caption-attachment-75\" class=\"wp-caption-text\"><strong><span style=\"font-size: 10pt;\">DANN<\/span><\/strong> &#8211; Percent of true positive pathogenic (blue) and false positive benign (orange) variations found at each score threshold<\/p><\/div><\/td>\n<td>\n<p><div id=\"attachment_76\" style=\"width: 260px\" class=\"wp-caption alignnone\"><a href=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/fathmm_thresholds1.png\" rel=\"lightbox\" data-rel=\"lightbox-image-7\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-76\" class=\"wp-image-76\" src=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/fathmm_thresholds1-300x240.png\" alt=\"fathmm_thresholds(1)\" width=\"250\" height=\"200\" srcset=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/fathmm_thresholds1-300x240.png 300w, https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/fathmm_thresholds1.png 1000w\" sizes=\"auto, (max-width: 250px) 100vw, 250px\" \/><\/a><p id=\"caption-attachment-76\" class=\"wp-caption-text\"><span style=\"font-size: 10pt;\"><strong>FATHMM<\/strong><\/span> &#8211; Percent of true positive pathogenic (blue) and false positive benign (orange) variations found at each score threshold<\/p><\/div><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><span style=\"font-size: 12pt;\">For <strong>CADD<\/strong> the best threshold was at a score of 1.75 &#8211; where it identified <span style=\"color: #0000ff;\">84.1%<\/span> of the true positive pathogenic variations, and found <span style=\"color: #ff9900;\">23.9%<\/span> of the false positive benign variations.\u00c2\u00a0 A maximum difference of <span style=\"color: #999999;\">60.2%<\/span>.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">For <strong>DANN<\/strong> the best threshold was at a score of 0.96 &#8211; where it identified <span style=\"color: #3366ff;\">92.1%<\/span> of the true positive pathogenic variations, and found <span style=\"color: #ff9900;\">18.1%<\/span> of the false positive benign variations.\u00c2\u00a0 A maximum difference of <span style=\"color: #999999;\">74.0%<\/span>.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">For <strong>FATHMM<\/strong> the best threshold was at a score of 0.80 &#8211; where it identified <span style=\"color: #3366ff;\">83.3%<\/span> of the true positive pathogenic variations, and found <span style=\"color: #ff9900;\">22.8%<\/span> of the false positive benign variations.\u00c2\u00a0 A maximum difference of <span style=\"color: #999999;\">60.5%<\/span>.<\/span><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"text-decoration: underline; font-size: 12pt;\">Minimize Noise:<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">Let&#8217;s imagine that algorithm A labels 20 million variations as pathogenic and algorithm B labels 40 million variations as pathogenic, but both find 90% of all the pathogenic variations in the genome.\u00c2\u00a0 In this case, algorithm A would be superior because it would have less false positives &#8211; less noise to consider in the analysis of data. So this is related to the specificity values that we saw earlier.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">Another way to think about this is to ask &#8211; in the top XX% of the variations scored by each algorithm, how many of the ClinVar pathogenic variations were found?<\/span><\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-size: 12pt;\">Percent of ClinVar Pathogenic variations found:<\/span><\/p>\n<table style=\"width: 100%; border: 1px solid #999;\">\n<tbody>\n<tr>\n<td style=\"border: 1px solid #999;\"><\/td>\n<td style=\"border: 1px solid #999;\">Top 0.3% of algorithm scores<\/td>\n<td style=\"border: 1px solid #999;\">Top 1% of algorithm scores<\/td>\n<td style=\"border: 1px solid #999;\">Top 2% of algorithm scores<\/td>\n<\/tr>\n<tr>\n<td style=\"border: 1px solid #999;\">CADD<\/td>\n<td style=\"border: 1px solid #999;\">53.7%<\/td>\n<td style=\"border: 1px solid #999;\">73.5%<\/td>\n<td style=\"border: 1px solid #999;\">83.0%<\/td>\n<\/tr>\n<tr>\n<td style=\"border: 1px solid #999;\">DANN<\/td>\n<td style=\"border: 1px solid #999;\"><strong>64.9%<\/strong><\/td>\n<td style=\"border: 1px solid #999;\"><strong>92.2%<\/strong><\/td>\n<td style=\"border: 1px solid #999;\"><strong>94.0%<\/strong><\/td>\n<\/tr>\n<tr>\n<td style=\"border: 1px solid #999;\">FATHMM<\/td>\n<td style=\"border: 1px solid #999;\">40.7%<\/td>\n<td style=\"border: 1px solid #999;\">74.1%<\/td>\n<td style=\"border: 1px solid #999;\">85.4%<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p><span style=\"font-size: 12pt;\">As our knowledge of the functionality of the genome is still in its early stages, this metric is secondary for now.\u00c2\u00a0 The values here may change significantly with different test data sets &#8211; but it is interesting to consider nonetheless.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">\u00c2\u00a0<\/span><\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-size: 18pt;\"><strong>DANN in Enlis Genome Research:<\/strong><\/span><\/p>\n<p><span style=\"font-size: 12pt;\">So how can you make use of the DANN predictions?\u00c2\u00a0 DANN scores have been fully integrated into our latest release of Enlis Genome Research.\u00c2\u00a0 <\/span><\/p>\n<p><span style=\"font-size: 12pt;\">For example, you can see them here in the Predicted Deleterious column of the position pages: (click to enlarge)<\/span><\/p>\n<p><a href=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/positionPageWPrec.png\" rel=\"lightbox\" data-rel=\"lightbox-image-8\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-78 size-medium\" src=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/positionPageWPrec-300x202.png\" alt=\"positionPageWPrec\" width=\"300\" height=\"202\" srcset=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/positionPageWPrec-300x202.png 300w, https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/positionPageWPrec-1024x689.png 1024w, https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/positionPageWPrec.png 1246w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><br \/>\n<br style=\"clear: both;\" \/><br \/>\n<span style=\"font-size: 12pt;\">Instead of simply setting one score threshold, we have annotated variations at 3 different score levels, so that the user can choose their level of specificity and sensitivity.<\/span><\/p>\n<p>&nbsp;<\/p>\n<table style=\"width: 100%; border: 1px solid #999;\">\n<tbody>\n<tr>\n<td style=\"border: 1px solid #999;\"><strong>Enlis Score Level<\/strong><\/td>\n<td style=\"border: 1px solid #999;\"><strong>DANN Score Range<\/strong><\/td>\n<td style=\"border: 1px solid #999;\"><strong>Percentage of Variations<\/strong><\/td>\n<td style=\"border: 1px solid #999;\"><strong>Best For Variation Types<\/strong><\/td>\n<\/tr>\n<tr>\n<td style=\"border: 1px solid #999;\">\u00c2\u00a0<img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-79\" src=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/dannLevel3.png\" alt=\"dannLevel3\" width=\"40\" height=\"18\" \/><\/td>\n<td style=\"border: 1px solid #999;\">0.995 &#8211; 1<\/td>\n<td style=\"border: 1px solid #999;\">0.31% of all scores<\/td>\n<td style=\"border: 1px solid #999;\">Protein disrupting\/altering<\/td>\n<\/tr>\n<tr>\n<td style=\"border: 1px solid #999;\">\u00c2\u00a0<img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-80\" src=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/dannLevel2.png\" alt=\"dannLevel2\" width=\"40\" height=\"18\" \/><\/td>\n<td style=\"border: 1px solid #999;\">0.98 &#8211; 0.995<\/td>\n<td style=\"border: 1px solid #999;\">0.62% of all scores<\/td>\n<td style=\"border: 1px solid #999;\">Protein disrupting\/altering, Splice site<\/td>\n<\/tr>\n<tr>\n<td style=\"border: 1px solid #999;\">\u00c2\u00a0<img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-81\" src=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/dannLevel1.png\" alt=\"dannLevel1\" width=\"40\" height=\"18\" \/><\/td>\n<td style=\"border: 1px solid #999;\">0.93 &#8211; 0.98<\/td>\n<td style=\"border: 1px solid #999;\">2.15% of all scores<\/td>\n<td style=\"border: 1px solid #999;\">Splice site, Promoter region<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p><span style=\"font-size: 12pt;\">In addition, there is a new &#8216;Predicted Deleterious&#8217; filter in the Variation Filter tool for finding variations in your genomes that are at these score levels.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">We have pre-loaded one interesting example query using this filter &#8211; that is to search for &#8216;rare variations&#8217; &#8216;in OMIM genes&#8217; that are &#8216;predicted to be deleterious&#8217;.\u00c2\u00a0 This query can be accessed with two clicks: (click to enlarge)<br \/>\n<\/span><\/p>\n<p><a href=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/varFilterWPrec.png\" rel=\"lightbox\" data-rel=\"lightbox-image-9\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-82 size-medium\" src=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/varFilterWPrec-300x197.png\" alt=\"varFilterWPrec\" width=\"300\" height=\"197\" srcset=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/varFilterWPrec-300x197.png 300w, https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/varFilterWPrec-1024x673.png 1024w, https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/varFilterWPrec.png 1283w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><br \/>\n<br style=\"clear: both;\" \/><br \/>\n<span style=\"font-size: 12pt;\">\u00c2\u00a0<\/span><\/p>\n<p><span style=\"font-size: 18pt;\"><strong>Conclusion:<\/strong><\/span><\/p>\n<p><span style=\"font-size: 12pt;\">We tested three variant prediction algorithms for their ability to correctly score pathogenic and benign variations from a highly relevant ClinVar data set.\u00c2\u00a0 In this test, we believe that the DANN algorithm is clearly superior.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">CADD received a lot of attention upon its release, but DANN seems to have been overlooked.\u00c2\u00a0 We believe that DANN deserves more consideration.<br \/>\n<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">DANN scores have been integrated into our flagship product &#8211; Enlis Genome Research.\u00c2\u00a0 If you would like to give the software a try, let us know here:\u00c2\u00a0 <a href=\"https:\/\/www.enlis.com\/trial_request.html\" target=\"_blank\">https:\/\/www.enlis.com\/trial_request.html<\/a><\/span><\/p>\n<p><span style=\"font-size: 12pt;\">\u00c2\u00a0<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">Caveats:<\/span><\/p>\n<ul>\n<li><span style=\"font-size: 12pt;\">This is only one data set.\u00c2\u00a0 Other validation data sets may show different results.<\/span><\/li>\n<li><span style=\"font-size: 12pt;\">There are other prediction algorithms out there.\u00c2\u00a0 We could not possibly test them all.\u00c2\u00a0 I hope the field continues to improve!<\/span><\/li>\n<li><span style=\"font-size: 12pt;\">This last point cannot be emphasized enough:\u00c2\u00a0 <strong>A prediction is not evidence for functionality.<\/strong>\u00c2\u00a0 Variations that are suspected to be functional need to be validated.<\/span><\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<hr \/>\n<p>&nbsp;<\/p>\n<table style=\"width: 740px; height: 456px;\">\n<tbody>\n<tr>\n<td style=\"vertical-align: top;\"><img loading=\"lazy\" decoding=\"async\" class=\" size-thumbnail wp-image-83 alignnone\" src=\"https:\/\/www.enlis.com\/blog\/wp-content\/uploads\/2015\/03\/Devon_Jensen-150x150.jpg\" alt=\"Devon_Jensen\" width=\"150\" height=\"150\" \/><\/td>\n<td style=\"vertical-align: top;\"><span style=\"font-size: 14pt;\">ABOUT THE AUTHOR:<\/span><br style=\"clear: both;\" \/><br \/>\n<span style=\"font-size: 14pt;\">Devon Jensen, Ph.D.<\/span><br style=\"clear: both;\" \/><br \/>\n<span style=\"font-size: 12pt;\">Devon Jensen is the founder and original developer of Enlis Genomics.\u00c2\u00a0 Devon is a rare combination of scientific instinct, technical know-how, and entrepreneurial spirit. He received his Ph.D. in Molecular and Cell Biology from the University of California, Berkeley.\u00c2\u00a0 There, he studied protein secretion and neural tube disease under the guidance of 2013 Nobel Prize laureate Randy Schekman.\u00c2\u00a0 In addition to his extensive experience in genomic analysis, algorithm development, and user interface design, Devon is an accomplished entrepreneur. He also founded Enzymatic Software, LLC, which developed the popular Firefox add-on, Download Statusbar. After reaching over 3 million active daily users, the company was sold in 2011.<\/span><br \/>\n<br style=\"clear: both;\" \/><br \/>\nConnect with Devon on <a href=\"http:\/\/www.linkedin.com\/in\/devonjensen\" target=\"_blank\">Linkedin<\/a> or <a href=\"https:\/\/twitter.com\/devonjensen\" target=\"_blank\">Twitter<\/a><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>A test of the latest functional prediction algorithms: CADD, DANN, FATHMM &nbsp; Spoiler alert!\u00c2\u00a0 Summary for those short on time: Our conclusion:\u00c2\u00a0 Despite being released to little fanfare last October &#8211; we found that DANN offered the best sensitivity and specificity (highest true positives and lowest false positives).\u00c2\u00a0 DANN had less &#8216;noise&#8217; compared to both [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-65","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/www.enlis.com\/blog\/wp-json\/wp\/v2\/posts\/65","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.enlis.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.enlis.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.enlis.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.enlis.com\/blog\/wp-json\/wp\/v2\/comments?post=65"}],"version-history":[{"count":0,"href":"https:\/\/www.enlis.com\/blog\/wp-json\/wp\/v2\/posts\/65\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.enlis.com\/blog\/wp-json\/wp\/v2\/media?parent=65"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.enlis.com\/blog\/wp-json\/wp\/v2\/categories?post=65"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.enlis.com\/blog\/wp-json\/wp\/v2\/tags?post=65"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}