We’ve already determined that 23andMe’s raw data offers the most health information among consumer genomic tests.
We’ve fixed problems with the SNP data, and reverse-engineered 23andMe’s proprietary insertions and deletions — allowing us to make the most accurate and comprehensive interpretation.
Now lets jump in and take a look at the health information that is available from 23andMe and from the raw data.
23andMe was launched with the promise of bringing personal genome information to consumers everywhere. For several years, they were able to provide information on both health-related traits and ancestry. Then, in 2013 the FDA stepped in to stop the delivery of results on health-related traits. Starting last month, 23andMe has revamped their service, and will now be able to offer some health reports, but not as many as before.
As we developed and tested Enlis Genome Personal – it became clear that the raw data from 23andMe contains significantly more health information than they are reporting in their health reports. That got us interested to put some numbers on just how much information there was in the raw data. First, we wanted to compare the count of diseases or health-related traits that are reported by 23andme vs. how many are found in the raw data.
The previous 23andMe health reports had 201 health-related diseases or traits, while the new reports only have 36. These 36 diseases are limited to carrier status on autosomal recessive disorders. An autosomal recessive disorder is one in which a person needs 2 bad copies of a gene to be affected by the disorder. With only 1 bad copy, that person is considered a non-affected ‘carrier’. Interestingly, 23andMe’s website claims that:
“Our tests can be used to determine carrier status in adults, but cannot determine if you have two copies of the genetic variant.”
Edit: This is surprising because the Illumina Infinium technology (the genotyping chip that 23andMe uses) tends to have low error rates and 23andMe has real world data on over 1 million customers now.  The FDA document about 23andMe’s approved Bloom Syndrome carrier test says that “all homozygous variant genotype samples receive a ‘no-call’ result, since the calling software was designed not to detect homozygous variant genotypes.” It sounds to me like they designed the software to ignore and throw out homozygous data.
With raw data imported into Enlis Genome Personal – there are over 2,000 diseases or health-related traits analyzed. Here is a complete list of the diseases and traits found in 23andMe’s raw data. (In this non-consolidated list, sub-types of diseases are listed separately)
A disease or trait can be caused by different genomic variants, and each of these variants can be tested by 23andMe. For instance, 23andMe reports on 28 different variants that are connected with Cystic Fibrosis. So how many total health-related variants are reported by 23andMe? And how many are in 23andMe’s raw data?
23andme has a long way to go to get back to reporting the same number of variants they were before the FDA ban. However – both the previous and new 23andMe reports pale in comparison to an analysis of the raw data. 23andMe’s new reports tell you about less than 1% of the health-related variants that are in their raw data.
How does this translate to the level of individual diseases? Let’s look at the count of variants for some specific diseases and inherited conditions:
Disease | Number of variants in previous 23andMe reports | Number of variants in new 23andMe reports | Number of variants tested in the raw data |
Beta Thalassemia | 17 | 10 | 43 |
BRCA1/2 Inherited Breast Cancer | 3 | 0 | 677 |
Cystic Fibrosis | 26 | 28 | 225 |
Gaucher Disease | 3 | 0 | 47 |
Hypertrophic Cardiomyopathy | 1 | 0 | 201 |
Li-Fraumeni syndrome | 0 | 0 | 32 |
Lynch Syndrome | 0 | 0 | 708 |
Marfan Syndrome | 0 | 0 | 100 |
Phenylketonuria | 27 | 0 | 95 |
Tay-Sachs Disease | 6 | 4 | 31 |
Usher Syndrome | 2 | 2 | 30 |
This is only a small sample of diseases and conditions, but you can see the enormous disparity between the number of variants reported by 23andMe and the number that is in the raw data.
If you want to get the most health information out of your 23andMe data, you need to get a third-party interpretation. Given our quality control and comprehensive, easy to use software, I think that our interpretation it the best. Give it a try here: