Flags and Lollipops

Monday, December 31, 2007

Open notebook - what's a disease again?

Is there a super-semantic-web-enabled phenotype database out there*? I want to ask a question like 'give a list of monogenic disorders whose locus has been confirmed by at least two labs, broken down by type of causative mutation type' and get an answer.

(* on a tangent: is 23andMe's gene book thing freely available?)

OMIM falls quite a long way short of this... it never set out to be a resource for programmatic access so you can't really blame them. The morbid map is available for download and contains all of the gene -> disorder mappings in their database.

A couple of issues:

  • OMIM's weird entry categorization system (#*%+...) is very confusing. There are 2229 'phenotypes' (note: not 'Mendelian phenotypes') with a known molecular basis in the database, apparently, but only 386 genes with a phenotype associated with them? Some of those phenotypes are going to be caused by gross insertions / deletions / whatever and not small mutations in single genes, multiple phenotypes might arise from different mutations in the same genes but even so... what's with the disparity?
  • It contains polygenic disorders (diabetes, schizophrenia) as well as monogenic ones
  • You can't tell which is which - you could count the number of genes associated with the disorder but a 'monogenic' disorder might be a complex one whose OMIM entry hasn't been updated yet
  • It's not a disease database - it has other phenotypes in it too. Longevity? Wet or dry ear wax? Novelty seeking personality?


The last point is interesting, really. When is a phenotype a disease? If you have a novelty seeking personality and so are relatively impulsive and prone to climbing mountains, swimming with sharks, cycling without a helmet etc. then are you ill?

Well, no, is the obvious answer. But where do you draw the line? Is autism a disease?

Neh. Beyond our remit. For us a monogenic disease = a clinically recognized disorder with a single, genetic cause.

Comments and trackbacks Feel free to post your comments OpenID mndoci Blogger Dan Anonymous bioinfblog . This post has trackbacks.

Trackbacks:

3 Comments:

At January 02, 2008 1:33 AM, OpenID mndoci said...

I don't think the Gene Book is freely available, but if I am wrong, PLEASE email me somebody.


Great series by the way!!! Looking forward to the whole thing

 
At January 02, 2008 1:08 PM, Blogger Dan said...

Hmm one of the problems with a "super-semantic-web-enabled phenotype database" is that there (probably) isn't a defined medical phenotype ontology in wide use. I know JAX use one for their mouse strains (the Mammalian Phenotype Ontology) but I don't see this being useful unless everyone a) uses it and b) reports it. OMIM focuses I assume at the disease level, but with multiple phenotypes being share with multiple disorders, I imagine the crosstalk on multiphenotype (but not multigene) disorders is quite interesting.

A typically frustrating bioinformatics issue...

 
At January 02, 2008 11:32 PM, Anonymous bioinfblog said...

some folks at the NIA/NIH put this together: http://geneticassociationdb.nih.gov/
i think it was published in nature genetics a while back and you can download the whole thing

 

Post a Comment

<< Home


See all posts from: July 2005 August 2005 September 2005 October 2005 November 2005 December 2005 January 2006 February 2006 March 2006 April 2006 May 2006 June 2006 July 2006 September 2006 October 2006 November 2006 December 2006 January 2007 February 2007 March 2007 April 2007 May 2007 June 2007 July 2007 August 2007 October 2007 November 2007 December 2007 January 2008 February 2008 March 2008