Flags and Lollipops

Wednesday, October 19, 2005

Retrieving arbitrary sequences with web services

I dig SOAP (once it's been abstracted out for me by a helper library: maybe I should say that I dig SOAP::Lite). It's good to see that bioinformatics centres are providing web service interfaces to their data.

One thing, though - quite often I want to retrieve DNA sequences that aren't centered around genes (gene-centric genomics? how very last century). And that's strangely difficult with existing bioinformatics web services (it's easy enough with the Ensembl API or with wget and the right URL for UCSC, but not quick).

For some reason they're all based around retrieving sequences by identifier, a la RefSeq. Doesn't seem much good if you're not looking at a feature already defined somewhere. What I'd really like to see is a server which takes in SOAP requests (or, even simpler, uses REST). It'd take four parameters:
  • Build
  • Chr
  • Start
  • End
and return the relevant base sequence. If you really wanted to be fancy you could put repeatmasked bases in lowercase or something. Simple, no?

Does such a server already exist? If not, why not?

Comments and trackbacks Feel free to post your comments . This post has trackbacks.

Trackbacks:

0 Comments:

Post a Comment

<< Home


See all posts from: July 2005 August 2005 September 2005 October 2005 November 2005 December 2005 January 2006 February 2006 March 2006 April 2006 May 2006 June 2006 July 2006 September 2006 October 2006 November 2006 December 2006 January 2007 February 2007 March 2007 April 2007 May 2007 June 2007 July 2007 August 2007 October 2007 November 2007 December 2007 January 2008 February 2008 March 2008