dennogumi.org-archive/_posts/2007-11-15-gene-identifiers.markdown at master

Archived

This repository has been archived on 2021-01-06. You can view files and clone it, but you cannot make any changes to it's state, such as pushing and creating new issues, pull requests or comments.

Luca Beltrame e53510fd98 First batch of changes for new banner for science posts

2015-05-24 01:19:13 +02:00

1.1 KiB

Raw Permalink Blame History

author

comments

date

layout

slug

title

wordpress_id

tags

einar

true

2007-11-15 19:57:16+00:00

page

gene-identifiers

Gene identifiers

336

Science

image_fullwidth
banner_other.jpg

annotation

bioinformatics

microarray

python

While working today on an annotation class in Python I stumbled on a problem. Normally I work with lists of genes that are consistent, i.e. all Entrez Gene IDs (or RefSeq IDs, or Genome Browser IDs...), but today I had a list of mixed identifiers.

The subsequent idea was "let's implement auto-detection of common identifiers in the class". The problem is... is there any actual documentation on how identifiers are made? So far, using regular expressions, I've tracked down a few:

RefSeq
GenBank
Entrez Gene
UCSC Genome Browser
Ensembl

However, I have no idea if I have implemented all types of these IDs. Does anyone know a place where to look these information up?

(On a related note: my thesis defense will be on January 14th, 2008, so I have to get the printing going)

1.1 KiB Raw Permalink Blame History

1.1 KiB

Raw Permalink Blame History