Initial import of new posts
This commit is contained in:
parent
e4bafbb361
commit
0e12688f04
391 changed files with 14594 additions and 0 deletions
42
_posts/2007-11-15-gene-identifiers.markdown
Normal file
42
_posts/2007-11-15-gene-identifiers.markdown
Normal file
|
@ -0,0 +1,42 @@
|
|||
---
|
||||
author: einar
|
||||
comments: true
|
||||
date: 2007-11-15 19:57:16+00:00
|
||||
layout: post
|
||||
slug: gene-identifiers
|
||||
title: Gene identifiers
|
||||
wordpress_id: 336
|
||||
categories:
|
||||
- Science
|
||||
tags:
|
||||
- annotation
|
||||
- bioinformatics
|
||||
- microarray
|
||||
- python
|
||||
---
|
||||
|
||||
While working today on an annotation class in Python I stumbled on a problem. Normally I work with lists of genes that are consistent, i.e. all Entrez Gene IDs (or RefSeq IDs, or Genome Browser IDs...), but today I had a list of mixed identifiers.
|
||||
|
||||
The subsequent idea was "let's implement auto-detection of common identifiers in the class". The problem is... is there any actual documentation on how identifiers are made? So far, using regular expressions, I've tracked down a few:
|
||||
|
||||
|
||||
|
||||
|
||||
* RefSeq
|
||||
|
||||
|
||||
* GenBank
|
||||
|
||||
|
||||
* Entrez Gene
|
||||
|
||||
|
||||
* UCSC Genome Browser
|
||||
|
||||
|
||||
* Ensembl
|
||||
|
||||
|
||||
However, I have no idea if I have implemented all types of these IDs. Does anyone know a place where to look these information up?
|
||||
|
||||
(On a related note: my thesis defense will be on January 14th, 2008, so I have to get the printing going)
|
Reference in a new issue