Results 1 to 2 of 2
2007-03-15, 20:47 #1
- Join Date
- Jan 2001
- Redcliff, Alberta, Canada
- Thanked 5 Times in 5 Posts
Mistaken Identifiers: Gene name errors (Excel 2003)
http://www.biomedcentral.com/1471-2105/5/80 You may be interested in this article
When processing microarray data sets, we recently noticed that some gene names were being changed inadvertently to non-gene names.
A little detective work traced the problem to default date format conversions and floating-point format conversions in the very useful Excel program package. The date conversions affect at least 30 gene names; the floating-point conversions affect at least 2,000 if Riken identifiers are included. These conversions are irreversible; the original gene names cannot be recovered.
Users of Excel for analyses involving gene names should be aware of this problem, which can cause genes, including medically important ones, to be lost from view and which has contaminated even carefully curated public databases. We provide work-arounds and scripts for circumventing the problem.
[b]Catharine Richardson (WebGenii)
WebGenii Home Page
Moderator: Spreadsheets, Other MS Apps, Presentation Apps, Visual Basic for Apps, Windows Mobile
2007-03-15, 20:57 #2
- Join Date
- Mar 2002
- Thanked 28 Times in 28 Posts
Re: Mistaken Identifiers: Gene name errors (Excel 2003)
Gives a whole new meaning to "genetically modified"...