Power and sample size for DNA microarray studies

Stat Med. 2002 Dec 15;21(23):3543-70. doi: 10.1002/sim.1335.

Abstract

A microarray study aims at having a high probability of declaring genes to be differentially expressed if they are truly expressed, while keeping the probability of making false declarations of expression acceptably low. Thus, in formal terms, well-designed microarray studies will have high power while controlling type I error risk. Achieving this objective is the purpose of this paper. Here, we discuss conceptual issues and present computational methods for statistical power and sample size in microarray studies, taking account of the multiple testing that is generic to these studies. The discussion encompasses choices of experimental design and replication for a study. Practical examples are used to demonstrate the methods. The examples show forcefully that replication of a microarray experiment can yield large increases in statistical power. The paper refers to cDNA arrays in the discussion and illustrations but the proposed methodology is equally applicable to expression data from oligonucleotide arrays.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Analysis of Variance
  • Animals
  • Bayes Theorem
  • DNA, Complementary / genetics
  • Kidney Diseases, Cystic / genetics
  • Mice
  • Models, Statistical*
  • Morphine / metabolism
  • Mutation
  • Narcotics / metabolism
  • Oligonucleotide Array Sequence Analysis / methods*
  • Reproducibility of Results
  • Research Design*
  • Sample Size
  • Spinal Cord / metabolism
  • Substance-Related Disorders / metabolism

Substances

  • DNA, Complementary
  • Narcotics
  • Morphine