RESOURCE

Detecting genetic variation in microarray expression data

    • 1 The Salk Institute for Biological Studies, La Jolla, California 92037, USA;
    • 2 Neurosciences Graduate Program, School of Medicine, University of California, San Diego, California 92093, USA;
    • 3 Biomedical Sciences Graduate Program, School of Medicine, University of California, San Diego, California 92093, USA;
    • 4 Polymorphism Research Laboratory, Department of Psychiatry, University of California, San Diego, California 92093, USA;
    • 5 Genes and Disease Program, Center for Genomic Regulation (CRG-UPF), Barcelona 08003, Spain;
    • 6 Brain Cells, Inc., San Diego, California 92121, USA;
    • 7 Amicus Therapeutics, Cranbury, New Jersey 08512, USA
    • 8 These authors contributed equally to this work.
    • 9 Corresponding author. E-mail [email protected]; fax (609) 662-2001.
Published July 3, 2007. Vol 17 Issue 8, pp. 1228-1235. https://doi.org/10.1101/gr.6307307
Download PDF Please log-in to or register for your personal account in order to access PDF Cite Article Permissions Share
cover of Genome Research Vol 36 Issue 4
Current Issue:

Abstract

The use of high-density oligonucleotide arrays to measure the expression levels of thousands of genes in parallel has become commonplace. To take further advantage of the growing body of data, we developed a method, termed “GeSNP,” to mine the detailed hybridization patterns in oligonucleotide array expression data for evidence of genetic variation. To demonstrate the performance of the algorithm, the hybridization patterns in data obtained previously from SAMP8/Ta, SAMP10/Ta, and SAMR1/Ta inbred mice and from humans and chimpanzees were analyzed. Genes with consistent strain-specific and species-specific hybridization pattern differences were identified, and ∼90% of the candidate genes were independently confirmed to harbor sequence differences. Importantly, the quality of gene expression data was also improved by masking the probes of regions with putative sequence differences between species and strains. To illustrate the application to human disease groups, data from an inflammatory bowel disease study were analyzed. GeSNP identified sequence differences in candidate genes previously discovered in independent association and linkage studies and uncovered many promising new candidates. This approach enables the opportunistic extraction of genetic variation information from new or pre-existing gene expression data obtained with high-density oligonucleotide arrays.

Loading
Loading
Back to top