Gene expression profiling of human breast tissue samples using SAGE-Seq

  1. X. Shirley Liu1,2
  1. 1 Department of Biostatistics and Computational Biology Dana-Farber Cancer Institute, Boston, Massachusetts 02115, USA;
  2. 2 Harvard School of Public Health, Boston, Massachusetts 02115, USA;
  3. 3 Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, Massachusetts 02115, USA;
  4. 4 Department of Medicine, Brigham and Women's Hospital, Boston, Massachusetts 02115, USA;
  5. 5 Department of Medicine, Harvard Medical School, Boston, Massachusetts 02115, USA;
  6. 6 Vavilov Institute for General Genetics, Russian Academy of Sciences, Moscow 119331, Russia;
  7. 7 Johns Hopkins Oncology Center, Baltimore, Maryland 21231, USA;
  8. 8 Department of Statistics, Harvard University, Science Center 715, Cambridge, Massachusetts 02138, USA
    1. 9 These authors contributed equally to this work.

    Abstract

    We present a powerful application of ultra high-throughput sequencing, SAGE-Seq, for the accurate quantification of normal and neoplastic mammary epithelial cell transcriptomes. We develop data analysis pipelines that allow the mapping of sense and antisense strands of mitochondrial and RefSeq genes, the normalization between libraries, and the identification of differentially expressed genes. We find that the diversity of cancer transcriptomes is significantly higher than that of normal cells. Our analysis indicates that transcript discovery plateaus at 10 million reads/sample, and suggests a minimum desired sequencing depth around five million reads. Comparison of SAGE-Seq and traditional SAGE on normal and cancerous breast tissues reveals higher sensitivity of SAGE-Seq to detect less-abundant genes, including those encoding for known breast cancer-related transcription factors and G protein–coupled receptors (GPCRs). SAGE-Seq is able to identify genes and pathways abnormally activated in breast cancer that traditional SAGE failed to call. SAGE-Seq is a powerful method for the identification of biomarkers and therapeutic targets in human disease.

    Footnotes

    • Received April 1, 2010.
    • Accepted September 24, 2010.

    Articles citing this article

    | Table of Contents

    Preprint Server