Functional Cloning, Sorting, and Expression Profiling of Nucleic Acid-Binding Proteins

  1. Y. Ramanathan1,
  2. Haibo Zhang1,2,
  3. Virginie Aris1,2,
  4. Patricia Soteropoulos1,3,
  5. Stuart A. Aaronson4, and
  6. Peter P. Tolias1,3,5
  1. 1Center for Applied Genomics, Public Health Research Institute, International Center for Public Health W420M, Newark, New Jersey 07103, USA; 2Center for Computational Biology and Bioengineering, New Jersey Institute of Technology, New Jersey 07102, USA; 3Department of Microbiology and Molecular Genetics, University of Medicine and Dentistry of New Jersey-New Jersey Medical School, Newark, New Jersey 07103, USA; 4The Derald H. Ruttenberg Cancer Center, Mount Sinai School of Medicine of New York University, New York, New York 10029, USA

Abstract

A major challenge in the post-sequencing era is to elucidate the activity and biological function of genes that reside in the human genome. An important subset includes genes that encode proteins that regulate gene expression or maintain the structural integrity of the genome. Using a novel oligonucleotide-binding substrate as bait, we show the feasibility of a modified functional expression-cloning strategy to identify human cDNAs that encode a spectrum of nucleic acid-binding proteins (NBPs). Approximately 170 cDNAs were identified from screening phage libraries derived from a human colorectal adenocarcinoma cell line and from noncancerous fetal lung tissue. Sequence analysis confirmed that virtually every clone contained a known DNA- or RNA-binding motif. We also report on a complementary sorting strategy that, in the absence of subcloning and protein purification, can distinguish different classes of NBPs according to their particular binding properties. To extend our functional annotation of NBPs, we have used GeneChip expression profiling of 14 different breast-derived cell lines to examine the relative transcriptional activity of genes identified in our screen and cluster analysis to discover other genes that have similar expression patterns. Finally, we present strategies to analyze the upstream regulatory region of each gene within a cluster group and select unique combinations of transcription factor binding sites that may be responsible for dictating the observed synexpression.

[The following individual kindly provided reagents, samples, or unpublished information as indicated in the paper: M. Stempher.]

Footnotes

  • 5 Corresponding author.

  • E-MAIL tolias{at}phri.org; FAX (973) 854-3453.

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.156002.

    • Received February 5, 2002.
    • Accepted May 29, 2002.
| Table of Contents

Preprint Server