RT Journal A1 Clark, Hilary F. A1 Gurney, Austin L. A1 Abaya, Evangeline A1 Baker, Kevin A1 Baldwin, Daryl A1 Brush, Jennifer A1 Chen, Jian A1 Chow, Bernard A1 Chui, Clarissa A1 Crowley, Craig A1 Currell, Bridget A1 Deuel, Bethanne A1 Dowd, Patrick A1 Eaton, Dan A1 Foster, Jessica A1 Grimaldi, Christopher A1 Gu, Qimin A1 Hass, Philip E. A1 Heldens, Sherry A1 Huang, Arthur A1 Kim, Hok Seon A1 Klimowski, Laura A1 Jin, Yisheng A1 Johnson, Stephanie A1 Lee, James A1 Lewis, Lhney A1 Liao, Dongzhou A1 Mark, Melanie A1 Robbie, Edward A1 Sanchez, Celina A1 Schoenfeld, Jill A1 Seshagiri, Somasekar A1 Simmons, Laura A1 Singh, Jennifer A1 Smith, Victoria A1 Stinson, Jeremy A1 Vagts, Alicia A1 Vandlen, Richard A1 Watanabe, Colin A1 Wieand, David A1 Woods, Kathryn A1 Xie, Ming-Hong A1 Yansura, Daniel A1 Yi, Sothy A1 Yu, Guoying A1 Yuan, Jean A1 Zhang, Min A1 Zhang, Zemin A1 Goddard, Audrey A1 Wood, William I. A1 Godowski, Paul T1 The Secreted Protein Discovery Initiative (SPDI), a Large-Scale Effort to Identify Novel Human Secreted and Transmembrane Proteins: A Bioinformatics Assessment JF Genome Research JO Genome Research YR 2003 FD October 01 VO 13 IS 10 SP 2265 OP 2270 DO 10.1101/gr.1293003 UL http://genome.cshlp.org/content/13/10/2265.abstract AB A large-scale effort, termed the Secreted Protein Discovery Initiative (SPDI), was undertaken to identify novel secreted and transmembrane proteins. In the first of several approaches, a biological signal sequence trap in yeast cells was utilized to identify cDNA clones encoding putative secreted proteins. A second strategy utilized various algorithms that recognize features such as the hydrophobic properties of signal sequences to identify putative proteins encoded by expressed sequence tags (ESTs) from human cDNA libraries. A third approach surveyed ESTs for protein sequence similarity to a set of known receptors and their ligands with the BLAST algorithm. Finally, both signal-sequence prediction algorithms and BLAST were used to identify single exons of potential genes from within human genomic sequence. The isolation of full-length cDNA clones for each of these candidate genes resulted in the identification of >1000 novel proteins. A total of 256 of these cDNAs are still novel, including variants and novel genes, per the most recent GenBank release version. The success of this large-scale effort was assessed by a bioinformatics analysis of the proteins through predictions of protein domains, subcellular localizations, and possible functional roles. The SPDI collection should facilitate efforts to better understand intercellular communication, may lead to new understandings of human diseases, and provides potential opportunities for the development of therapeutics.