Abstract

Recent genetic analyses in worms, flies, and mammals illustrate the importance of bioactive peptides in controlling numerous complex behaviors, such as feeding and circadian locomotion. To pursue a comprehensive genetic analysis of bioactive peptide signaling, we have scanned the recently completed Drosophila genome sequence for G protein-coupled receptors sensitive to bioactive peptides (peptide GPCRs). Here we describe 44 genes that represent the vast majority, and perhaps all, of the peptide GPCRs encoded in the fly genome. We also scanned for genes encoding potential ligands and describe 22 bioactive peptide precursors. At least 32 Drosophila peptide receptors appear to have evolved from common ancestors of 15 monophyletic vertebrate GPCR subgroups (e.g., the ancestral gastrin/cholecystokinin receptor). Six pairs of receptors are paralogs, representing recent gene duplications. Together, these findings shed light on the evolutionary history of peptide GPCRs, and they provide a template for physiological and genetic analyses of peptide signaling in Drosophila.


The recent publication of entire genomes for the worm, the fly, and human species has initiated the era of functional genomic analysis. The experiences to date have indicated that such analysis involves multiple stages, in which improvements are recorded as the databases are completed and analytic programs become more precise (Reese et al. 2000), and as more comparative information is made available (Sonnhammer et al. 1997). G protein-coupled receptors (GPCRs) provide sensitivity to a variety of environmental, developmental, and physiological signals. They display a uniform topology with seven transmembrane (TM) domains and represent one of the largest recognizable groups of proteins (Bockaert and Pin 1999). Here we have organized all genomic sequences that encodeDrosophila GPCRs to identify and classify those devoted to peptide hormone and neuropeptide ligands (peptide GPCRs).

Given the availability of the human and mouse genomic sequences, what can we gain by a thorough analysis of Drosophila peptide GPCRs? We propose two reasons to motivate such efforts. First, our understanding of GPCR signaling mechanisms appears incomplete. Recent advances have indicated means by which GPCR signaling potential may be increased, including receptor oligomerization and association with a variety of accessory proteins (Bockaert and Pin 1999), and receptor translocation to the nucleus (Chen et al. 2000). There is great need, therefore, to address new hypotheses of GPCR signaling mechanisms in vivo. For this purpose, it will be very helpful to use the powerful tools for genetic analysis that are afforded by model organisms such asDrosophila. The second reason we favor the pursuit ofDrosophila GPCRs invokes the success of genetic analysis in another model genetic system, Caenorhabditis elegans. In the past few years, rapid progress has been made in the analysis of insulin signaling in the worm; C. elegans insulin regulates metabolism, development, and longevity by mechanisms that are similar to the endocrine regulation of metabolism and fertility by mammalian insulin (Kimura et al. 1997; Tissenbaum and Ruvkun 1998). In addition, the genetic analysis has extended our understanding of insulin signaling by revealing novel molecular features that may be variant in diabetic pedigrees (Ogg et al. 1997; Ogg and Ruvkun 1998). Although insulin binds to a different class of receptors, it is likely that the same rapid development of new information will accompany the genetic analysis of peptide GPCRs in Drosophila.

In a recent review, Brody and Cravchik (2000) began the process of categorizing Drosophila GPCRs by describing ∼100 genes, including 21 receptors for classical neurotransmitters and neuromodulators (biogenic amines, related compounds, and purines) and 26–30 peptide receptor genes. We have extended that analysis by re-searching the original genomic sequences for peptide receptors (we found one additional GPCR) and by improving the annotations of 20 previously-predicted genes. We classified receptor genes according to phylogenetic trees constructed with the aid of the Pfam 7 TM databases (Bateman et al. 2000). In addition, we refined the DrosophilaGPCR classifications by incorporating information we deduced by examining gene organizations. Through this analysis, we expanded the set of known and candidate peptide GPCRs from ∼30 to ∼45. Finally, to gain a sense of the potential peptide ligands, we assembled a list of 22 Drosophila genes known or predicted to encode bioactive peptides that may activate these receptors. Together, these results shed light on the evolutionary history of neuropeptide signaling. They also are intended to aid in future efforts to analyze peptide receptor function in development, physiology, and behavior by using the power of Drosophila genetics.

RESULTS AND DISCUSSION

We searched the Drosophila melanogaster genome sequence with the goal of identifying all peptide GPCRs. Initially, we scanned the gene annotations developed jointly by the BerkeleyDrosophila Genome Project (BDGP) and Celera Genomics for all putative GPCRs (Adams et al. 2000; Brody and Cravchik 2000). Based onBLASTP scores obtained with each sequence, we excluded entries that are likely to represent nonpeptide GPCRs (neurexins, HE6- and methuselah-related proteins, rhodopsins, developmental genes, taste and odorant receptors, and receptors for biogenic amines and other small neurotransmitters). The remaining set of 44 known or putative peptide receptors and unclassified GPCRs was retained for further analysis (Table 1).

Table 1.

Cloned and Candidate Drosophila Neuropeptide Receptors

Gene Synonyms Annotation corrected Molecularly cloned ESTs (#)
AlstR DAR-1, EG:121E7.2, CG2872 no[iii] Birgül et al. 1999; Lenz et al. 2000a
BG:BACR48G21.1 n/ano
CG1147 yesno
CG2114 nono
CG4187 yesno
CG4395 yesno
CG5042 CG5046 yesno
CG5811 NepYr, NYR, PR4 no Li et al. 1992b; St.-Onge et al. 2000
CG5911 yes, splice variantsno
CG5936 yesno
CG6111 nono
CG6857 yesno
CG6881 CG6894 yesno
CG6986 yesno
CG7285 yesno
CG7395 CG18639 yesnoGH23382 (1)
CG8422 nonoGH15162 (5)
CG8784 nono
CG8795 nono
CG8985 no, (RNA editing[ii])noHL01032 (1)
CG9918 nono
CG10001 DAR-2 no Lenz et al. 2000b
CG10626 nono
CG10698 nono
CG10823 yesno
CG12370 nono
CG12610 CG15049, CG15050 yesno
CG13229 nono
CG13575 nono
CG13702 yesno
CG13758 EG:BACR25B3.3 nono
CG13803 nono
CG13995 nono
CG14003 yesno
CG14484 CG18192 yesno
CG14575 yesno
CG14593 CG14594 yesno
CG16726 yesno
CG17415 yesno
Fsh DLGR-1, CG7665 no Hauser et al. 1997
GRHR CG11325 no Hauser et al. 1998 GH20680 (3)
rickets (rk) DLGR-2, CG8930 no[iii] Eriksen et al. 2000
Takr86C NKD, CG6515 no Monnier et al. 1992
Takr99D DTKR, CG7887 no[iii] Li et al. 1991;1992b GH10154 (4)

[i] EST, representative expressed sequence tag sequence (Rubin et al. 2000), with the number of ESTs given in parentheses.

[ii] Evidence of RNA editing from comparison of EST versus genomic DNA sequence (both from same genotype).

[iii] CG annotations incomplete, but complete sequences provided in independently published reports.

[iv] n/a, not applicable (not previously annotated).

To gauge the completeness of this set, we scanned theDrosophila genome for additional, nonannotated peptide GPCRs in three ways. First, we scanned a set of annotations of GPCR genes obtained through a GENSCAN search of the entireDrosophila genome (K. Scott and L. Vosshall, pers. comm.). This list contained one receptor sequence, BG:BACR48G21.1(BACR48G21.1), which had not been annotated previously. Second, we performed BLASTP searches using a “GPCR query set,” which included the previously cloned or annotated peptide, amine, and related/unclassified Drosophila GPCRs as well as ∼200 sequences representing a diverse set of Family A GPCRs (unless indicated, we used the nomenclature for GPCR Families and Groups given by Kolakowski [1994] [see http://www.gcrdb.uthscsa.edu/]). All of the putative peptide receptors listed in Table 1 (including BACR48G21.1) were detected with several queries (for the vast majority, more than 30 times). However, this analysis did not reveal any additional candidate peptide GPCRs. Finally, we used the GPCR query set to perform TBLASTN searches of the Celera/BDGP whole genome shotgun sequence. As with the BLASTP survey, theTBLASTN search yielded scaffold sequences corresponding to all of the GPCRs on our list, but no additional candidate peptide receptors. The whole genome shotgun sequence currently represents ∼98% of the Drosophila eukaryotic genome (Adams et al. 2000). Therefore, we conclude that the set of 45 cloned and candidate peptide GPCRs is essentially complete.

We focused on sequences encoding the seven TM domains. More than 50% of the BDGP/Celera annotations for GPCRs in this set (23 out of 43) were missing sequences representing one or more TM domains or, in the case of two receptors (CG4187 and CG5042), an N-terminal domain containing conserved, leucine-rich repeats (Table 1). In three cases, the correct gene sequences were published previously (Li et al. 1991;Ashburner et al. 1999; Birgül et al. 1999). We revised the remaining 20 incorrect or incomplete annotations using software-based gene prediction methods and manual inspection. For six GPCRs, there were two to three neighboring annotations that contained nonoverlapping GPCR sequence motifs. In each of these cases, we did not detect open reading frames encoding conserved TM domains in the intervening genomic sequence. Therefore, we merged these sequences to generate single, revised annotations. Additionally, for CG5911, we detected two adjacent sets of exons encoding alternative versions of TM4–7, including a conserved splice acceptor site in TM4. Thus, CG5911 appears to encode two distinct receptor isoforms (50% identical in TM5–7) through alternative splicing. Although final confirmation of these predictions will require direct sequencing of cDNAs, we conclude that our revised annotations are of sufficiently high quality to perform phylogenetic analysis, based on the presence of well conserved motifs (Baldwin et al. 1997; Tams et al. 1998) throughout the TM domains of each of these receptors.

After assembling the list of 45 cloned and candidate peptide GPCRs, we classified these proteins based on BLASTP scores, on the locations of these receptors on representative phylogenetic trees for each GPCR Family (A and B), and on the degree to which these locations were supported by bootstrap analysis. We examined the genomic location of each peptide GPCR as well as all biogenic amine and small transmitter GPCRs to identify linked (possibly paralogous) genes. To detect conserved gene organizations, we noted the intron locations and phasing for each cloned and candidate peptide GPCR within the TM regions, as well as for a few related vertebrate GPCRs. The intron analysis of the vertebrate GPCRs was not comprehensive, in part because >93% of vertebrate GPCR genes lack introns within the coding sequence (Gentles and Karlin 1999). Finally, based on the results of the above tests, we were able to place most of the Family A receptors in one of seven alignments, each of which contained one or more related receptor subgroups.

We found strong evidence supporting the classification of 32 receptors as peptide GPCRs (Table 2). In addition, there were two receptors that are clear orthologs of the orphan receptor, LGR7, a member of a receptor clade containing several peptide GPCRs. For seven additional receptors, we found weaker evidence to indicate that they are peptide GPCRs. Finally, we regard four of the receptors as unclassifiable. Interestingly, we found at least six pairs of paralogs (variants generated by gene duplications or other processes), most of which appear to be related to common ancestors of vertebrate GPCR subgroups (rather than derived independently from vertebrate paralogs; e.g., Fig. 1A,D [see below]). Based on the presence of ESTs and/or cDNAs (Table 1), at least 25% of the genes in Table 1 are expressed. Pseudogenes are rare in Drosophila(Petrov and Hartl 2000), and based on strong sequence conservation in the TM domains, most of the remaining genes are likely to be expressed as well. Thus, we conclude that there are 39–41 peptide GPCRs in Drosophila, with an additional four GPCRs that may later be included in this category. The following sections describe this analysis in detail and provide a listing of potential cognate ligands.

Table 2.

Classification of Cloned and Candidate DrosophilaNeuropeptide Receptors by BLAST and Phylogenetic Analysis

Gene F/G Best BLAST hit(s) Phylogenetic analysis full tree neighbors Bootstrap
Diagnostic P fly P Worm P
CG6857 A/III-BCCKR35 CG6881 38GASR, CCKR>900
CG6881 A/III-BGASR45 CG6857 71GASR, CCKR>900
CG5811 A/III-BGRL10658 CG10626 48GRL106>700
CG10626 A/III-BLKR77 CG5811 38LKR,  LSR>990
Takr86C A/III-BNK3R[ii] 65 Takr99D 72NKR>500
Takr99D A/III-BNK3R64 Takr86C 67NKR>500
CG10823 A/III-BNFFR45 CG5811 43 C50F7.1 38OXR<300
BG:BACR48G21.1 A/III-BNK2R20 Takr86C 21NMBR/GRPR/BR<100
CG1147 A/III-BNY2R31 C25G6.5 33NPYR<500
CG7395 A/III-BNY2R40 F41E7.3 32NY2R<500
CG12610 A/III-BNPYR14PRPR>500
CG13995 A/IIIGASR6NMBR/GRPR/BR<100
CG14593 A/III-BBRS436 CG14484 52NMBR/GRPR/BR>700
CG14484 A/III-BGRPR40 CG14593 43NMBR/GRPR/BR>700
CG8784 A/III-BNMUR36 CG8795 117NTR/GHSR>700
CG8795 A/III-BNMUR39 CG8784 114NTR/GHSR>700
CG9918 A/III-BNMUR34 CG8784 61NTR/GHSR>700
CG14575 A/III-BNMUR42 CG8784 39 C48C5.1 39NTR/GHSR>700
CG14003 A/III?GHSR19 CG10001 20NTR/GHSR<100
CG5911A A/III-BTRFR25TRFR<500
CG5911B A/III-BGHSR28ND
CG2114 A/III?TRFR11 T14C1.1[iii] 28TRFR<300
CG16726 A/III?NK3R14TRFR<300
CG6986 A/III?TRFR10 CG16726 11TRFR<300
CG13575 A/IIIGHSR, LSR8 CG10626 9GPRJ<300
CG8985 A/III?AlstR4 CG13803 130 F57B7.1[iii] 29TRFR<300
CG13229 A/III?NY5R6 CG13803 46 F57B7.1[iii] 31TRFR<300
CG13803 A/III?AlstR7 CG8985 130 F57B7.1[iii] 25TRFR<300
CG5936 A/III?NTR2 B0563.6[iii] 18TRFR<300
AlstR A/VGALR41 CG10001 67 ZK455.3 43GALR/GALS/GALT<500
CG10001 A/VGALR28 AlstR 53 ZK455.3 31GALR/GALS/GALT<500
CG7285 A/VSSR249 CG13702 78SSR<500
CG13702 A/VSSR250 CG7285 75SSR<500
GRHR A/VGRHR51GRHR/VPR/OXYR>700
CG10698 A/VGRHR36 GRHR 37GRHR/VPR/OXYR>700
CG6111 A/VV1BR38GRHR/VPR/OXYR>700
Fsh A/V (c)LSHR66FSHR/TSHR/LSHR/LGR4–6>990
rk A/V (c)LSHR68FSHR/TSHR/LSHR/LGR4–6>990
CG4187 A/V (c)GRL101λ57 CG5042 43LGR7>900
CG5042 A/V (c)GRL101λ34 CG4187 40LGR7>900
CG4395 B/ICALR22 CG17415 24CALR/CGRR>500
CG17415 B/ICALR44CALR/CGRR>500
CG13758 B/I–IIICALR38 C13B9.4 50B/I–III<500
CG8422 B/IDIHR54 CG12370 65DIHR>900
CG12370 B/IDIHR71 CG8422 81DIHR>900

[i] F/G, Family/Group classification. Diagnostic, Highest scoring receptor with known ligand/function. Fly, Highest scoringDrosophila GPCR (with P value near or exceeding the best diagnostic score). Worm, Highest scoring Caenorhabditis elegans GPCR (with P value near or exceeding the best diagnostic score). P, P value for respective diagnostic, fly or worm GPCR (le−x, where x is the value displayed in the table).

[ii] P value for stable fly TKR is 74.

[iii] Several other C. elegans GPCRs displayed Pvalues exceeding the value obtained for the top “diagnostic” hit. CG10001 tends to be promiscuous on BLASTP with other, even very distantly related, fly receptors. λ, GRL101, as well as the other members of this class of GPCRs, is an orphan receptor (LGR7 is not in the public sequence databases).

[iv] ND, not done. (The remaining receptor abbreviations are described in the legends of Figs. 1 F2 F3.)

Figure 1.

Neighbor-joining phylogenetic trees for the Family A, Group III-B receptors. For Family and Group classifications, see Kolakowski (1994) (see http://www.gcrdb.uthscsa.edu/). (A) Rooted tree for the gastrin/cholecystokinin (CCK) receptors. (B) Unrooted tree for the neurokinin receptors (NKRs) and related GPCRs. The midpoint of the tree is indicated with an “X.” (C)Unrooted tree for the neuropeptide Y (NPY) receptors (NPYRs) and prolactin releasing receptors (PRPRs). (D) Rooted tree for the bombesin/gastrin releasing peptide receptors. (E) Unrooted tree for the neuromedin U receptors (NMURs), growth hormone secretagogue receptors (GHSRs), neurotensin receptors (NTRs), thyrotropin releasing hormone receptors (TRFRs), and a large family of related orphan receptors from Caenorhabditis elegans. The C. elegans orphan receptors included here belong to one of three clades (classes A–C). (*) Omitted receptors are additional C. elegansorphan GPCRs; (**) omitted receptors are additional GHSRs and closely related orphan GPCRs. In A and D, a monophyletic set of 26 biogenic amine receptors (not shown) was used as the outgroup to determine the root of the trees (see Methods). In C andE, the location of the tree midpoint was ambiguous and is therefore not indicated. Portions of the trees representing groups of closely related receptors were omitted (the number of related receptors on each branch is indicated in parentheses). Drosophila GPCRs are listed in bold and italics. (BRS3) Bombesin receptor subtype 3; (BRS4) bombesin receptor subtype 4; (CCKR) CCK receptor type A; (CCKR XL) Xenopus laevis CCKR; (GASR) gastrin/CCK receptor type B; (GCRC) glucocorticoid-induced receptor; (GRL106) Lymnaea stagnalis cardioexcitatory receptor; (GRPR) gastrin releasing peptide (GRP) receptor; (LKR) Boophilus microplus (tick) leucokinin-like peptide receptor; (LSR) L. stagnalislymnokinin receptor; (NFFR) neuropeptide FF/neuropeptide AF receptor; (NK1R–NK3R) NKR types 1–3; (NMU1R and NMU2R) neuromedin U receptor types 1 and 2; (NPR-1) product of the C. elegans npr-1 gene; (NPYRYA–NPYRYC) orphan zebrafish NPYRs; (NPYRB) Gadus morhua(Atlantic cod) NPYR; (NTR1 and NTR2) neurotensin receptor types 1 and 2; (NY1R–NY6R) NPY receptor types 1–6; (OT7T022) putative mammalian RFRP receptor; (OXR) orexin/hypocretin receptor; (STKR) Stomoxys calcitrans (stable fly) tachykinin receptor. The remaining non-Drosophila sequences are orphan GPCRs from C. elegans. Symbols denote bootstrap support, out of 1000 replicates, that was >500: (filled circles) >990; (open circles) >900; (open squares) >700; (open triangles) >500.

15f1_L1TT_rev1

Overview of the Drosophila Peptide GPCRs

Together, the set of known and candidate Drosophila peptide GPCRs contains representatives of at least 15 monophyletic vertebrate GPCR subgroups. Family A/Group III-B contains the largest number ofDrosophila peptide GPCRs (at least 19; Table 2). These include 17 Drosophila representatives of six vertebrate GPCR subgroups: the gastrin/cholecystokinin, neurokinin, neuropeptide FF and hypocretin/orexin, neuropeptide Y, bombesin/gastrin releasing peptide, and neurotensin receptors (and the neurotensin-related receptors for neuromedin U, growth hormone secretagogue, and thyrotropin releasing hormone). Family A/Group V also contained a large number ofDrosophila peptide GPCRs (11; Table 2), representing seven vertebrate GPCR subgroups: the galanin, somatostatin/opioid, gonadotropin releasing hormone, oxytocin/vasopressin, and glycoprotein hormone receptors, as well as two subgroups represented by vertebrate orphan receptors (LGR4–6 and LGR7). Finally, there were fiveDrosophila peptide GPCRs that belong to Family B. Four of these receptors belong to one of two vertebrate GPCR subgroups: the calcitonin and corticotropin releasing factor receptors. Thus, a large majority of the Drosophila and vertebrate neuropeptide signaling pathways appear to share common evolutionary origins. It remains to be seen whether the functions of these signals have been similarly conserved.

Family A/Group III-B: Gastrin/Cholecystokinin Receptors

Cholecystokinin (CCK) and gastrin are related neuroendocrine peptides that act through two closely related families of receptors (type A, CCKR, and type B, GASR). These receptors likely evolved from a common ancestor (Johnsen 1998). Two Drosophila GPCRs (CG6857, CG6881) displayed strong evidence of evolutionary kinship with this receptor subgroup (Table 2). On the subgroup-specific tree (Fig.1A), CG6857 and CG6881 (as well asXenopus laevis CCKR) were located on the base of the tree, before the branches leading to the CCKR and GASR receptors. Therefore, it appears that the fly receptors diverged from a common ancestor of the CCKR and GASR lineages. Consistent with this interpretation,CG6857 and CG6881 are closely linked genes (∼30 kb apart), and they each display the strongest sequence similarity with each other (by BLASTP and on the phylogenetic trees; Table2, Fig. 1A). Therefore, they likely arose through a gene duplication event rather than independently from the CCKR and/or GASR lineages.CG6857 and CG6881 both share an intron (same position and same phase) in TM3 (Table 3) with genes encoding both CCKR (accession #AF015959-AF015963) and human GASR (L10822). Likewise, all of these genes have an intron in a similar position within the highly variable cytoplasmic loop between TM5 and TM6. This conservation of introns further indicates that CG6857 and CG6881 are members of the CCKR/GASR receptor subgroup.

Table 3.

Locations and Phasing of Introns among Genes EncodingDrosophila Family A Peptide GPCRs

TM1 TM2 TM3 TM4 TM5 TM6 TM7
after G before D after D before P after P before P after P before P after P before P after P before P
CG6857 24 (1)82 (1)[ii] 31 (2)
CG6881 24 (1)14 (1)66 (2)
CG5811 8 (2)20 (2)19 (0)5 (0)
CG10626 12 (0)2 (2)16 (1)17 (0)
Takr86C 10 (1)7 (2)11 (2)13 (2)
Takr99D 6 (1)7 (2)13 (2)17 (1)
CG10823 3 (0)19 (1)7 (2)
BACR48G21.1 13 (0)
CG1147 0 (1)4 (0)
CG7395
CG12610 24 (0)12 (0)33 (0)33 (1)62 (2)[ii]
CG13995 24 (0) 57 (2)
CG14593 9 (2)6 (0)14 (2)8 (0)8 (2)
CG14484 9 (2)8 (1)8 (0) 22 (0) 11 (2)
CG8784 7 (1)2 (2)18 (0)12 (1)
CG8795 7 (1)2 (2)18 (0)12 (1)
CG9918 12 (1)
CG14575 7 (1)28 (2)15 (1)14 (2)4 (2)
CG14003 19 (2)14 (0)6 (0)79 (0)9 (2)
CG5911A/B 4 (0)29 (1)1 (2)
CG2114
CG16726 18 (0)30 (2)29 (1)12 (0)
CG6986 10 (2)22 (0)18 (1)32 (0)32 (0)
CG13575
CG8985 15 (1)
CG13229 2 (1)2 (2)15 (2)29 (2)
CG13803 1 (1)15 (1)
CG5936 11 (1)29 (2)11 (2)
AlstR 7 (2)18 (2)14 (2)35 (0)23 (2)3 (0)
CG10001 9 (0)3 (0)_ _ _
CG7285 10 (0)6 (0)
CG13702 10 (0)18 (0)
GRHR 7 (1)2 (0)28
CG10698 2 (0)17 (1)20 (0)
CG6111 0 (1)7 (1)26 (1)
Fsh 12 (2)31 (1)30 (2)9 (0)
rk
CG4187 28 (1)7 (2)6 (2)14 (2)11 (1)?
CG5042 23 (2)19 (1)

[i] Family A peptide GPCRs are listed in the same order as shown in Table 2. The position of each intron is given with respect to the nearest amino acid landmark (position 0, underlined): TM1, GX2 GNXLV; TM2, NX2NLAXADLL; TM3, SX3LX2ISXDRYX2IX2 P; TM4, AX7WX2SX5 P; TM5, FX2 PLX6YX2I; TM6, FX2CWXP; TM7, LX3NSX2NPXIY. Intron phase is specified as: ‖NNN phase 0; N‖NN, phase 1; NN‖N, phase 2 (where NNN is the codon and “‖” is the position of the intron). Shaded items indicate shared introns. Dashed line, intron shared in two related GPCR subgroups.

[ii] Only one out of multiple neighboring introns is indicated.

Family A/Group III-B: Neurokinin Receptors

The neurokinin (tachykinin) receptors (NKRs) are a monophyletic group of GPCRs that are also closely related to the orexin/hypocretin receptors (OXRs), the neuropeptide FF/AF receptor (NFFR), and a class of orphan, glucocorticoid-induced receptors (GCRCs). We found sixDrosophila members of this subgroup: CG5811 (Li et al. 1992b;St-Onge et al. 2000), CG10626, TAKR86C (Monnier et al. 1992), TAKR99D (Li et al. 1991, 1992b), CG10823, and BACR48G21.1. On the subgroup-specific tree (Fig. 1B), TAKR86C and TAKR99D were located near the base of a branch leading to NK1R-NK3R, which indicates that the two fly proteins (and the two C. elegans orthologs) arose before the diversification the vertebrate NKRs. TAKR86C and TAKR99D are located together on the subgroup tree (with stable fly neurokinin receptor, STKR; Fig. 1B), and in BLASTP searches, each detects the other with the lowest P values (Table 2). Moreover, the Takr86C (Rosay et al. 1995) and Takr99Dgenes share two introns in the same position and with the same phase (Table 3). Thus, TAKR86C and TAKR99D appear to be paralogs, and they are therefore likely to share similar ligands and functional properties.

Two additional Drosophila receptors, CG5811 and CG10626, are related to the true NKRs. However, based on BLASTP and phylogenetic analysis (Table 2, Fig. 1B), each of these receptors appears to be the ortholog of a NKR-related class of GPCRs that to date have been identified only in invertebrates. CG10626 is closely related to the tick NKR (LKR; Holmes et al. 2000) and the snail lymnokinin receptor (LSR) (Table 2), and these three receptors are located on a single branch of the subgroup-specific tree (Fig. 1B). Likewise, CG5811 displays strong sequence similarity with GRL106, a snail NKR-like protein (Table 2). The branching pattern of this portion of the NKR subgroup tree is unstable (Fig. 1B). However, CG5811 andCG10626 have two introns that are in similar locations and display the same phasing (Table 3). Thus, these genes appear to be paralogs that diverged independently of the true NKRs. Consistent with this interpretation, the midpoint root of the NKR subgroup tree is located between the branch leading to the true NKRs and the branches of the tree leading to CG5811, CG10626, and the related GPCRs.

Finally, there are two additional GPCRs, CG10823 and BACR48G21.1, that display moderate to weak homology with the NKRs. ByBLASTP, CG10823 displays strongest homology with the vertebrate neuropeptide FF/neuropeptide AF receptor (NFFR), the putative mammalian RF-amide-related peptide receptor (OT7T022; Hinuma et al. 2000), as well as CG5811 (Table 2). In addition, theCG10823 gene has an intron that is located in the same position (and phase) as one of the two introns shared byTakr86C and Takr99D (Table 3). On the subgroup-specific tree, CG10823 is located near the base of a branch leading to the orexin/hypocretin receptors (OXRs), OT7T022 and NFFR (Fig. 1B). Therefore, CG10823 appears to have arisen from a common ancestor of these vertebrate relatives. Finally, BACR48G21.1 also appears to be a member of the NKR subgroup. However, this relationship was not well supported by the phylogenetic analysis (Table 2), and additional sequence data will be required to evaluate this finding.

Family A/Group III-B: Neuropeptide Y Receptors

The receptors for the neuropeptide Y (NPY) family of peptides (NPYRs) and the prolactin releasing peptide (PRPR) form a subgroup of related GPCRs (Hinuma et al. 1998; Hoyle 1999). FourDrosophila proteins, CG1147, CG7395, CG12610, and CG13995, appear to be members of this subgroup (Table 2). On the subgroup-specific tree, the position of the root was unclear (Fig. 1C). In addition, although the branching pattern for the portions of the tree containing the vertebrate NPYR receptors (except NY2R) was stable, the rest of the tree was not clearly resolved. In theBLASTP analysis and on the phylogenetic trees (Table 2; Fig. 1C), CG1147 showed the strongest sequence homology with a class of receptors that includes a C. elegans orphan GPCR (C25G6.5) and the vertebrate neuropeptide Y Y2 receptors (NY2Rs). CG7395, which also displays strong general sequence homology with the other members of this subgroup, appears to be most closely related to a diversified group of orphan NPYR-like C. elegans receptors. In contrast, CG12610 appears to be most closely related to PRPR. The fourthDrosophila receptor in this group, CG13995, was located on the Group III portion of the full Family A tree, which consists almost exclusively of peptide GPCRs. However, CG13995 did not show strong evidence of homology with any specific class of peptide GPCRs (Table2). Interestingly, the CG13995 gene shares an intron in TM3 (same position and phase) with CG12610. Therefore, we propose that CG13995 is distantly related to the NPYR subgroup. Finally, it has been suggested that CG5811 is a NPYR-like receptor, despite its greater sequence similarity with the NKRs (see above), based on the activation of functionally expressed CG5811 by NPY and related peptides (at micromolar concentrations) and the lack of activation by vertebrate neurokinins (Li et al. 1992b). However, in competitive displacement experiments with CG5811 (St-Onge et al. 2000), PQGRF-amide-like peptides (e.g., NPFF and Lymnaea cardioexcitatory peptide) displayed IC50s in the subnanomolar range. Thus, CG5811 does not appear to be a member of the NPYR subgroup.

Family A/Group III-B: Bombesin/Gastrin Releasing Peptide Receptors

The bombesin-like neuropeptides, which include bombesin, gastrin releasing peptide (GRP), and neuromedin B (NMB), exert a wide variety of physiological actions in the CNS and the periphery through a class of related receptors (Sun et al. 2000). These receptors include the GRP-preferring receptor (GRPR), the neuromedin B-preferring receptor (NMBR), and an orphan class of receptors, characterized by bombesin receptor subtype 3 (BRS3). There are two Drosophila GPCRs, CG14484 and CG14593, that belong to this phylogenetic subgroup (Table2). On the subgroup-specific tree, the three types of vertebrate bombesin/GRP receptors formed a clade, whereas CG14484 and CG14593 branch out from the base of the tree (Fig. 1D). Therefore, it appears that the fly receptors diverged from a common ancestor of the vertebrate bombesin/GRP receptor lineages. The organizations of theCG14484 and CG14593 genes are similar; each has one intron in the same position and phase, and there are two additional introns in similar positions (Table 3). Thus, CG14484 and CG14593 appear to be paralogs. Together, these results indicate that CG14484 and CG14593 are bombesin/GRP receptors; to our knowledge, this is the first clear molecular evidence for bombesin/GRP signaling in invertebrates.

Family A/Group III-B: Growth Hormone Secretagogue, Neurotensin, Neuromedin U, and Thyrotropin Releasing Hormone Receptors

The receptors for neurotensin (NTR), neuromedin U (NMUR), thyrotropin releasing hormone (TRFR), and growth hormone secretagogue (GHSR) form a large and diverse subgroup of GPCRs (Fujii et al. 2000). Among these, NTR, GHSR, and NMUR display strong sequence similarity, whereas TRFR is more distantly related. At least sevenDrosophila GPCRs appear to be members of this subgroup: CG8784, CG8795, CG9918, CG14575, CG5911A, CG5911B, and CG14003 (Table2). An additional seven GPCRs (CG2114, CG5936, CG6986, CG8985, CG13229, CG13803, and CG16726) are all most closely related to a large set of related orphan receptors that had been identified previously only inC. elegans (C. Bargmann, pers. comm.). These orphan GPCRs fall into at least three classes, and there are one to threeDrosophila GPCRs in each class (Fig. 1E). The three receptors in class A (CG8985, CG13229, and CG13803) display strong sequence homology. In addition, CG8985 and CG13803 are linked genes (∼30 kb apart), and they share an intron (Table 3). Thus, theDrosophila class A receptors appear to be paralogs. All three classes display weak sequence similarity with TRFR, NTR, and GHSR, indicating that this entire family of orphan receptors may be derived from an ancestor of these vertebrate receptors and therefore may encode peptide GPCRs. However, confirmation of such a relationship will require functional analysis of one or more members of these orphan GPCR classes.

CG8784 and CG8795 are two of the seven Drosophila GPCRs displaying the strongest sequence similarity with this vertebrate subgroup, and they appear to be paralogs. They display strong sequence similarity with each other (Table 2; Fig. 1E). Moreover, theCG8784 and CG8795 genes are closely linked (∼10 kb apart) and share four introns with identical positions and phasing (Table 3). Similarly, CG9918 and CG14575 each share one intron with CG8784/CG8795 (Table 3), indicating that all four of these receptors are closely related. Their closest vertebrate homologs are NMUR, GHSR, and NTR, based onBLASTP analysis and on their positions in the phylogenetic trees (Table 2; Fig. 1E). Consistent with this finding, the shared intron located in the TM6 domain of CG8784 and CG8795 is also found in the same position and with the same phasing in the pufferfish GHSR gene (AF082211). However, the branching pattern for the subgroup-specific tree was unstable, and the evolutionary relationships among these receptors are unclear. Three additional receptors, CG5911A and CG5911B (generated by putative alternative splicing of the CG5911 gene) and CG14003, also displayed moderate to weak sequence homology with this subgroup and appear to be most closely related to vertebrate TRFR.

Family A/Group V: Galanin/Allatostatin and Opioid/Somatostatin Receptors

There were four Drosophila receptors, AlstR(Birgül et al. 1999; Lenz et al. 2000a), CG7285, CG10001 (Lenz et al. 2000b), and CG13702, that displayed strong sequence similarity with galanin, somatostatin, and opioid receptors (Table 2). Because these three classes of vertebrate receptors display extensive sequence similarity, we grouped them together to construct a subgroup-specific tree (Fig. 2A). The root of this tree is located between the branch leading to the galanin receptors and the branch leading to the somatostatin and opioid receptors. CG7285 and CG13702 were located on the branch containing all of the somatostatin and opioid receptors and related orphan GPCRs. The opioid receptors form a clade, and two groups of somatostatin receptors also form clades (SSR1/4 in one and SSR2/3/5 in the other). The remaining branches on this side of the tree are unstable. Together, these results indicate that CG7285 and CG13702 are orthologous to the vertebrate somatostatin and opioid receptors, although it is not clear whether they diverged from a common ancestor or from a point deeper within the tree. CG7285 and CG13702 appear to be paralogs; they display strong sequence homology (Table 2; Fig. 2A), and they are encoded by linked genes (∼90 kb apart) that share an intron with the same location and phasing (Table 3).

Figure 2.

Neighbor-joining phylogenetic trees for the Family A, Group V receptors. (A) Rooted tree for the opioid, somatostatin, galanin, and allatostatin receptors. (B) Unrooted tree for the gonadotropin releasing hormone (GnRH), vasopressin, and oxytocin receptors. The likely midpoint of the tree is indicated with an “X.” (C) Rooted tree for the glycoprotein hormone receptors and related leucine-rich repeat containing receptors (LGRs). Bootstrap scores, omitted branches, and Drosophila GPCRs are indicated as in Fig. 1. (ALGR) Anthopleura elegantissima (sea anemone) LGR; (FSHR) follicle-stimulating hormone receptor; (GALR) galanin receptor type 1; (GALS) galanin receptor type 2; (GALT) galanin receptor type 3; (GPR24 and GPR54) mammalian orphan GPCRs; (GRHR) GnRH receptor; (ITR) isotocin receptor; (LGR4–7) LGR types 4–7; (LSCPR and LSCPR2) Lymnaea stagnalis conopressin receptor types 1 and 2; (LSHR) lutropin-choriogonadotropic hormone receptor; (MTR) mesotocin receptor; (NLGR) C. elegans LGR; (ORPH4) Lymnaea stagnalis orphan GPCR; (OPRD) delta-type opioid receptor; (OPRK) kappa-type opioid receptor; (OPRM) mu-type opioid receptor; (OPRX) nociceptin/orphanin FQ receptor; (OXYR) oxytocin receptor; (SLGR)L. stagnalis GRL101; (SSR1–SSR5) somatostatin receptor types 1–5; (TSHR) thyrotropin receptor; (V1AR and V1BR) vasopressin V1A and V1B receptors; (V2R) vasopressin V2 receptor; (VTR) vasostocin receptor. The remaining non-Drosophila sequences are orphan GPCRs from C. elegans.

15f2_L1TT

The allatostatin receptor, AlstR (Birgül et al. 1999), and CG10001 were located on the portion of the tree containing all of the galanin receptors (Fig. 2A), indicating that AlstR and CG10001 are Drosophila orthologs of the mammalian galanin receptors. This finding is in agreement with an earlier phylogenetic analysis of AlstR (Birgül et al. 1999). The AlstR and CG10001genes share an intron at the same location and with the same phasing (Table 3; Lenz et al. 2000b). Thus, AlstR and CG10001appear to be paralogs and are likely to share many functional properties. Interestingly, immunocytochemical studies, using anti-porcine galanin and anti-porcine galanin message-associated peptide, as well as receptor autoradiography studies using125I-porcine galanin, showed the presence of galanin-like peptides in several locations in the adult CNS of blowflies, including the fan-shaped body of the central complex and a ring of cells in the medulla (Lundquist et al. 1991, 1993; Johard et al. 1992). Similar patterns of staining in the fan-shaped body and medulla have been obtained in Drosophila with a specific monoclonal anti-allatostatin antiserum (Yoon and Stay 1995). These comparative data provide additional support for the conclusion thatAlstR and CG10001 are closely related to the vertebrate galanin receptors (cf., Birgül et al. 1999; Lenz et al. 2000b).

Family A/Group V: Gonadotropin Releasing Hormone, Vasopressin, and Oxytocin Receptors

The receptors for gonadotropin releasing hormone (GRHR) and the receptors for vasopressin (VPR) and oxytocin (OXYR) belong to two closely related clades of GPCRs (Hoyle 1999). In Drosophila, there are three GPCRs that belong to this subgroup; CG6111, CG10698, and Dm-GRHR (Table 2; Hauser et al. 1998). The branching pattern near the base of the subgroup-specific tree was unstable (Fig. 2B), and the evolutionary history of this subgroup is unclear. However, when the tree is midpoint rooted, Dm-GRHR and CG10698 branch from the side of the tree leading to the vertebrate GRHRs, and CG6111 branches from the side of the tree leading to VPR, OXYR, and related GPCRs. These results are in agreement with the results of BLASTP analysis. Moreover, the Dm-GRHR gene shares an intron near TM4 (identical location and phasing) with the rat GRHR gene (U92471)(Hauser et al. 1998); CG10698 also shares this intron. Thus, Drosophila appears to have two GRHR-like receptors and one VPR/OXYR-like receptor.

Family A/Group V (Type 1c): Glycoprotein Hormone Receptors

Four glycoprotein hormones have been identified in mammals: thyroid-stimulating hormone (TSH) and the gonadotropins, follicle-stimulating hormone (FSH), choriogonadotropin (CG), and luteinizing hormone (LH). These four hormones bind to a subgroup of receptors (the LGRs) that all bear a characteristic, large, N-terminal “ectodomain” that participates in the binding of the large glycoprotein ligands (Hsu et al. 2000) (type 1c receptors; Bockaert and Pin 1999). Four Drosophila receptors, CG4187, CG5042, and the proteins encoded by the Fsh (Hauser et al. 1997) andrk (Ashburner et al. 1999; Eriksen et al. 2000) genes, display sequence similarity with the LGRs, including the N-terminal ectodomain (Table 2). On the subgroup-specific tree (Fig. 2C), there were three distinct clades (cf., Hsu et al. 2000). The first includes LGR7, aLymnaea ortholog (SLGR), CG4187, and CG5042. The second includes LGR4–LGR6, and the third includes the glycoprotein hormone receptors (LSHR, FSHR, and TSHR). Fsh is located at the base of a branch leading to the glycoprotein hormone receptors, indicating that this gene may have evolved from a common ancestor of LSHR, FSHR, and TSHR. Three additional receptors, C. elegans LGR (NLGR), sea anemone LGR (ALGR), and rk, were grouped only weakly with the glycoprotein hormone receptors; the branching pattern of this portion of the tree was unstable. Therefore, these could not be assigned to any one class of LGRs by basis of the phylogenetic analysis alone.

Within the ectodomain, all of the LGRs contain a variable number of leucine-rich repeats and a functionally important hinge region located between the leucine-rich repeats and the seven-TM core. At the borders of the hinge region, there are two sequences that are diagnostic of the three different subclasses of LGRs (Table4; Hsu et al. 2000). These groupings are also supported by BLASTP analysis of the ectodomains (data not shown). These sequences support the placement ofFsh in the subfamily of glycoprotein hormone receptors.

Table 4.

Conserved LGR Hinge Sequences

LGR4–6 consensus YAYQCC GXFKPCEX
rk YAYHCC GPFLPCAD
LGR4 YAYQCC GAFKPCEY
LGR5 YAYQCC GPFKPCEH
FEX SAYQCC GPFKPCEH
FSHR/LSHR/TSHR  consensus YPSHCC DXFNPCED
Fsh HSFHCC NDLNPCED
NLGR YPHHCC DALNPCEN
FSHR_human YPSHCC DAFNPCED
LSHR_human YPSHCC DAFNPCED
TSHR_human YPSHCC DEFNPCED
ALGR NGFLCC DAFHPCED
LGR7 consensus XZXZCX DGZSSXXX
CG4187 NVRVCD DGISSKLH
CG5042 RFFYCS DGVSSFQD
LGR7 KFQYCG DGISSLEN
SLGR SYRFCC DEFSSCED
LDL receptor motif
CG4187[ii] NCDGSVDCDDASDEVNC
CG5042 CVPRRQMCDSRNDCADSSDENPVEC

[i] The consensus hinge sequences for LGR4–6 and FSHR/LSHR/TSHR were described by Hsu et al. (2000). The consensus hinge sequences for LGR7 are based on a ClustalX alignment of these four proteins.

[ii] This sequence is truncated N-terminally, presumably due to missing sequence at the N-terminal end of the annotation.

Placement of CG4187 and CG5042 in the LGR7 clade is supported byBLASTP analysis of the ectodomains (data not shown) and the presence of subgroup-specific hinge sequences (Table 4). Unlike the other two subgroups of LGRs, the ectodomains of LGR7 and snail LGR have low density lipoprotein (LDL) receptor-like cysteine-rich motifs at the N terminus (Tensen et al. 1994; Hsu et al. 2000). CG4187 and CG5042 also each contain at least one LDL motif (Table 4). The function of the LDL motif is unclear, but it indicates a possible role for lipoprotein-like molecules in neuronal G protein-mediated signal transduction (Tensen et al. 1994). Alternatively, given the presence of leucine-rich repeats, these receptors may bind to glycoproteins.

Although phylogenetic analysis of the LGRs did not place rk in any of the three subgroups of LGRs, analysis of the ectodomain indicates that this receptor is orthologous to LGR4–6. This is based on the presence of hinge sequences most similar to LGR4–6 and onBLASTP analysis (data not shown). The other members of this family are orphan receptors. However, the presence of the leucine-rich repeats indicates that these proteins also bind to glycoproteins.

Family B/Group I: Calcitonin and Diuretic Hormone Receptors

In addition to the 40 proteins in Family A (the rhodopsin-like receptors), there are 5 Drosophila peptide GPCRs in Family B (the secretin-like receptors). Based on BLASTP analysis and their positions on the phylogenetic tree (Fig.3), at least four of these receptors (CG4395, CG8422, CG12370, and CG17415) belong to Group I. This group contains the receptors for calcitonin (CALR), calcitonin gene related peptide (CGRR), corticotropin releasing factor (CRFR and CRF2), and diuretic hormone (DIHR). The position of the fifth Drosophila peptide GPCR in this family (CG13758) is unclear, and it may be a member of Group I, II, or III. CG8422 and CG12370 appear to paralogs, and they are orthologous to the DIHRs. These receptors belong to a clade containing CRFR and CRF2, which indicates that the ancestor to the insect DIHRs evolved from a common ancestor of the vertebrate corticotropin releasing factor receptors (Fig. 3). In contrast, CG4395 and CG17415 are most closely related to CALR and CGRR, although the bootstrap scores more deeply located within this branch of the tree were not strong enough to determine whether CALR and CGRR diverged before or after the related Drosophila receptors. We did not find evidence for well defined GPCR-associated proteins (e.g., RAMPs [Bockaert and Pin 1999] and RCPs [Evans et al. 2000]).

Figure 3.

Unrooted neighbor-joining tree for the Family B receptors. The location of the tree midpoint is ambiguous and is therefore not indicated. Bootstrap scores, omitted branches, and Drosophila GPCRs are indicated as in Fig. 1. The four groups of Family B receptors are indicated with vertical bars. (BAI) brain-specific angiogenesis inhibitors 1–3; (CALR) calcitonin receptor; (CAR1) cyclic AMP receptor 1; (CD97) leucocyte antigen CD97; (CGRR) calcitonin gene-related peptide type 1 receptor; (CRF2) corticotropin releasing factor (CRF) receptor 2; (CRFR) CRF receptor 1; (DIHR) diuretic hormone receptor; (EMR1) cell surface glycoprotein EMR1; (GIPR) gastric inhibitory polypeptide receptor; (GLP2R) glucagon-like peptide 2 receptor; (GLPR) glucagon-like peptide 1 receptor; (GLR) glucagon receptor; (GRFR) growth hormone releasing hormone receptor; (HE6) G protein-coupled receptor HE6; (LRP1–3) calcium-independent alpha-latrotoxin receptors (latrophilins) 1–3; (MEGF2) seven-pass transmembrane proteins CELSR1–2 and MEGF2; (PACR) pituitary adenylate cyclase activating polypeptide (PACAP) type I receptor; (PTR2) parathyroid hormone receptor; (PTRR) parathyroid hormone/parathyroid hormone-related peptide receptor; (SCRC) secretin receptor; (TM7XM1) human EGF-TM7 like protein; (VIPR) vasoactive intestinal polypeptide (VIP) receptor 1; (VIPS) VIP receptor 2. The remaining non-Drosophila sequences are orphan GPCRs from Caenorhaloditis elegans.

15f3_L1TT

Drosophila Genes Encoding Neuropeptides and Peptide Hormones

We wished to compare the number of peptide GPCRs with the number of neuropeptides present (or suspected to exist) in Drosophila. Based on the literature and on some genomic analysis, we have assembled a list of 22 Drosophila neuropeptide genes (Table5). These genes are either known or predicted to encode bioactive neuropeptides and peptide hormones. Eight of these, which encode neuropeptides described for Drosophilaor other arthropods, were described previously only in gene annotations generated by Celera/BDGP and in a parallel survey, which was just published recently (Vanden Broeck 2001). An additional peptide listed by Vanden Broeck (2001) (“IFa”) was not included, because the precursor did not match our criteria for putative neuropeptide genes. Because neuropeptide-encoding precursors do not display multiple, uniform characteristics found in GPCRs, we are certain to have missed many peptide genes and thus consider this list incomplete. However, assuming a 1 : 1 ratio of neuropeptide and peptide hormone genes to peptide GPCRs, these 22 genes appear to encode the ligands for at least 50% of the Drosophila peptide GPCRs that we have described. This may be an underestimate, given the fact that many of these neuropeptide genes encode multiple peptides. In addition to these 22 neuropeptide genes, we list several insect peptides and peptide hormones known in other insects and for which Drosophilahomologs have been inferred by observation or simply by conjecture. Although the structures of these genes are not yet available, they are included here to permit consideration of all plausible ligands for the identified peptide GPCRs.

Table 5.

Drosophila Neuropeptides and Peptide Hormones

Peptide name Gene Location EST Precursor Molecularly cloned Related sequence(s)/evidence for gene
amnesiac CG11937 19A1LP07893170 Feany and Quinn 1995
diuretic hormone CG13094 29D1116 Furuya et al. 2000
M-ASH CG14919 32D2122 Kramer et al. 1991
LRLRFamide CG13968 38B4300 Feng et al. 1999
dFMRFamide CG2346 46C2347 Nambu et al. 1988;Schneider and Taghert 1988
ion transport peptide CG13586 60D5430 Lee et al. 1995; Meredith et al. 1996
ETH/P-ETH CG18105 60D16203 Park et al. 1999
AKH CG1171 64A1079 Noyes et al. 1995
sex peptide CG17673 70A455 Kubli 1992; Ottiger et al. 2000
leucokinin CG13480 70E3153 Terhzaz et al. 1999
allatostatin (B-type) CG6456 74B1GH13904211 Williamson et al. 2001
drosulfakinin CG18090 82A1128 Nichols et al. 1988
diuretic hormone CG8348 85E2GH27214>183 Coast 1996
neurokinin CG14734 87A9289 Siviter et al. 2000
corazonin CG3302 88B7LP1143972 Veenstra 1994
NP-PP CG10342 89DGH04563102 Brown et al. 1999
EH CG5400 90B197 Horodyski et al. 1993
CCAP CG4910 94C4151 Cheung et al. 1992; Lehman et al. 1993; Ewer and Truman 1996
dromyosuppressin CG6440 95F1GH10451100 Nichols 1992
allatostatin CG13633 96A23151 Lenz et al. 2000c
PDF CG6496 97B2102 Park and Hall 1998
CAP2B/pyrokinin CG15520 99D1GH28004148 Huesmann et al. 1995; Davies et al. 1998
allatotropin ND Kataoka et al. 1989; Z̆itn̆an et al. 1993
anterior retraction factor,  pupariation tanning  factorND Sivasubramanian et al. 1974
baratinND Nässel et al. 2000
bursiconND Fraenkel et al. 1966; Kostron et al. 1999
colliculostatin, NEB, TMOFND De Loof et al. 1995
ecdysiotropinND Koolman et al. 1995
GBPND Hayakawa et al. 1995
neuroparsin, parsinND Girardie et al. 1998
PBANND Kawano et al. 1992; Sato et al. 1993;Masler et al. 1994; Zdarek et al. 1997
proctolinND Anderson et al. 1988
PTTH (large)ND Kim et al. 1997
somatostatin-likeND Ui-Tei et al. 1995
steroidogenic factorND Brown et al. 1998
vasopressin-likeND Baines et al. 1995

[i] (Location) cytological location determined directly or inferred from the location of the sequenced genomic clones. (EST) expressed sequence tag clone. (Precursor) known or deduced length of the prepropeptide. (Molecularly cloned), gene identification verified by cDNA (excluding ESTs) and/or in situ hybridization. (ND) not determined. (Related sequence(s)/evidence for gene) existence of neuropeptide gene inferred from peptide sequences fromDrosophila or other arthropods, physiological assays, and/or immunocytochemical data. An additional predicted neuropeptide gene (IFa, Vanden Broeck 2001) lacks structural features.

Ligands for Family A Peptide GPCRs

There are multiple genes encoding potential ligands for theDrosophila NKR-like receptors. CG14734 produces neurokinin-like peptides (Siviter et al. 2000) that likely bind to TAKR86C and TAKR99D, as shown by functional expression of these receptors and specific binding to mammalian (Li et al. 1991) and insect (Monnier et al. 1992) neurokinins and related peptides. Based on the pharmacological observations of the Lymnaea lymnokinin receptor, LSR (Cox et al. 1997), we speculate that theDrosophila ortholog, CG10626, is a receptor for the leucokinin-like peptides encoded by CG13480 (Terhzaz et al. 1999).

In addition to the neurokinin-like peptides, there are also several genes that are known to encode (or potentially encode) peptides terminating in the sequence RF-amide. These include the putative peptide products of a novel gene, CG13968. Along with the products of the dFMRFa (Nambu et al. 1988; Schneider and Taghert 1988) andDMS (CG6440) genes, these peptides may interact with multiple receptors for RF-amide peptides. CG5811 has been shown to bind with high affinity to molluscan -PQGRF-amide peptides and therefore is likely to represent the first of several Drosophila RF-amide peptide receptors (St-Onge et al. 2000). A second potential RF-amide peptide receptor, CG10823, is orthologous to NFFR and OT7T022, both of which bind ligands bearing the C-terminal consensus sequence PXRF-amide (Elshourbagy et al. 2000; Hinuma et al. 2000). The otherDrosophila neuropeptides ending in RF-amide are found within the dsk gene and display structural similarity to the vertebrate cholecystokinins (Nichols et al. 1988). We speculate that these peptides may interact with either or both of the paralogous CCKR/GASR-related receptors (CG6857 and CG6881).

AlstR binds a native Drosophila allatostatin peptide (AST-1) with high affinity (Birgül et al. 1999). This peptide, along with multiple other related peptides, is encoded by CG13633 (Lenz et al. 2000c). Because AlstR and CG10001 are paralogs, the latter receptor is also likely to interact with one or more of the products ofCG13633.

Ligands for Family B Peptide GPCRs

We speculate that the corticotropin releasing factor (CRF)-related peptides encoded by CG8348 and CG13094 (similar to Locustadiuretic hormone; Coast 1996; Furuya et al. 2000) interact with the CRFR-related CG8422 and/or CG12370, both of which are orthologs (Fig.3) of the Acheta domesticus diuretic hormone receptor (Reagan 1994). Additionally, Zhong and Pena (1995) found evidence for a PACAP-like peptide in flies. Feany and Quinn (1995) and Moore et al. (1998) provided genetic evidence to implicate the amnesiacgene (potentially encoding peptides of the PACAP family) in variousDrosophila behaviors. We speculate that the peptides in this group may interact with one or more of the remaining Family B receptors (CG4395, CG17415, and CG13758).

Peptide Genes Still Awaiting Identification

There are several insect neuropeptides and peptide hormones that have not as yet been cloned in Drosophila. These include three large protein hormones—PTTH, bursicon, and the anterior retraction factor (ARF)—that are known to exist in Drosophila but currently lack molecular definition. At least two of these proteins, PTTH and bursicon, are glycoprotein hormones (Fraenkel et al. 1966; Kim et al. 1997), whereas the structure of ARF remains undefined (Sivasubramanian et al. 1974). As noted above, the structure of the receptor encoded by the fsh gene indicates that it binds to a glycoprotein hormone ligand. All of the mammalian glycoprotein hormones share a similar structure, consisting of common α- and specific β-subunits (Hsu et al. 2000). However, to date, no similar proteins have been identified in flies. We speculate that PTTH, bursicon, and ARF are good candidate ligands for members of the LGR class of receptors.

Relatives of several peptides identified in other insects may also be present in Drosophila. These include PBAN and diapause hormone (DH), which are found in diverse insects. Both are peptide hormones of moderate size that are cosynthesized along with shorter pyrokinin peptides (Kawano et al. 1992; Sato et al. 1993; Masler et al. 1994; Xu et al. 1995). It is notable that the Drosophila CG15520 precursor includes a single FXPRLamide (pyrokinin-like) peptide but lacks any sequences similar to PBAN or DH. Because NMUR is activated by peptides displaying a LXXPRX-amide consensus (Fujii et al. 2000), we speculate that this pyrokinin-like peptide may interact with CG14484 and/or CG14593, which are orthologs of NMUR. Likewise, theDrosophila ecdysis triggering hormones (ETHs), which have a PRX-amide C-terminal sequence (Park et al. 1999), may signal through these receptors.

With the completion of the D. melanogaster genome sequence, we are now able to take a comprehensive picture of the genes encoding peptide GPCRs in this species, and a complete catalog of the cognate ligands should soon follow. This is an important first step toward detailed physiological and genetic analyses of neuropeptide signaling in Drosophila.

METHODS

Peptide GPCR Sequence Acquisition

To identify all predicted Drosophila GPCRs, we first scanned the gene annotations developed jointly by the BerkeleyDrosophila Genome Project (BDGP) and Celera Genomics for all proteins predicted to contains domains matching seven-TM motifs (Adams et al. 2000; Brody and Cravchik 2000). We rejected sequences identified recently as odorant receptors by a committee representing scientists working in the field ( Drosophila Odorant Receptor Nomenclature Committee 2000). Each remaining cloned and candidate receptor gene was used as a BLASTP search query of the database of predicted Drosophila proteins using the BDGP server (http://www.fruitfly.org/) and/or of the “non-redundant” database of all proteins using the NCBI server (http://www.ncbi.nlm.nih.gov/). Sequences were not considered further if the resulting top-scoring proteins yielded P values for nonpeptide receptors (and associated orphan receptors) that were at least 10-fold greater than the top P value for a putative peptide receptor. Three sequences (CG18314, CG12796, CG13579) generated a smaller range of P values following BLASTPsearches on the BDGP server. Nevertheless, analysis of these proteins using the NCBI server yielded hits that were exclusively amine/small neurotransmitter receptors (or orphan receptors). These proteins therefore are likely to encode nonpeptide receptors, and they also were excluded.

We first scanned a set of GPCR sequence annotations obtained through aGENSCAN search of the complete Drosophila genome sequence (see Vosshall et al. 1999) and identified based on sequence similarity to GPCRs in the NCBI nonredundant protein database (K. Scott and L. Vosshall, pers. comm.). For the BLASTP andTBLASTN analyses, we assembled a “GPCR query set,” which included the previously annotated peptide, amine, and related/unclassified Drosophila GPCRs as well as ∼200 sequences representing a diverse set of Family A (rhodopsin receptor-like family) GPCRs from the Pfam database (7TM-1; http://pfam.wustl.edu/). These sequences were used as queries for BLASTP andTBLASTN searches on the BDGP server, using the predicted proteins and the Celera/BDGP whole genome shotgun sequence datasets, respectively. To expedite the latter search, we assumed thatTBLASTN hits to genomic sequences that were already on our list were due to the detection of previously annotated GPCR genes.

GPCR Alignments

We used the hidden Markov model–based protein alignments contained in Version 5.5 of PFAM (Sept., 2000; Bateman et al. 2000) as a template for the manual alignment of the Drosophilacloned and candidate peptide receptors. The alignments were viewed using ClustalX (Version 1.8; Thompson et al. 1997), and, in some cases, this program was used to help resolve the alignment of variable regions (e.g., between TM domains 4 and 5). We used these alignments to build phylogenetic trees and also to detect missing or incorrect sequences in the gene annotations.

The N-terminal and C-terminal non-TM sequences in GPCRs tend to be poorly conserved, making accurate alignment difficult, and the seven-TM core region is sufficient for the subclassification of these proteins (Strader et al. 1994). Therefore, for Family A receptors, we deleted sequences N-terminal to the conserved GNXXLV motif (single-letter amino acid code) in TM1 and C-terminal to the conserved NPXIY motif in TM7. For Family B receptors (secretin receptor family), we deleted sequences flanking the X10GX3S motif in TM1 and the QGX2V X4CX5X motif in TM7.

Correction of Annotations

To locate missing TM domains among the putative peptide receptor annotations, we scanned for potential coding exons in flanking genomic sequence using the GENSCAN server at MIT (http://genes.mit.edu/GENSCAN.html), and the FGENES (gene prediction) and FEX (exon prediction) programs on the Baylor College of Medicine (BCM Search Launcher) server (http://www.hgsc.bcm.tmc.edu/). We also scanned for potential mRNA splice sites using the SPL program on the BCM Search Launcher server and by manual inspection of potential open reading frames displayed using MacVector (Genetics Computer Group, Madison, WI). The DNA sequences for all of the predicted donor and acceptor splice sites were NN‖GT and AG‖NN, respectively. Finally, we examined neighboring gene annotations to identify duplicate annotations of single GPCR genes. The annotations were judged to be complete when each of the TM domains displayed features that were clearly recognizable among closely related receptors. Except for the LGR subgroup of receptors (see Results and Discussion), which all share a large and subgroup-specific N-terminal domain, we did not evaluate the quality of the annotations for the N-terminal and C-terminal non-TM regions.

Tree Building

We classified the cloned and candidate peptide GPCRs based on five criteria. First, we noted the highest scoring BLASTP hits obtained on the NCBI server (Table 2). Second, we constructed alignments of Family A and Family B receptors, including all of theDrosophila peptide GPCRs identified above, for the purpose of generating full phylogenetic trees for each family. For Family A, we included mostly complete (TM1–TM7) sequences representing each of the five receptor groups, as well as sequences representing each of the various subgroups of receptors (e.g., the three types of galanin receptors) and representative orphan receptors contained within the full list of Pfam 7TM-1 (Family A) GPCRs. For Family B, we included all of the Group I–III receptors and a representative set of Family B, Group IV receptors within the full list of Pfam 7TM-2 GPCRs. After manual editing of the alignments, we constructed neighbor-joining phylogenetic trees for each family using ClustalX, using the correction for multiple substitutions provided by the software, followed by bootstrap analysis (1000 replicates).

For the subsequent subgroup-specific trees, we attempted to include all complete TM1–TM7 sequences belonging to each subgroup (as well as some partial sequences). These were identified by scanning the full Pfam 7TM-1 alignment and the GPCRDB listing of available GPCR sequences (http://www.gpcr.org/7tm/), and by performing BLASTPsearches with the cloned and candidate Drosophila peptide GPCRs as well as other representatives of each subgroup. After manual editing of the alignments, the construction of neighbor-joining trees and the bootstrap analysis was performed as above. A set of 26 indoleamine (biogenic amine) receptors, which form a monophyletic group (Kolakowski 1994), was used as an outgroup for the purpose of rooting the subgroup-specific trees. When the position of the root was unclear, the outgroup was omitted. All alignments, revised annotations, and unabridged versions of the trees are located athttp://thalamus.wustl.edu/flyGPCR/peptideGPCR.html. In addition, the revised annotations have been submitted to FlyBase (http://flybase.bio.indiana.edu/).

This work was supported by National Institutes of Health Grant NS21749 and the Human Frontier Science Program Organization (P.H.T.). We thank Sean Eddy for helpful discussions, Kirstin Scott and Leslie Vosshall for sharing Drosophila GPCR sequence data, Lin Yang and Dori Sztipanovits for technical assistance, and Aguan Wei for comments on the manuscript. We also thank Cori Bargmann and Kemal Payza for sharing unpublished results.

The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked “advertisement” in accordance with 18 USC section 1734 solely to indicate this fact.

Notes

[14] E-MAIL ; FAX (314) 362-3446.

[15] E-MAIL ; FAX (314) 362-3446.

Notes

[16] Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.169901.

REFERENCES

  1. M.D. AdamsS.E. CelnikerR.A. HoltC.A. EvansJ.D. GocayneP.G. AmanatidesS.E. SchererP.W. LiR.A. HoskinsR.F. Galle(2000) The genome sequence of Drosophila melanogaster. Science 287:2185–2195.
  2. M.S. AndersonM.E. HalpernH. Keshishian(1988) Identification of the neuropeptide transmitter proctolin in Drosophila larvae: Characterization of muscle fiber-specific neuromuscular endings. J. Neurosci. 8:242–255.
  3. M. AshburnerS. MisraJ. RooteS.E. LewisR. BlazejT. DavisC. DoyleR. GalleR. GeorgeN. Harris(1999) An exploration of the sequence of a 2.9-Mb region of the genome of Drosophila melanogaster. The Adh region. Genetics 153:179–219.
  4. R.A. BainesK.S. ThompsonR.C. RayneJ.P. Bacon(1995) Analysis of the peptide content of the locust vasopressin-like immunoreactive (VPLI) neurons. Peptides 16:799–807.
  5. J.M. BaldwinG.F. SchertlerV.M. Unger(1997) An α-carbon template for the transmembrane helices in the rhodopsin family of G-protein-coupled receptors. J. Mol. Biol. 272:144–164.
  6. A. BatemanE. BirneyR. DurbinS.R. EddyK.L. HoweE.L.L. Sonnhammer(2000) The Pfam protein families database. Nucleic Acids Res. 28:263–266.
  7. N. BirgülC. WeiseH.-J. KreienkampD. Richter(1999) Reverse physiology in Drosophila: Identification of a novel allatostatin-like neuropeptide and its cognate receptor structurally related to the mammalian somatostatin/galanin/opioid receptor family. EMBO J. 18:5892–5900.
  8. J. BockaertJ.P. Pin(1999) Molecular tinkering of G protein-coupled receptors: An evolutionary success. EMBO J. 18:1723–1729.
  9. T. BrodyA. Cravchik(2000) Drosophila melanogaster G protein-coupled receptors. J. Cell Biol. 150:F83–F88.
  10. M.R. BrownR. GrafK.M. SwiderekD. FendleyT.H. StrackerD.E. ChampagneA.O. Lea(1998) Identification of a steroidogenic neurohormone in female mosquitoes. J. Biol. Chem. 273:3967–3971.
  11. M.R. BrownJ.W. CrimR.C. ArataH.N. CaiC. ChunP. Shen(1999) Identification of a Drosophila brain-gut peptide related to the neuropeptide Y family. Peptides 20:1035–1042.
  12. R. ChenY.V. MukhinM.N. GarnovskayaT.E. ThielenY. IijimaC. HuangJ.R. RaymondM.E. UllianR.V. Paul(2000) A functional angiotensin II receptor-GFP fusion protein: Evidence for agonist-dependent nuclear translocation. Am. J. Physiol. Renal. Physiol. 279:F440–F448.
  13. C.C. CheungP.K. LoiA.W. SylwesterT.D. LeeN.J. Tublitz(1992) Primary structure of a cardioactive neuropeptide from the tobacco hawkmoth, Manduca sexta. FEBS Lett. 313:165–168.
  14. G.M. Coast(1996) Neuropeptides implicated in the control of diuresis in insects. Peptides 17:327–336.
  15. K.J. CoxC.P. TensenR.C. Van der SchorsK.W. LiH. van HeerikhuizenE. VreugdenhilW.P. GeraertsJ.F. Burke(1997) Cloning, characterization, and expression of a G-protein-coupled receptor from Lymnaea stagnalis and identification of a leucokinin-like peptide, PSFHSWSamide, as its endogenous ligand. J. Neurosci. 17:1197–1205.
  16. S.A. DaviesE.J. StewartG.R. HuesmannN.J. SkaerS.H. MaddrellN.J. TublitzJ.A. Dow(1998) Neuropeptide stimulation of the nitric oxide signaling pathway in Drosophila melanogaster Malpighian tubules. Am. J. Physiol. 273:R823–R827.
  17. A. De LoofD. BylemansL. SchoofsI. JanssenK. SpittaelsJ. Vanden BroeckR. HuybrechtsD. BorovskyY.J. HuaJ. Koolman(1995) Folliculostatins, gonadotropins and a model for control of growth in the grey fleshfly, Neobellieria (sarcophaga) bullata. Insect Biochem. Mol. Biol. 25:661–667.
  18. Drosophila Odorant Receptor Nomenclature Committee (2000) A unified nomenclature system for the Drosophila odorant receptors. Cell 102:145–146.
  19. N.A. ElshourbagyR.S. AmesL.R. FitzgeraldJ.J. FoleyJ.K. ChambersP.G. SzekeresN.A. EvansD.B. SchmidtP.T. BuckleyG.M. Dytko(2000) Receptor for the pain modulatory neuropeptides FF and AF is an orphan G protein-coupled receptor. J. Biol. Chem. 275:25965–25971.
  20. K.K. EriksenF. HauserM. SchiottK.M. PedersenL. SondergaardC.J. Grimmelikhuijzen(2000) Molecular cloning, genomic organization, developmental regulation, and a knock-out mutant of a novel leu-rich repeats-containing G protein-coupled receptor (DLGR-2) from Drosophila melanogaster. Genome Res. 10:924–938.
  21. B.N. EvansM.I. RosenblattL.O. MnayerK.R. OliverI.M. Dickerson(2000) CGRP-RCP, a novel protein required for signal transduction at calcitonin gene-related peptide and adrenomedullin receptors. J. Biol. Chem. 275:31438–31443.
  22. J. EwerJ.W. Truman(1996) Increases in cyclic 3′, 5′-guanosine monophosphate (cGMP) occur at ecdysis in an evolutionarily conserved crustacean cardioactive peptide-immunoreactive insect neuronal network. J. Comp. Neurol. 370:330–341.
  23. M.B. FeanyW.G. Quinn(1995) A neuropeptide gene defined by the Drosophila memory mutant amnesiac. Science 268:869–873.
  24. G. FengV. RealeK. KennedyH.M. ChatwinP.D. EvansL.M. Hall(1999) Cloning and functional characterization of a novel neuropeptide F-like receptor from Drosophila melanogaster. Soc. Neurosci. Abst. 29:183.
  25. G. FraenkelC. HsiaoM. Seligman(1966) Properties of bursicon: an insect protein hormone that controls cuticular tanning. Science 151:91–93.
  26. R. FujiiM. HosoyaS. FukusumiY. KawamataY. HabataS. HinumaH. OndaO. NishimuraM. Fujino(2000) Identification of neuromedin U as the cognate ligand of the orphan G protein-coupled receptor FM-3. J. Biol. Chem. 275:21068–21074.
  27. K. FuruyaR.J. MilchakK.M. ScheggJ. ZhangS.S. TobeG.M. CoastD.A. Schooley(2000) Cockroach diuretic hormones: Characterization of a calcitonin-like peptide in insects. Proc. Natl. Acad. Sci. 97:6469–6474.
  28. A.J. GentlesS. Karlin(1999) Why are human G-protein-coupled receptors predominantly intronless? Trends Genet. 15:47–49.
  29. J. GirardieJ.C. HuetZ. Atay-KadiriS. EttaouilJ.P. DelbecqueB. FournierJ.C. PernolletA. Girardie(1998) Isolation, sequence determination, physical and physiological characterization of the neuroparsins and ovary maturing parsins of Schistocerca gregaria. Insect Biochem. Mol. Biol. 28:641–650.
  30. F. HauserH.-P. NothackerC.J.P. Grimmelikhuijzen(1997) Molecular cloning, genomic organization, and developmental regulation of a novel receptor from Drosophila melanogaster structurally related to members of the thyroid-stimulating hormone, follicle-stimulating hormone, luteinizing hormone/choriogonadotropin receptor family from mammals. J. Biol. Chem. 272:1002–1010.
  31. F. HauserL. SondergaardC.J. Grimmelikhuijzen(1998) Molecular cloning, genomic organization and developmental regulation of a novel receptor from Drosophila melanogaster structurally related to gonadotropin-releasing hormone receptors for vertebrates. Biochem. Biophys. Res. Commun. 249:822–828.
  32. Y. HayakawaA. OhnishiA. YamanakaS. IzumiS. Tomino(1995) Molecular cloning and characterization of cDNA for insect biogenic peptide, growth-blocking peptide. FEBS Lett. 376:185–189.
  33. S. HinumaY. HabataR. FujiiY. KawamataM. HosoyaS. FukusumiC. KitadaY. MasuoT. AsanoH. Matsumoto(1998) A prolactin-releasing peptide in the brain. Nature 393:272–276.
  34. S. HinumaY. ShintaniS. FukusumiN. IijimaY. MatsumotoM. HosoyaR. FujiiT. WatanabeK. KikuchiY. Terao(2000) New neuropeptides containing carboxy-terminal RFamide and their receptor in mammals. Nat. Cell. Biol. 2:703–708.
  35. S.P. HolmesH. HeA.C. ChenG.W. IvieP.V. Pietrantonio(2000) Cloning and transcriptional expression of a leucokinin-like peptide receptor from the southern cattle tick, Boophilus microplus (Acari: Ixodidae). Insect Mol. Biol. 9:457–465.
  36. F.M. HorodyskiJ. EwerL.M. RiddifordJ.W. Truman(1993) Isolation, characterization and expression of the eclosion hormone gene of Drosophila melanogaster. Eur. J. Biochem. 215:221–228.
  37. C.H.V. Hoyle(1999) Neuropeptide families and their receptors: Evolutionary perspectives. Brain Res. 848:1–25.
  38. S.Y. HsuM. KudoT. ChenK. NakabayashiA. BhallaP.J. van der SpekM. van DuinA.J.W. Hsueh(2000) The three subfamilies of leucine-rich repeat-containing G protein-coupled receptors (LGR): Identification of LGR6 and LGR7 and the signaling mechanism for LGR7. Mol. Endocrinol. 14:1257–1271.
  39. G.R. HuesmannC.C. CheungP.K. LoiT.D. LeeK.M. SwiderekN.J. Tublitz(1995) Amino acid sequence of CAP2b, an insect cardioacceleratory peptide from the tobacco hawkmoth Manduca sexta. FEBS Lett. 371:311–314.
  40. H.A. JohardC.T. LundquistA. RokaeusD.R. Nässel(1992) Autoradiographic localization of 125I-galanin binding sites in the blowfly brain. Regul. Pept. 42:123–134.
  41. A.H. Johnsen(1998) Phylogeny of the cholecystokin/gastrin family. Front. Neuroendocrinol. 19:73–99.
  42. H. KataokaA. ToschiJ.P. LiR.L. CarneyD.A. SchooleyS.J. Kramer(1989) Identification of an allatropin from adult Manduca sexta. Science 243:1481–1483.
  43. T. KawanoH. KataokaH. NagasawaA. IsogaiA. Suzuki(1992) cDNA cloning and sequence determination of the pheromone biosynthesis activating neuropeptide of the silkworm, Bombyx mori. Biochem. Biophys. Res. Commun. 189:221–226.
  44. A.J. KimG.-H. ChaK. KimL.I. GilbertC.C. Lee(1997) Purification and characterization of the prothoracicotropic hormone of Drosophila melanogaster. Proc. Natl. Acad. Sci. 94:1130–1135.
  45. K.D. KimuraH.A. TissenbaumY. LiuG. Ruvkun(1997) daf-2, an insulin receptor-like gene that regulates longevity and diapause in Caenorhabditis elegans. Science 15:942–946.
  46. L.F. Kolakowski(1994) GCRDb: A G-protein-coupled receptor database. Receptors Channels 2:1–7.
  47. J. KoolmanY.-J. HuaD. ByelansA. De Loof(1995) A potential prothoracostatic hormone (PTSH) from flies. in Molecular mechanisms of insect metamorphosis and diapause, ed A. Suzuki(Industrial Pub. Tokyo), pp 45–54.
  48. B. KostronD. MarketJ. KellermannC.E. CarterH.W. Honegger(1999) Antisera against Periplaneta americana Cu,Zn-superoxide dismutase (SOD): Separation of the neurohormone bursicon from SOD, and immunodetection of SOD in the central nervous system. Insect Biochem. Mol. Biol. 29:861–871.
  49. S.J. KramerA. ToschiC.A. MillerH. KataokaG.B. QuistadJ.P. LiR.L. CarneyD.A. Schooley(1991) Identification of an allatostatin from the tobacco hornworm Manduca sexta. Proc. Natl. Acad. Sci. 88:9458–9462.
  50. E. Kubli(1992) The sex-peptide. Bioessays 14:779–784.
  51. K.J. LeeT.S. EltonA.K. BejS.A. WattsR.D. Watson(1995) Molecular cloning of a cDNA encoding putative molt-inhibiting hormone from the blue crab, Callinectes sapidus. Biochem. Biophys. Res. Commun. 209:1126–1131.
  52. H.K. LehmanC.M. MurgiucT.A. MillerT.D. LeeJ.G. Hildebrand(1993) Crustacean cardioactive peptide in the sphinx moth, Manduca sexta. Peptides 14:735–741.
  53. C. LenzL. SondergaardC.J. Grimmelikhuijzen(2000a) Molecular cloning and genomic organization of a novel receptor from Drosophila melanogaster structurally related to mammalian galanin receptors. Biochem. Biophys. Res. Commun. 269:91–96.
  54. C. LenzM. WilliamsonC.J. Grimmelikhuijzen(2000b) Molecular cloning and genomic organization of a second probable allatostatin receptor from Drosophila melanogaster. Biochem. Biophys. Res. Commun. 273:571–577.
  55. (2000c) Molecular cloning and genomic organization of an allatostatin preprohormone from Drosophila melanogaster. Biochem. Biophys. Res. Commun. 273:1126–1131, ibid.
  56. X.J. LiW.J. WolfgangY.N. WuR.A. NorthM. Forte(1991) Cloning, heterologous expression and developmental regulation of a Drosophila receptor for tachykinin-like peptides. EMBO J. 10:3221–3229.
  57. X.J. LiY.N. WuR.A. NorthM. Forte(1992b) Cloning, functional expression, and developmental regulation of a neuropeptide Y receptor from Drosophila melanogaster. J. Biol. Chem. 267:9–12.
  58. C.T. LundquistA. RokaeusD.R. Nässel(1991) Galanin immunoreactivity in the blowfly nervous system: Localization and chromatographic analysis. J. Comp. Neurol. 312:77–96.
  59. C.T. LundquistH.A. JohardA. RokaeusD.R. Nässel(1993) Galanin immunoreactivity and 125I-galanin binding sites in the blowfly brain. Acta. Biol. Hung. 44:51–54.
  60. E.P. MaslerA.K. RainaR.M. WagnerJ.P. Kochansky(1994) Isolation and identification of a pheromonotropic neuropeptide from the brain-suboesophageal ganglion complex of Lymantria dispar: A new member of the PBAN family. Insect Biochem. Mol. Biol. 24:829–836.
  61. J. MeredithM. RingA. MacinsJ. MarschallN.N. ChengD. TheilmannH.W. BrockJ.E. Phillips(1996) Locust ion transport peptide (ITP): Primary structure, cDNA and expression in a baculovirus system J. Exp. Biol. 199:1053–1061.
  62. D. MonnierJ.F. ColasP. RosayR. HenE. BorrelliL. Maroteaux(1992) NKD, a developmentally regulated tachykinin receptor in Drosophila. J. Biol. Chem. 267:1298–1302.
  63. M.S. MooreJ. DeZazzoA.Y. LukT. TullyC.M. SinghU. Heberlein(1998) Ethanol intoxication in Drosophila: Genetic and pharmacological evidence for regulation by the cAMP signaling pathway. Cell 93:997–1007.
  64. J.R. NambuC. Murphy-ErdoshP.C. AndrewsG.J. FeistnerR.H. Scheller(1988) Isolation and characterization of a Drosophila neuropeptide gene. Neuron 1:55–61.
  65. D.R. NässelM.G. PerssonJ.E. Muren(2000) Baratin, a nonamidated neurostimulating neuropeptide, isolated from cockroach brain: Distribution and actions in the cockroach and locust nervous systems. J. Comp. Neurol. 422:267–286.
  66. R. Nichols(1992) Isolation and structural characterization of Drosophila TDVDHVFLRFamide and FMRFamide-containing neural peptides. J. Mol. Neurosc. 3:213–218.
  67. R. NicholsS.A. SchneuwlyJ.E. Dixon(1988) Identification and characterization of a Drosophila homologue to the vertebrate neuropeptide cholecystokinin. J. Biol. Chem. 263:12167–12170.
  68. B.E. NoyesF.N. KatzM.H. Schaffer(1995) Identification and expression of the Drosophila adipokinetic hormone gene. Mol. Cell. Endocrinol. 109:133–141.
  69. S. OggG. Ruvkun(1998) The C. elegans PTEN homolog, DAF-18, acts in the insulin receptor-like metabolic signaling pathway. Mol. Cell 2:887–893.
  70. S. OggS. ParadisS. GottliebG.I. PattersonL. LeeH.A. TissenbaumG. Ruvkun(1997) The Fork head transcription factor DAF-16 transduces insulin-like metabolic and longevity signals in C. elegans. Nature 389:994–999.
  71. M. OttigerM. SollerR.F. StockerE. Kubli(2000) Binding sites of Drosophila melanogaster sex peptide pheromones. J. Neurobiol. 44:57–71.
  72. J.H. ParkJ.C. Hall(1998) Isolation and chronobiological analysis of a neuropeptide pigment-dispersing factor gene in Drosophila melanogaster. J. Biol. Rhythms 13:219–228.
  73. Y. ParkD. ŽitňanS.S. GillM.E. Adams(1999) Molecular cloning and biological activity of ecdysis-triggering hormones in Drosophila melanogaster. FEBS Lett. 463:133–138.
  74. D.A. PetrovD.L. Hartl(2000) Pseudogene evolution and natural selection for a compact genome. J. Hered. 91:221–227.
  75. J.D. Reagan(1994) Expression cloning of an insect diuretic hormone receptor. A member of the calcitonin/secretin receptor family. J. Biol. Chem. 269:9–12.
  76. M.G. ReeseG. HartzellN.L. HarrisU. OhlerJ.F AbrilS.E. Lewis(2000) Genome annotation assessment in Drosophila melanogaster. Genome Res. 10:483–501.
  77. P. RosayJ.F. ColasL. Maroteaux(1995) Dual organisation of the Drosophila neuropeptide receptor NKD gene promoter. Mech. Dev. 51:329–339.
  78. G.M. RubinM.D. YandellJ.R. WortmanG.G.L. MiklosC.R. NelsonI.K. HariharanM.E. FortiniP.W. LiR. ApweilerW. Fleischmann(2000) Comparative genomics of the eukaryotes. Science 287:2204–2215.
  79. Y. SatoM. OguchiN. MenjoK. ImaiH. SaitoM. IkedaM. IsobeO. Yamashita(1993) Precursor polyprotein for multiple neuropeptides secreted from the suboesophageal ganglion of the silkworm Bombyx mori: Characterization of the cDNA encoding the diapause hormone precursor and identification of additional peptides. Proc. Natl. Acad. Sci. 90:3251–3255.
  80. L.E. SchneiderP.H. Taghert(1988) Isolation and characterization of a Drosophila gene that encodes multiple neuropeptides related to Phe-Met-Arg-Phe-NH2 (FMRFamide). Proc. Natl. Acad. Sci. USA 85:1993–1997.
  81. P. SivasubramanianS. FriedmanG. Fraenkel(1974) Nature and role of proteinaceous hormonal factors acting during puparium formation in flies. Biol. Bull. 147:163–185.
  82. R.J. SiviterG.M. CoastA.M. WintherR.J. NachmanC.A. TaylorA.D. ShirrasD. CoatesR.E. IsaacD.R. Nässel(2000) Expression and functional characterization of a Drosophila neuropeptide precursor with homology to mammalian preprotachykinin A. J. Biol. Chem. 275:23273–23280.
  83. E.L. SonnhammerS.R. EddyR. Durbin(1997) Pfam: A comprehensive database of protein domain families based on seed alignments. Proteins 28:405–420.
  84. S. St-OngeJ.-P. FortinM. LabarreA. SteyaertR. SchmidtS. AhmadP. WalkerK. Payza(2000) In vitro pharmacology of NPFF and FMRFamide-related peptides at the PR4 receptor of Drosophila melanogaster. Soc. Neurosci. Abstr. 26:140.9.
  85. C.D. StraderT.M. FongM.R. TotaD. Underwood(1994) Structure and function of G protein-coupled receptors. Annu. Rev. Biochem. 63:101–132.
  86. B. SunA.V. SchallyG. Halmos(2000) The presence of receptors for bombesin/GRP and mRNA for three receptor subtypes in human ovarian epithelial cancers. Regul. Pept. 90:77–84.
  87. J.W. TamsS.M. KnudsenJ. Fahrenkrug(1998) Proposed arrangement of the seven transmembrane helices in the secretin receptor family. Receptors Channels 5:79–90.
  88. C.P. TensenE.R. Van KesterenR.J. PlantaK.J. CoxJ.F. BurkeH. van HeerikhuizenE. Vreugdenhil(1994) A G protein-coupled receptor with low density lipoprotein-binding motifs suggests a role for lipoproteins in G-linked signal transduction. Proc. Natl. Acad. Sci. 91:4816–4820.
  89. S. TerhzazF.C. O'ConnellV.P. PollockL. KeanS.A. DaviesJ.A. VeenstraJ.A. Dow(1999) Isolation and characterization of a leucokinin-like peptide of Drosophila melanogaster. J. Exp Biol. 202:3667–3676.
  90. J.D. ThompsonT.J. GibsonF. PlewniakF. JeanmouginD.G. Higgins(1997) The ClustalX windows interface: Flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 24:4876–4882.
  91. H.A. TissenbaumG. Ruvkun(1998) An insulin-like signaling pathway affects both longevity and reproduction in Caenorhabditis elegans. Genetics 148:703–717.
  92. K. Ui-TeiM. SakumaY. WatanabeT. MiyakeY. Miyata(1995) Chemical analysis of neurotransmitter candidates in clonal cell lines from Drosophila central nervous system, II: Neuropeptides and amino acids. Neurosci. Lett. 195:187–190.
  93. J. Vanden Broeck(2001) Neuropeptides and their precursors in the fruitfly, Drosophila melanogaster. Peptides 22:241–254.
  94. J.A. Veenstra(1994) Isolation and structure of the Drosophila corazonin gene. Biochem. Biophys. Res. Commun. 204:292–296.
  95. L.B. VosshallH. AmreinP.S. MorozovA. RzhetskyR. Axel(1999) A spatial map of olfactory receptor expression in the Drosophila antenna. Cell 96:725–736.
  96. M. WilliamsonC. LenzM.E. WintherD.R. NässelC. Grimmelikhuijzen(2001) Molecular cloning, genomic organization, and expression of a B-type (cricket type) allatostatin preprohormone from Drosophila melanogaster. Biochem. Biophys. Res. Commun. 281:544–550.
  97. W.H. XuY. SatoM. IkedaO. Yamashita(1995) Molecular characterization of the gene encoding the precursor protein of diapause hormone and pheromone biosynthesis activating neuropeptide (DH-PBAN) of the silkworm, Bombyx mori and its distribution in some insects. Biochim. Biophys. Acta 1261:83–89.
  98. J.G. YoonB. Stay(1995) Immunocytochemical localization of Diploptera punctata allatostatin-like peptide in Drosophila melanogaster. J. Comp. Neurol. 363:475–488.
  99. J. ZdarekR.J. NachmanT. Hayes(1997) Insect neuropeptides of the pyrokinin/PBAN family accelerate pupariation in the fleshfly (Sarcophaga bullataria) larvae. Ann. NY Acad. Sci. 814:67–71.
  100. Y. ZhongL.A. Pena(1995) A novel synaptic transmission mediated by a PACAP-like neuropeptide in Drosophila. Neuron 14:527–536.
  101. D. ŽitňanF. SehnalP.J. Bryant(1993) Neurons producing specific neuropeptides in the central nervous system of normal and pupariation-delayed Drosophila. Dev. Biol. 156:117–135.
NOTE ADDED IN PROOF

We have identified two additional ESTs for peptide GPCRs: AT008361(CG1147) and AT0019640 (CG13229). Also, based on discussions with Jan Veenstra (Universite Bordeaux) we now add two additional peptide genes to our list: the SIFamide gene (currently listed as part ofCG4681; Ifa, Vanden Broeck 2001) and the hugingene (CG6371).

Loading
Loading
Loading
Back to top