|
|
|
|
|
Genome Res. 14:640-650, 2004 ©2004 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/04 $5.00 Letter Identification of Candidate Disease Genes by EST Alignments, Synteny, and Expression and Verification of Ensembl Genes on Rat Chromosome 1q43-541 Incyte Corporation, Palo Alto, California 94304, USA 2 Human and Molecular Genetics Center and Department of Physiology, Medical College of Wisconsin, Milwaukee, Wisconain 53226, USA
We aligned Incyte ESTs and publicly available sequences to the rat genome and analyzed rat chromosome 1q43 [PDB] -54, a region in which several quantitative trait loci (QTLs) have been identified, including renal disease, diabetes, hypertension, body weight, and encephalomyelitis. Within this region, which contains 255 Ensembl gene predictions, the aligned sequences clustered into 568 Incyte genes and gene fragments. Of the Incyte genes, 261 (46%) overlapped 184 (72%) of the Ensembl gene predictions, whereas 307 were unique to Incyte. The rat-to-human syntenic map displays rearrangement of this region on rat chr. 1 onto human chromosomes 9 and 10. The mapping of corresponding human disease phenotypes to either one of these chromosomes has allowed us to focus in on genes associated with disease phenotypes. As an example, we have used the syntenic information for the rat Rf-1 disease region and the orthologous human ESRD disease region to reduce the size of the original rat QTL to only 11.5 Mb. Using the syntenic information in combination with expression data from ESTs and microarrays, we have selected a set of 66 candidate disease genes for Rf-1. The combination of the results from these different analyses represents a powerful approach for narrowing the number of genes that could play a role in the development of complex diseases.
The recent publication of the rat genome sequence makes it possible to study genes in the context of their genomic location as well as their syntenic association to genes characterized on the human and mouse genomes. This is of particular value for the identification of genes responsible for disease phenotypes (Leo et al. 2002
The Ensembl database provides information on known rat genes as well as rat gene predictions that are derived by an automated pipeline using a combination of ab-initio Genscan predictions, GeneWise protein homology alignments, or a set of aligned rat and mouse cDNAs. However, computationally based gene predictions have inherent limitations (Burge and Karlin 1997
In addition to the characterization of genes on the rat genome, the identification of human orthologs is critical to link identified disease genes across species. Reciprocal BLAST sequence comparison between genes from two species is a commonly used approach to identify orthologs. However, this method is most reliable when used in species with complete transcriptomes, which are not yet available in either human or rat (Tatusov et al. 1997
Rat chromosome 1 (chr. 1) at 1q43
[PDB]
-54 encompasses several different QTLs, including renal failure 1 (Rf-1; Brown et al. 1996 We use the segmentation of the syntenic map between rat and human to reduce the size of the rat Rf-1 QTL region. Furthermore, by using rat EST alignments to rat chr. 1q43 [PDB] -54, we have provided transcriptional evidence for rat genes, as well as information on organ expression, which was previously unavailable for a large set of these genes. We demonstrate a distinctive approach in the study of candidate disease genes that combines syntenic and disease linkage information across two or more species, as well as expression information, to reduce the number of candidate disease genes.
Genomic Alignments To determine EST-based genes, we aligned and clustered 1.2 M Incyte and 0.3 M public rat ESTs, 4248 rat Refseq, and 5605 protein-containing GenPept mRNA sequences from GenBank to the rat genomic sequence. A total of 4248 RefSeq sequences generated 3890 rat genes. In addition, 35,860 EST-inferred gene fragments that contained three or more clones were formed. Some of these rat gene fragments represent underclustered segments of the same genes and need to be studied to derive a final number of genes. Of the Incyte EST-inferred genes, 20,452 (57%) were composed of mixed public and Incyte EST content, whereas 15,408 (43%) contained only Incyte ESTs, which could potentially represent novel genes. To further validate Incyte EST-based genes, we examined protein predictions and homology to human genes. A total of 16,017 (45%) of the Incyte rat gene fragments contained protein predictions and 26,061 (72%) had human syntenic homologs. To assess how many of these genes are not represented in the Ensembl database, we compared overlaps between our set of genes and the 20,906 rat gene predictions found in the Ensembl database. The number of genes overlapping in both the Ensembl and the Incyte data set is shown in Figure 1. The Incyte data set contains EST-based gene evidence for 81% of the Ensembl gene predictions, whereas more than half of the Incyte gene fragments do not overlap any of the Ensembl gene predictions.
Aligned Sequences on Rat chr. 1q43-54 Within the rat chr. 1 coordinates, 232 Mb (1q43 [PDB] ) to 264 Mb (1q54 [PDB] ), a total of 11,827 Incyte and 2891 public domain ESTs from 435 libraries were aligned (from 11,357 and 2704 clones, respectively). In addition to EST sequences, 42 RefSeq and 61 mRNA sequences with corresponding protein sequences in GenPept aligned to rat chr. 1q43 [PDB] -54. The RefSeq sequences and GenPept-derived mRNA sequences clustered into 42 genes containing 4129 clones. A total of 9932 additional clones formed 526 gene fragments, containing three or more clones or at least one mRNA splice site. As RefSeq sequences are mapped in both the Incyte and Ensembl data set, we compared the agreement between the RefSeq-containing genes in both data sets. The Ensembl database contained 50 known rat genes and 205 novel gene predictions in the same region of chr. 1q43 [PDB] -54. Of the known genes, 46 referenced a RefSeq sequence, but two genes at two separate locations shared the same RefSeq sequence (NM_012963 [GenBank] and NM_017272 [GenBank] ), whereas RefSeq locations in the Incyte database are unique. A total of 40 of the RefSeq sequences in both data sets mapped to corresponding same gene locations. The ensembl data set contained three additional RefSeq sequences that Incyte did not map to this region, because the sequence alignments had low identity over the coding region (NM_020094 [GenBank] 91%, NM_020095 [GenBank] 92%, NM_053646 [GenBank] 60%). In addition, Ensembl localized RefSeq NM_033539 [GenBank] to chr. 1. However, in Incyte's database, this RefSeq was mapped with higher identity (100%) and coverage (100%) to rat chromosome 9. Moreover, NM_033539 [GenBank] maps to 17 different locations on the genome, and thus, the locus on chr. 1 is questionable. Another difference between the data sets was Incyte's mapping of NM_138512 [GenBank] to this region on chromosome 1q, which is a unique alignment of this RefSeq sequence to the genome. Furthermore, NM_017042 [GenBank] was also mapped to chr. 1q, whereas this RefSeq is mapped to chr. 15 in the Ensembl data set. We chose the mapping of this RefSeq to chr. 1 over the mapping to chr. 15 due to higher coverage (97%) and identity (95%).
Clustering of Aligned Sequences Into Genes and Gene Fragments
We identified 307 rat genes and gene fragments that did not overlap with Ensembl gene predictions. To derive further information about these gene fragments, we examined their BLAST hit comparisons to other species, their EST and splice site content, and potential protein content. Overall, 210 gene fragments contained transcripts with one or more significant BLAST sequence comparisons to other species. A total of 204 and 125 genes and gene fragments had significant BLAST sequence homology to mouse and to human genes, respectively. Most notably, 17 of the 307 genes were composed of Incyte ESTs only, had an ORF of >60 amino acids, had sequence homology to human or mouse cDNA, and showed splicing when aligned to the genome (Table 1). In contrast, two other gene fragments were identified as potential pseudogenes, as they contained single exon copies of multiexon RefSeq sequences, thus indicating a potential retrotransposition.
Verification of Ensembl Novel Gene Predictions Of the 255 Ensembl rat gene predictions in chr. 1q43 [PDB] -54, 184 (72%) had overlaps to Incyte gene and gene fragments. Of these genes with overlap to EST-containing Incyte genes, 134 were novel Ensembl predictions. To validate the Ensembl gene predictions, we examined the EST content provided by the overlapping Incyte ESTs as well as the splice sites and BLAST hits to other species. A total of 133 (65%) of the Ensembl novel genes overlapped gene fragments that had sequence homology to human, mouse, or dog sequences, as identified by BLAST sequence comparisons; 44 of the novel Ensembl gene predictions overlapped Incyte gene fragments that were composed of Incyte ESTs only and could not be confirmed with any public domain ESTs; 94 (64%) Ensembl novel genes had splice sites that could be verified by Incyte gene fragments (for example, see Fig. 2); and 40 had exonic overlaps only. Two of the Ensembl genes that overlapped Incyte EST sequences were identified as potential pseudogenes, as they overlapped Incyte genes that contained a single-exon copy of a multiexon RefSeq sequence, thus indicating a possible retrotransposon event. In addition to Ensembl genes with overlap to Incyte gene fragments, the chr. 1q43 [PDB] -54 region contained 71 Ensembl gene predictions that did not overlap an exon of an Incyte gene.
Annotation Comparison of Ensembl Gene Predictions Overlapping Incyte Genes and Gene Fragments
Differential Expression and Electronic Northern for Genes on Chromosome 1q43-54 As organ expression information can indicate the potential association of a gene to one of the disease phenotypes localized on rat chr. 1q43 [PDB] -54, we examined the anatomical expression of the genes and gene fragments in this region using both cDNA microarrays and electronic Northern analysis. Microarray analysis was performed using RNA isolated from 27 different organs and run in competitive hybridizations against a pool of all of the RNAs. The microarrays used contained 275 clones that mapped to 177 of the gene and gene fragments on rat chr. 1q43 [PDB] -54. A significant differential expression of twofold or greater for an individual organ relative to the pool of all organs was seen for 41 genes and gene fragments (Table 3).
Because not all of the rat genes in this region contained clones that were represented on the microarrays, the source organs for clones that map to each gene were examined. Mapped clones were derived from 322 Incyte libraries and 95 public domain libraries. Across all aligned clones, 40 organs were represented, of which brain (2346 clones) and liver (3405 clones) were the most abundant. A total of 57 rat genes and gene fragments showed a high specificity for one or more organs. Overall, 22 organs with gene expression information were found on chr. 1q43 [PDB] -54, including brain, pancreas, and heart, which represent some of the organs that might be related to the QTLs found in this region. To assess how redundant or additive the information of these two approaches is, we compared the results from the electronic Northern analyses and microarray experiments (Table 3). Of the 55 occurrences of increased organ expressions in the hybridization experiment, 31 (56%) were confirmed by electronic Northern. In 15 cases, the number of clones for that particular organ in the gene was below three, our threshold for deriving electronic Northern data. In nine instances, none of the clones in the gene were derived from the organ that was shown to have increased expression in the microarray experiments. The electronic Northern approach revealed an additional 39 gene-to-organ associations that were not found by the microarray data analyses.
Synteny and Homologs The syntenic map was used to verify sequence homology pairs between rat and human. The presence of a syntenically confirmed homolog in human increases the confidence level for a particular rat gene as it reveals not only BLAST hit similarity, but consistent location of the genes on both genomes, which indicates that these might be orthologous genes. A total of 274 of the rat Incyte genes and gene fragments had overlaps to a syntenic homolog in human. Most of these rat Incyte genes with syntenic homologs also had overlaps to rat Ensembl genes, with only 89 of them being Incyte unique genes. In total, 160 human genes were classified as syntenic homologous to rat genes. Of these human genes, 142 contained protein-coding sequences found in GenBank, and 18 genes contained EST sequences from GenBank and Incyte. Thus, rat genes on chr. 1q43 [PDB] -54 did not reveal syntenic homologs in human that represented potential novel human genes. The mouse region syntenic to rat chr. 1q43 [PDB] -54 contained a total of 263 gene predictions, of which 143 were known genes (54%). In contrast, the rat Ensembl database contains only 20% known genes within the corresponding syntenic region. Of these mouse genes, 195 were homologous to the rat genes on chr. 1q43 [PDB] -54, and can thus be used for additional validation of the rat genes.
Human chr. 10 was shown to be linked to end-stage renal disease (ESRD) which is similar to the Rf-1 disease found on rat chr. 1 (Freedman 2002
Candidate Rf-1 Genes The Rf-1 phenotype is likely to be caused by the misfunction of a gene expressed in kidney (Churchill et al. 1997
To investigate genes that meet this criteria, we identified all genes expressed in kidney via either microarray analyses or clone origin of the genes. In microarray studies, three genes were found to have significant differential increase of expression in the kidney (see Table 3). The human homologs for all three of these genes contained kidney EST sequences as well. In addition, we also studied microarray analyses available in the gene expression omnibus database (GEO; http://www.ncbi.nlm.nih.gov/geo/ As comparison between microarray and electronic Northern results showed that additive expression information could be derived by electronic Northern, we also included the analysis of clone content of the rat genes for the identification of genes potentially expressed in kidney tissues. The total number of clones from 23 rat kidney libraries was 74,787. Of these, 1029 mapped to rat chr. 1q43 [PDB] -54 (941 Incyte and 88 public, respectively). We selected gene or gene fragments that contained clones from at least two different kidney libraries for verification of organ expression. A total of 90 of the Incyte novel gene fragments had clones from two or more kidney libraries. As the Rf-1 orthologous human gene is believed to map to human chr. 10q, we considered only genes and gene fragments that were confirmed to have a human syntenic homolog. Thus, we derived 66 rat genes that (1) are located close to the rat markers for Rf-1 as well as human markers for ESRD, (2) have a verified human syntenic homolog, and (3) contain clones from more than two kidney libraries, or (4) had significant differential expression in the microarray studies (Supplemental file 2). For further validation of these genes and their potential involvement in the Rf-1 phenotype, we studied functional gene ontology association of these genes. The gene ontology associations found for these genes are summarized in Figure 4. The candidate disease genes contained a cluster of genes that are predicted to be involved in plasma membrane transport, which might be associated with kidney function, as well as gene products predicted to be involved in protein binding and signal transduction. Furthermore, two predicted kinases and two predicted structural proteins are represented in this set of genes. This information, as well as title lines and annotation, is provided for each gene in Supplemental file 2.
We have studied a region on rat chr. 1 that contains QTLs for several disease phenotypes, including renal failure, diabetes, hypertension, encephalomyelitis, and body weight. We validated the rat genes by identification of EST-inferred genes that are found both in rat and human. In addition, we studied microarray experiments and electronic Northern data to generate organexpression information. Using the rat-to-human syntenic map, we significantly reduced the size of the rat Rf-1 QTL locus, and thus, the number of candidate disease genes. Through the combination of expression information as well as syntenic mapping and disease linkage data for the renal failure phenotype, we present a unique approach for the selection of candidate disease genes. Within the large pool of genes mapping to a disease region, we thus highlight 66 candidate disease genes for Rf-1. The rat chr. 1q43 [PDB] -54 region contained a total of 307 Incyte rat gene or gene fragments that did not overlap any of the Ensembl gene predictions, and approximately half of these were composed of Incyte ESTs only. Manual curation and analysis of 31 of these fragments to evaluate the amount of underclustering suggests a gene fragmentation ratio of 3:1. In addition, multiple rat genes were verified syntenically as homologs of the same human gene, which also indicates underclustering in the data set. Thus, the 307 gene fragments could represent a set of 100 additional rat genes on rat chr. 1q43 [PDB] -54.
The use of chromosomal synteny has allowed us to avoid the problems inherent with a reciprocal best-hit approach. Rat-to-human syntenic comparisons enabled the identification of 26 k rat-human syntenic homolog pairs genome wide that have EST evidence in both rat and human. This is more than the ortholog pairs derived via reciprocal BLAST hits (
The complete sequencing of the rat genome enabled a number of rat gene-prediction studies to be performed, including those predictions found in the Ensembl database (Rat Genome Sequencing Project Consortium 2004
We have shown how the rearrangement of syntenic segments in human compared with rat is a valuable characteristic that can be exploited for the selection of candidate disease genes. The Rf-1 locus was described in the fawn-hooded hypertensive rat, which uniformly develops end-stage renal disease (Brown et al. 1996
The mapping of rat clones to the genomic sequence establishes a rat transcriptome (Panda et al. 2003 The impact of expression information is increased if more expression experiments can be included that address a larger number of the sequences studied. Thus, we have also investigated the experiments presented in the GEO database. This approach added two potential genes that might be of interest. Unfortunately, this expression information was limited to human, and no appropriate rat hybridization data could be found in the GEO data set. It would be interesting to extend this comparison to additional expression experiments, especially if the number of expression experiments in rat increases.
The selected rat candidate disease genes were further evaluated for their potential function and association to biological processes. To date, the information on Rf-1 phenotype is not conclusive enough to be able to associate it with a particular type of gene. The human marker associated with ESRD (D10S677; Freedman et al. 2002 The histological evaluation of affected rat Rf-1 kidney tissues indicates that this phenotype represents a lack of resistance to long-term high blood pressure in the renal glomerula. This might well be accounted for by differences in the extracellular matrix components, which resist the pressure of the blood that is being filtered at the basal lamina of the glomerula. Thus, the candidate disease genes characterized as structural proteins might be of higher interest. It is also perceivable that the four candidate disease genes that were classified to have a potential role as plasmamembrane transporters might play a crucial role in kidney function. Furthermore, two rat genes, which are the syntenic homologs of human Hps1 and Hps6, have clones from kidney and are located within this region. These genes in human have been thought to be associated with the Hermansky-Pudlak syndrome, a heterogeneous syndrome complex that involves different severe symptoms, including renal failure. It is conceivable that a separate mutation of one of these genes could lead to renal failure only, without the other associated symptoms of the syndrome. Further studies evaluating these genes are necessary to determine which gene(s)' misfunction is responsible for the Rf-1 phenotype.
To date, no mouse phenotype involved in renal failure has been shown to be localized to the mouse region syntenic to rat Rf-1 and human ESRD. However, several mouse phenotypes of gene knockouts localized on chr. 19, such as cyclooxygenase-2 and apolipoprotein-E, have been described to be associated with kidney impairment (Norwood et al. 2000 We have shown how the rearrangement of genomic segments between rat and human can be used to reduce the size of a QTL region, and thus, also the number of candidate disease genes. Furthermore, by using alignments of rat ESTs to rat chr. 1q43 [PDB] -54, which contains several QTLs, we have provided transcriptional evidence for rat genes as well as information on organ expression, which was to date unavailable for a large set of these genes. We have shown how this expression information in combination with synteny and disease-linkage information can be used to select candidate disease genes for the rat Rf-1 QTL.
cDNA Library Construction and Sequencing The Incyte Database contains 876,507 rat cDNA clones from 417 libraries that represent 40 different organs (Supplemental file 3) from Sprague-Dawley rats. From the Incyte clones, 1.2 M EST sequences were derived. In addition, 300 k public rat sequences were downloaded from GenBank (release 134). Low-quality regions, sequencing artifacts, vector sequences, and 3' polyadenylated termini were clipped from cDNA sequences; low-complexity regions and repetitive elements were masked. A total of 4248 RefSeq (3/2/2003) and 5605 nucleotide sequences derived from GenPept entries were downloaded from GenBank (release 134).
Alignment of Sequences to the Rat Genome and Clustering Into Potential Gene Loci
Analysis of Additional EST-Inferred Gene Fragments That Do Not Overlap Ensembl Genes
Verification of Ensembl Genes
Gene Annotation
Title lines and predictive Gene Ontology (GO; Ashburner et al. 2000
BKT Pfam analysis uses HMMer (version 2.1.1; Krogh et al. 1994 The Direct Transfer method uses BLAST analysis to identify the best BLAST hit among all BKL proteins. Proteins are ranked according to their Smith-Waterman scores, and preference is given to the highest-ranking characterized protein. If there are no characterized proteins among the BLAST targets, an uncharacterized protein is selected as the best BLAST target and is used in the title line, but no GO properties are captured by Direct Transfer. If the best BLAST hit is a characterized protein, GO properties from the BLAST hit are filtered to obtain more general parent GO properties, and these are applied to the uncharacterized protein. The Consensus Transfer method considers the entire set of BLAST target proteins with an E-value of 1e-10 or less, in order to identify the most predominant GO properties shared among the list of targets.
Title lines for uncharacterized proteins are written to contain both the standard phrases to describe Pfam membership, and information about similarity to the best BLAST target, with a phrase describing the function of the BLAST target. The level of similarity to the best BLAST target (very strong, strong, high, moderate, low, weak; Sander and Schneider 1991
Incyte transcripts generated by clustering of ESTs aligned to the genome were analyzed for ORFs using a 6-frame translation. Transcripts were allowed to have more than one ORF and were scored as ORF containing if they had either a terminal ORF with length
Differential Expression Analysis Furthermore, the GEO database was searched for rat or human microarray experiments that used kidney tissues. Experiments with treated or tumorigenic tissues were excluded. Data from hybridization experiments with kidney samples GSM2830 [NCBI GEO] , GSM2843 [NCBI GEO] , and GSM2871 [NCBI GEO] to human platform GDS181 were examined for clones that showed a significant increase in kidney tissue. Genes derived from clones found on this platform were identified in the human EST-based gene set and analyzed for their synteny to rat genes on chr. 1q43 [PDB] -54.
Electronic Northern
Syntenic Map and Syntenic Confirmation of Rat-to-Human Homologs
To leverage the syntenic map to determine a set of putative orthologous gene pairs, the rat EST-derived transcript sequences were compared with a corresponding set of human known and Incyte genes (Incyte LifeSeq Foundation Release 11). BLASTN (Altschul et al. 1990
Verification of Genes in the Rf-1 Disease Region
We thank George Weinstock and Kim Worley, Baylor Collegue of Medicine, for their support. We acknowledge the support of our past and present Incyte colleagues during the course of this work. We thank Richard Goold, Robert Lagace, and Brent Louie for assistance in gene analysis. We thank Scott Anderson, Qixin Bei, John Blanchard, Anissa Jones, Sarah Mullahy, Iqbal Panesar, Srikanth Patury, Pierre Rioux, Wayne Wonchoba, and Mingham Wu for providing algorithms for the analyses. We thank Cindy Dole for her editing assistance. The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.1932304.
3 Corresponding author. [Supplemental material is available online at www.genome.org.]
Altschul, S.F., Gish, W., Miller, W., Myers, E.W., and Lipman D.J. 1990. Basic local alignment search tool. J. Mol. Biol. 215: 403-410.[CrossRef][Medline] Ashburner, M., Ball, C.A., Blake, J.A., Botstein, D., Butler, H., Cherry, J.M., Davis, A.P., Dolinski, K., Dwight, S.S., Eppig, J.T., et al. 2000. Gene Ontology: Tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 25: 25-29.[CrossRef][Medline]
Bateman, A., Birney, E., Durbin, R., Eddy, S.R., Howe, K.L., and Sonnhammer, E.L.L. 2000. The Pfam Protein Families Database. Nucleic Acids Res. 28: 263-266.
Bergsteinsdottir, K., Yang, H.T., Pettersson, U., and Holmdahl, R. 2000. Evidence for common autoimmune disease genes controlling onset, severity, and chronicity based on experimental models for multiple sclerosis and rheumatoid arthritis. J. Immunol. 164: 1564-1568. Brown, D.M., Provoost, A.P., Daly, M.J., Lander, E.S., and Jacob, H.J. 1996. Renal disease susceptibility and hypertension are under independent genetic control in the fawn-hooded rat. Nat. Genet. 12: 44-51.[CrossRef][Medline] Burge, C. and Karlin, S. 1997. Prediction of complete gene structures in human genomic DNA. J. Mol. Biol. 268: 78-94.[CrossRef][Medline]
Chen, G., Paka, L., Kako, Y., Singhal, P., Duan, W., and Pillarisetti, S. 2001. A protective role for kidney apolipoprotein E. Regulation of mesangial cell proliferation and matrix expansion. J. Biol. Chem. 276: 49142-49147. Churchill, P.C., Churchill, M.C., Bidani, A.K., Griffin, K.A., Picken, M., Pravenec, M., Kren, V., Lezin, E., Wang, J.M., Wang, N., et al. 1997. Genetic susceptibility to hypertension-induced renal damage in the rat. Evidence based on kidney-specific genome transfer. J. Clin. Invest. 100: 1373-1382.[Medline] Dracheva, S.V., Remmers, E.F., Chen, S., Chang, L., Gulko, P.S., Kawahito, Y., Longman, R.E., Wang, J., Du, Y., Shepard, J., et al. 2000. An integrated genetic linkage map with 1,137 markers constructed from five F2 crosses of autoimmune disease-prone and -resistant inbred rat strains. Genomics 63: 202-226.[CrossRef][Medline]
Freedman, B.I. 2002. End-stage renal failure in African Americans: Insights in kidney disease susceptibility. Nephrol. Dial. Transplant. 17: 198-200. Freedman, B.I., Rich, S.S, Yu, H., Roh, B.H., and Bowden, D.W. 2002. Linkage heterogeneity of end-stage renal disease on human chromosome 10. Kidney Int. 62: 770-774.[CrossRef][Medline] Galli, J., Li, L.S., Glaser, A., Ostenson, C.G., Jiao, H., Fakhrai-Rad, H., Jacob, H.J., Lander, E.S., and Luthman, H. 1996. Genetic analysis of non-insulin dependent diabetes mellitus in the GK rat. Nat. Genet. 12: 31-37.[CrossRef][Medline] Jacob, H.J., Brown, D.M., Bunker, R.K., Daly, M.J., Dzau, V.J., Goodman, A., Koike, G., Kren, V., Kurtz, T., Lernmark, A., et al. 1995. A genetic linkage map of the laboratory rat, Rattus norvegicus. Nat. Genet. 9: 63-69.[CrossRef][Medline] Kovacs, P., Voigt, B., and Kloting, I. 1997. Novel quantitative trait loci for blood pressure and related traits on rat chromosomes 1, 10, and 18. Biochem. Biophys. Res. Commun. 18: 343-348. . 1998. Congenic strain confirms putative quantitative trait locus for body weight in the rat. Mamm. Genome 9: 294-296.[CrossRef][Medline] Krogh, A., Brown, M., Mian, I.S., Sjolander, K., and Haussler, D. 1994. Hidden Markov models in computational biology: Applications to protein modeling. J. Mol. Biol. 235: 1501-1531.[CrossRef][Medline]
Leo, C.P., Hsu, S.Y., and Hsueh, A.J. 2002. Hormonal genomics. 2002. Endocr. Rev. 23: 369-381. Myers, G. and Durbin, R. 2003. A table-driven, full-sensitivity similarity search algorithm. J. Comput. Biol. 10: 103-117.[CrossRef][Medline] Norwood, V.F., Morham, S.G., and Smithies, O. 2000. Postnatal development and progression of renal dysplasia in cyclooxygenase-2 null mice. Kidney. Int. 58: 2291-2300.[CrossRef][Medline] Panda, S., Sato, T.K., Hampton, G.M., and Hogenesch, J.B. 2003. An array of insights: Application of DNA chip technology in the study of cell biology. Trends. Cell. Biol. 13: 151-156.[CrossRef][Medline]
Provoost, A.P., Shiozawa, M., Van Dokkum, R.P., and Jacob, H.J. 2002. Transfer of the Rf-1 region from FHH onto the ACI background increases susceptibility to renal impairment. Physiol. Genomics 8: 123-129. Rat Genome Sequencing Project Consortium. 2004. Genome sequence of the Brown Norway Rat yields insights into mammalian evolution. Nature (in press). Sander, D. and Schneider, D. 1991. Database of homology-derived protein structures and the structural meaning of sequence alignment. Proteins 9: 56-68.[CrossRef][Medline] Sankoff, D. 2001. Gene and genome duplication. Curr. Opin. Genet. Dev. 11: 681-684.[CrossRef][Medline] Schwartz, S., Kent, W.J., Smit, A., Zhang, Z., Baertsch, R., Hardison, Haussler, D., and Miller, W. 2003. Humanmouse alignments with BLASTZ. Genome Res. 13: 103-107.
Shiozawa, M., Provoost, A.P., van Dokkum, R.P., Majewski, R.R., and Jacob, H.J. 2000. Evidence of genegene interactions in the genetic susceptibility to renal impairment after unilateral nephrectomy. J. Am. Soc. Nephrol. 11: 2068-2078. Solovyev, V. and Salamov, A. 1997. The Gene-Finder computer tools analysis of human and model organisms genome sequences. Proc. Int. Conf. Intell. Syst. Mol. Biol. 5: 294-302.[Medline]
Song, C., Hu, C.D., Masago, M., Kariyai, K., Yamawaki-Kataoka, Y., Shibatohge, M., Wu, D., Satoh, T., and Kataoka, T. 2001. Regulation of a novel human phospholipase C, PLCepsilon, through membrane targeting by Ras. J. Biol. Chem. 276: 2752-2757.
Tatusov, R.L., Koonin, E.V., and Lipman, D.J. 1997. A genomic perspective on protein families. Science 278: 631-637. Waterston, R.H., Lindblad-Toh, K., Birney, E., Rogers J., Abril, J.F., Agarwal, P., Agarwala, R., Ainscough, R., Alexandersson, M., An, P., et al. 2002. Initial sequencing and comparative analysis of the mouse genome. Nature 420: 520-562.[CrossRef][Medline] Wei, S., Wei, K., Moralejo, D.H., Ogino, T., Koike, G., Jacob, H.J., Sugiura, K., Sasaki, Y., Yamada, T., and Matsumoto, K. 1999. Mapping and characterization of quantitative trait loci for non-insulin-dependent diabetes mellitus with an improved genetic map in the Otsuka Long-Evans Tokushima fatty rat. Mamm. Genome 10: 249-258.[CrossRef][Medline] Yu, H., Sale, M., Rich, S.S., Spray, B.J., Roh, B.H., Bowden, D.W., and Freedman, B.I. 1999. Evaluation of markers on human chromosome 10, including the homologue of the rodent Rf-1 gene, for linkage to ESRD in black patients. Am. J. Kidney. Dis. 33: 294-300.[Medline]
http://ratmap.gen.gu.se/; The rat genome database. http://www.ncbi.nlm.nih.gov/genome/sts/; UniSTS: http://www.ncbi.nlm.nih.gov/geo/; GEO http://www.ensembl.org/; Ensembl
Received November 4, 2003;
accepted in revised format December 2, 2003.
|