The completion of the Mammalian Gene Collection (MGC)

Click on image to view larger version.

Table 1.
  • aGenes counted in Classes B, C, and D are subsets of Class A and not mutually exclusive.

  • bCurated RefSeq transcripts (NM-accession transcripts) are a subset of RefSeq transcripts that have been validated based on protein and DNA evidence.

  • cHuman genes in this category were identified by searching OMIM for records with “phenotype description, molecular basis known” and “gene with known sequence and phenotype” and then retrieving Gene Links that are not in the phenotype-only category. Mouse and rat genes in this category were identified using NCBI HomoloGene links for the above-mentioned human genes.

  • dConsensus CDS (CCDS) includes a subset of transcripts with agreement on the full CDS by annotation specialists at NCBI, European Bioinformatics Institute, University of California at Santa Cruz, and the Wellcome Trust-Sanger Institute (Pruitt et al. 2009a); because the numbers are based on RefSeq mRNAs in the CCDS set that are current as of March 23, 2009, they are less than the total CCDS gene number. (NA) Not applicable; CCDS genes have not been defined for rat.

This Article

  1. Genome Res. 19: 2324-2333

Preprint Server