
Flowchart for gene name and CDS annotation steps during MATRICS. The 60,770 FANTOM2 clones were clustered into 33,409 transcriptional units (The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase I & II Team 2002). The gene name and CDS of representative clones from each TU were annotated by MATRICS curators. Of 17,200 protein-coding clones, 12,454 and 2128 were annotated as having a complete and partial CDS, respectively. The remaining 2618 clones have problems such as “immature,” “UTR,” or “unknown” (see Methods). Another 187 complete CDS and 50 partial CDS clones could not be translated because of low sequence quality.











