
Database sequence similarity searches. The diagram depicts the extent of overlap between the reference sequence (101B6; top solid line) and a subset (as of 12–99) of other highly paralogous (>90%) GenBank sequences (lower solid lines). An asterisk (*) before a sequence name denotes a clone in the HTGS phase of GenBank. These overlaps are placed in the context of ancestral duplications from 4q24, Xq28, and 2p12 (see text). Horizontal broken lines indicate a gap in the target sequence, whereas vertical broken lines indicate the positions of repeat sequences. The paralogous nonprocessed pseudogene fragments of the adrenoleukodystrophy gene, AA393779, and UniGene cluster Hs.135840, as well as the immunoglobulin κ-variable chain segment, are shown as filled boxes. The direction of transcription (arrows) and the exon–intron structure with respect to the ancestral (expressed) sequence are indicated. GC-rich repeat elements, such as the telomeric associated repeat (TAR) and GC-rich interspersed repeats, are indicated by hatched boxes.











