Spectral Biclustering of Microarray Data: Coclustering Genes and Conditions

(Downloading may take up to 30 seconds. If the slide opens in your browser, select File -> Save As to save it.)

Click on image to view larger version.

Figure 1.Figure 1.Figure 1.
Figure 1.

Overview of important parts of the biclustering process. (A) shows the problem: shuffling a gene expression matrix to reveal a checkerboard pattern associating genes with conditions. (B) shows how this problem can be approached through solving an “eigenproblem.” If a gene expression matrix A has a checkerboard structure, applying it to a step-like condition classification vector x will result in a step-like gene classification vector y. Moreover, if one then appliesAT to y, one will regenerate a step-like condition classification vector with the same partitioning structure asx. This suggests one can determine whether A has a checkerboard structure through solving an eigenvalue problem. In other words, if A has a (hidden) checkerboard structure, there exist some piecewise constant partition vectorsx = v * andy = u * such thatA T Av * = λ2 v *andAAT u * = λ2 u *(bottom quadrant of part B). Note that most eigenvectorsv of the eigenvalue problem A T Av = λ2 v (symbolized by a zigzag structure) are not embedded in the subspace of classification (step-like) vectors x possessing the same partitioning structure, as indicated by a gray arrow protruding from this subspace (parallelogram). On the other hand, piecewise constant (step-like) partition eigenvectors v * are embedded in this subspace and are indicated by a green arrow. To reveal whether the data have a checkerboard structure, one can inspect whether some of the pairs of monotonically sorted gene and tumor eigenvectorsv i and u i have an approximate stepwise (piecewise) constant structure. The outer product u * v * Tof the sorted partitioning eigenvectors gives a checkerboard structure. (C) shows how rescaling of matrix A can lead to improved copartitioning of genes and conditions.

This Article

  1. Genome Res. 13: 703-716

Preprint Server