Table 1B.
| Corpus | No. articles with N codes | Total articles | |||
| 1 | 2 | 3 | 4 | ||
| Training | 15444 | 888 | 60 | 9 | 16401 |
| Test 2000 | 2682 | 231 | 27 | 1 | 2941 |
| Test 2001 | 184 | 22 | 2 | 0 | 208 |
-
Some of the articles within the training set were obtained in more than one of the queries; thus these articles have more than a single relevant GO classification. This table lists the number of abstracts in each data set and the number of abstracts with one, two, three, and four relevant codes.











