Identification of polymorphic motifs using probabilistic search algorithms

Table 3.

Detailed results pertaining to synthetic data set 3 for five independent simulation runs


Characteristics of Synthetic Data Matrices


Population
Motif length
Simulation number
Sites in motifa
Frequencies of “1” at motif sites
Number of sweeps to convergence
Whether converged to “correct” motif
Population 1 (D1) 10 1 17 22 24 27 28 36 39 40 43 50 855 845 847 837 855 847 843 847 827 841 38 YES
2 1 4 8 15 19 20 22 37 39 44 855 832 848 867 861 874 850 858 851 849 27 YES
3 6 7 8 9 11 16 20 21 35 47 876 846 837 852 830 855 843 851 859 849 64 YES
4 3 11 13 27 35 39 42 46 47 50 846 868 851 834 874 874 837 848 832 856 35 YES
5 5 7 10 16 22 29 30 33 42 44 830 846 827 840 851 860 853 851 858 826 84 YES
Population 2 (D2) 15 1 17 22 24 27 28 36 39 40 43 50 18 31 34 42 48 863 845 867 854 875 846 851 858 844 832 845 827 816 888 855 13 YES
2 1 4 8 15 19 20 22 37 39 44 7 12 21 31 46 857 821 852 882 863 887 838 866 858 853 828 890 886 830 850 37 YES
3 6 7 8 9 11 16 20 21 35 47 2 27 36 37 41 873 835 839 859 828 853 863 851 876 848 845 846 890 824 816 77 YES
4 3 11 13 27 35 39 42 46 47 50 5 15 26 37 38 844 874 848 831 867 860 819 862 834 856 827 809 886 826 856 11 YES
5 5 7 10 16 22 29 30 33 42 44 1 8 27 35 46 843 840 821 848 854 880 838 836 847 820 821 845 876 860 881 28 YES
Population 3 (D3) 15 1 17 22 24 27 28 36 39 40 43 50 3 9 15 33 44 870 843 847 841 824 857 859 864 822 845 882 828 811 892 852 42 YES
2 1 4 8 15 19 20 22 37 39 44 9 18 26 29 49 865 812 831 868 863 877 833 855 836 824 833 837 890 886 854 23 YES
3 6 7 8 9 11 16 20 21 35 47 12 25 30 39 42 860 833 845 873 821 839 852 872 857 846 886 826 876 854 843 58 YES
4 3 11 13 27 35 39 42 46 47 50 10 20 21 32 50 846 861 826 838 882 867 834 847 837 872 829 889 883 826 840 61 YES


5
5 7 10 16 22 29 30 33 42 44 6 13 20 37 47
844 842 826 849 852 840 851 834 879 825 811 827 885 864 867
12
YES
  • a The sites indicated in italics are the five new sites that are specific to the daughter population, (D2 and D3) in each simulation run, in addition to the 10 sites of the ancestral population (D1)

This Article

  1. Genome Res. 15: 67-77

Preprint Server