RT Journal A1 Liu, Xiaoming A1 Fu, Yun-Xin A1 Maxwell, Taylor J. A1 Boerwinkle, Eric T1 Estimating population genetic parameters and comparing model goodness-of-fit using DNA sequences with error JF Genome Research JO Genome Research YR 2010 FD January 01 VO 20 IS 1 SP 101 OP 109 DO 10.1101/gr.097543.109 UL http://genome.cshlp.org/content/20/1/101.abstract AB It is known that sequencing error can bias estimation of evolutionary or population genetic parameters. This problem is more prominent in deep resequencing studies because of their large sample size n, and a higher probability of error at each nucleotide site. We propose a new method based on the composite likelihood of the observed SNP configurations to infer population mutation rate θ = 4Neμ, population exponential growth rate R, and error rate ɛ, simultaneously. Using simulation, we show the combined effects of the parameters, θ, n, ɛ, and R on the accuracy of parameter estimation. We compared our maximum composite likelihood estimator (MCLE) of θ with other θ estimators that take into account the error. The results show the MCLE performs well when the sample size is large or the error rate is high. Using parametric bootstrap, composite likelihood can also be used as a statistic for testing the model goodness-of-fit of the observed DNA sequences. The MCLE method is applied to sequence data on the ANGPTL4 gene in 1832 African American and 1045 European American individuals.