RT Journal A1 Earl, Dent A1 Bradnam, Keith A1 St. John, John A1 Darling, Aaron A1 Lin, Dawei A1 Fass, Joseph A1 Yu, Hung On Ken A1 Buffalo, Vince A1 Zerbino, Daniel R. A1 Diekhans, Mark A1 Nguyen, Ngan A1 Ariyaratne, Pramila Nuwantha A1 Sung, Wing-Kin A1 Ning, Zemin A1 Haimel, Matthias A1 Simpson, Jared T. A1 Fonseca, Nuno A. A1 Birol, İnanç A1 Docking, T. Roderick A1 Ho, Isaac Y. A1 Rokhsar, Daniel S. A1 Chikhi, Rayan A1 Lavenier, Dominique A1 Chapuis, Guillaume A1 Naquin, Delphine A1 Maillet, Nicolas A1 Schatz, Michael C. A1 Kelley, David R. A1 Phillippy, Adam M. A1 Koren, Sergey A1 Yang, Shiaw-Pyng A1 Wu, Wei A1 Chou, Wen-Chi A1 Srivastava, Anuj A1 Shaw, Timothy I. A1 Ruby, J. Graham A1 Skewes-Cox, Peter A1 Betegon, Miguel A1 Dimon, Michelle T. A1 Solovyev, Victor A1 Seledtsov, Igor A1 Kosarev, Petr A1 Vorobyev, Denis A1 Ramirez-Gonzalez, Ricardo A1 Leggett, Richard A1 MacLean, Dan A1 Xia, Fangfang A1 Luo, Ruibang A1 Li, Zhenyu A1 Xie, Yinlong A1 Liu, Binghang A1 Gnerre, Sante A1 MacCallum, Iain A1 Przybylski, Dariusz A1 Ribeiro, Filipe J. A1 Yin, Shuangye A1 Sharpe, Ted A1 Hall, Giles A1 Kersey, Paul J. A1 Durbin, Richard A1 Jackman, Shaun D. A1 Chapman, Jarrod A. A1 Huang, Xiaoqiu A1 DeRisi, Joseph L. A1 Caccamo, Mario A1 Li, Yingrui A1 Jaffe, David B. A1 Green, Richard E. A1 Haussler, David A1 Korf, Ian A1 Paten, Benedict T1 Assemblathon 1: A competitive assessment of de novo short read assembly methods JF Genome Research JO Genome Research YR 2011 FD September 16 DO 10.1101/gr.126599.111 UL http://genome.cshlp.org/content/early/2011/11/02/gr.126599.111.abstract AB Low-cost short read sequencing technology has revolutionized genomics, though it is only just becoming practical for the high-quality de novo assembly of a novel large genome. We describe the Assemblathon 1 competition, which aimed to comprehensively assess the state of the art in de novo assembly methods when applied to current sequencing technologies. In a collaborative effort, teams were asked to assemble a simulated Illumina HiSeq data set of an unknown, simulated diploid genome. A total of 41 assemblies from 17 different groups were received. Novel haplotype aware assessments of coverage, contiguity, structure, base calling, and copy number were made. We establish that within this benchmark: (1) It is possible to assemble the genome to a high level of coverage and accuracy, and that (2) large differences exist between the assemblies, suggesting room for further improvements in current methods. The simulated benchmark, including the correct answer, the assemblies, and the code that was used to evaluate the assemblies is now public and freely available from http://www.assemblathon.org/.