The first Korean genome sequence and analysis: Full genome sequencing for a socio-ethnic group

Sung-Min Ahn; Tae-Hyung Kim; Sunghoon Lee; Deokhoon Kim; Ho Ghang; Daesoo Kim; Byoung-Chul Kim; Sang-Yoon Kim; Woo-Yeon Kim; Chulhong Kim; Daeui Park; Yong Seok Lee; Sangsoo Kim; Rohit Reja; Sungwoong Jho; Chang Geun Kim; Ji-Young Cha; Kyung-Hee Kim; Bonghee Lee; Jong Bhak; Seong-Jin Kim

doi:10.1101/gr.092197.109

The first Korean genome sequence and analysis: Full genome sequencing for a socio-ethnic group

¹ Lee Gil Ya Cancer and Diabetes Institute;
² Korean BioInformation Center;
³ Soongsil University;
⁴ National Center for Standard Reference Data;
⁵ Gachon University Gil Hospital;
⁶ Gachon University of Medicine and Science

E-mail: jongbhak{at}yahoo.com

Abstract

We present the first Korean individual genome sequence (SJK) and analysis results. The diploid genome of a Korean male was sequenced to 28.95-fold redundancy using the Illumina paired-end sequencing method. SJK covered 99.9% of the NCBI human reference genome. We identified 420,083 novel SNPs that are not in the dbSNP database. Despite a close similarity, significant differences were observed between the Chinese genome (YH), the only other Asian genome available, and SJK: 1) 39.87% (1,371,239 out of 3,439,107) SNPs were SJK-specific (49.51% against Venter's, 46.94% against Watson's, and 44.17% against the Yoruba genomes), 2) 99.5% (22,495 out of 22,605) of short indels (< 4 bp) discovered on the same loci had the same size and type as YH, and 3) 11.3% (331 out of 2920) deletion structural variants were SJK-specific. Even after attempting to map unmapped reads of SJK to unanchored NCBI scaffolds, HGSV, and available personal genomes, there were still 5.77% SJK reads that could not be mapped. All these findings indicate that the overall genetic differences among individuals from closely related ethnic groups may be significant. Hence, constructing reference genomes for minor socio-ethnic groups will be useful for massive individual genome sequencing.

Footnotes

- Received February 3, 2009.
- Accepted May 22, 2009.
This manuscript is Open Access.

The first Korean genome sequence and analysis: Full genome sequencing for a socio-ethnic group

Abstract

Footnotes

This Article

Article Category

Services

Citing Articles

Google Scholar

PubMed/NCBI

Related Content

Share

Preprint Server

Current Issue

In This Issue

The first Korean genome sequence and analysis: Full genome sequencing for a socio-ethnic group

Abstract

Footnotes

Related Article

This Article

Article Category

Services

Citing Articles

Google Scholar

PubMed/NCBI

Related Content

Share

Preprint Server

Current Issue

In This Issue