Resource

A platform for curated products from novel open reading frames prompts reinterpretation of disease variants

    • 1Department of Genetics, University of Cambridge, Cambridge CB2 3EH, United Kingdom;
    • 2Department of Biology, Indian Institute of Science Education and Research, Pune, Maharashtra 411008, India;
    • 3Institute of Medical Genetics, Cardiff University, Heath Park, Cardiff CF14 4XN, United Kingdom;
    • 4St Edmund's College, University of Cambridge, Cambridge CB3 0BN, United Kingdom
    • 5 These authors contributed equally to this work.
Published January 19, 2021. Vol 31 Issue 2, pp. 327-336. https://doi.org/10.1101/gr.263202.120
Download PDF Please log-in to or register for your personal account in order to access PDF Cite Article Permissions Share
cover of Genome Research Vol 36 Issue 4
Current Issue:

Abstract

Recent evidence from proteomics and deep massively parallel sequencing studies have revealed that eukaryotic genomes contain substantial numbers of as-yet-uncharacterized open reading frames (ORFs). We define these uncharacterized ORFs as novel ORFs (nORFs). nORFs in humans are mostly under 100 codons and are found in diverse regions of the genome, including in long noncoding RNAs, pseudogenes, 3′ UTRs, 5′ UTRs, and alternative reading frames of canonical protein coding exons. There is therefore a pressing need to evaluate the potential functional importance of these unannotated transcripts and proteins in biological pathways and human disease on a larger scale, rather than one at a time. In this study, we outline the creation of a valuable nORFs data set with experimental evidence of translation for the community, use measures of heritability and selection that reveal signals for functional importance, and show the potential implications for functional interpretation of genetic variants in nORFs. Our results indicate that some variants that were previously classified as being benign or of uncertain significance may have to be reinterpreted.

Loading
Loading
Back to top