Genome-wide detection of human 5' UTR variants that impact protein translation.
| Publication Type | Academic Article |
| Authors | Chaldebas M, Ponsin K, Bohlen J, Conil C, Mourelatos H, Stenson P, Cooper D, Abel L, Casanova J, Cobat A, Zhang P |
| Journal | Am J Hum Genet |
| Volume | 113 |
| Issue | 4 |
| Pagination | 809-827 |
| Date Published | 03/24/2026 |
| ISSN | 1537-6605 |
| Keywords | 5' Untranslated Regions, Protein Biosynthesis, Genome, Human, Genetic Variation |
| Abstract | The 5' untranslated region (5' UTR) of messenger RNAs (mRNAs) plays a central role in regulating protein synthesis initiation, particularly through the Kozak sequence and upstream open reading frames (uORFs). Genetic variants within these regulatory elements could affect translation, altering gene expression and contributing to clinical phenotypes in humans. We developed a computational method called 5ULTRA (5' Untranslated Region Annotation) for analysis of whole-exome sequencing and whole-genome sequencing data to detect, annotate, and prioritize 5' UTR variants with potential translation impact. 5ULTRA identifies single-nucleotide variants, indels, and splicing variants that affect uORFs by creating or disrupting start/stop codons and that alter Kozak sequence strength of either the uORFs or the main coding sequence. 5ULTRA incorporates recent uORF databases and provides comprehensive annotations. 5ULTRA implements a machine-learning score to prioritize candidate variants with predicted effects on translation and also provides specific mechanistic predictions. The score correlates strongly with experimentally measured protein-level effects of 5' UTR variants. We applied 5ULTRA to multiple genetics datasets across diverse disease contexts, identifying candidate variants including potential cancer-driving somatic mutations predicted to decrease ABI1 level or increase NRAS abundance; common variants associated with traits such as multiple sclerosis, lung function, and cardiovascular function, by altering protein levels of TAGAP, VRTN, and SPAAR, respectively; and rare germline variants in our cohort, including a splicing variant of RPSA leading to 5' UTR sequence alteration that causes congenital asplenia and a variant of TNF that could predispose to tuberculosis. |
| DOI | 10.1016/j.ajhg.2026.02.020 |
| PubMed ID | 41881026 |
| PubMed Central ID | PMC13087467 |