Abstract
Structural variation and single-nucleotide variation of the complement factor H (CFH) gene family underlie several complex genetic diseases, including age-related macular degeneration (AMD) and atypical hemolytic uremic syndrome (AHUS). To understand its diversity and evolution, we performed high-quality sequencing of this ∼360-kbp locus in six primate lineages, including multiple human haplotypes. Comparative sequence analyses reveal two distinct periods of gene duplication leading to the emergence of four CFH-related (CFHR) gene paralogs (CFHR2 and CFHR4 ∼25–35 Mya and CFHR1 and CFHR3 ∼7–13 Mya). Remarkably, all evolutionary breakpoints share a common ∼4.8-kbp segment corresponding to an ancestral CFHR gene promoter that has expanded independently throughout primate evolution. This segment is recurrently reused and juxtaposed with a donor duplication containing exons 8 and 9 from ancestral CFH, creating four CFHR fusion genes that include lineage-specific members of the gene family. Combined analysis of >5,000 AMD cases and controls identifies a significant burden of a rare missense mutation that clusters at the N terminus of CFH [P = 5.81 × 10−8, odds ratio (OR) = 9.8 (3.67-Infinity)]. A bipolar clustering pattern of rare nonsynonymous mutations in patients with AMD (P < 10−3) and AHUS (P = 0.0079) maps to functional domains that show evidence of positive selection during primate evolution. Our structural variation analysis in >2,400 individuals reveals five recurrent rearrangement breakpoints that show variable frequency among AMD cases and controls. These data suggest a dynamic and recurrent pattern of mutation critical to the emergence of new CFHR genes but also in the predisposition to complex human genetic disease phenotypes.
Original language | English |
---|---|
Pages (from-to) | E4433-E4442 |
Number of pages | 10 |
Journal | Proceedings of the National Academy of Sciences of the United States of America |
Volume | 115 |
Issue number | 19 |
Early online date | 23 Apr 2018 |
DOIs | |
Publication status | Published - 8 May 2018 |
Bibliographical note
Data deposition: The data reported in this paper have been deposited as a National Center for Biotechnology Information BioProject (accession no. PRJNA401648).Author contributions: S.C. and E.E.E. designed research; S.C., C.B., L.H., K.P., K.M.M., M.S., A.E.W., V.D., T.A.G.-L., and R.K.W. performed research; S.C., J.H., C.B., L.H., K.P., K.M.M., M.S., A.E.W., V.D., F.G., A.J.R., R.H.G., T.A.G.-L., R.K.W., B.H.F.W., P.N.B., R.A., and E.E.E. contributed new reagents/analytic tools; S.C., B.J.N., J.H., and E.E.E. analyzed data; and S.C., B.J.N., and E.E.E. wrote the paper.
Keywords
- Age-related macular degeneration
- AMD
- CFH gene family
- Natural selection
- Structural variation