Genotype-Phenotype Taxonomy of Hypertrophic Cardiomyopathy.
Curran L., de Marvao A., Inglese P., McGurk KA., Schiratti P-R., Clement A., Zheng SL., Li S., Pua CJ., Shah M., Jafari M., Theotokis P., Buchan RJ., Jurgens SJ., Raphael CE., Baksi AJ., Pantazis A., Halliday BP., Pennell DJ., Bai W., Chin CWL., Tadros R., Bezzina CR., Watkins H., Cook SA., Prasad SK., Ware JS., O'Regan DP.
BACKGROUND: Hypertrophic cardiomyopathy (HCM) is an important cause of sudden cardiac death associated with heterogeneous phenotypes, but there is no systematic framework for classifying morphology or assessing associated risks. Here, we quantitatively survey genotype-phenotype associations in HCM to derive a data-driven taxonomy of disease expression.
METHODS: We enrolled 436 patients with HCM (median age, 60 years; 28.8% women) with clinical, genetic, and imaging data. An independent cohort of 60 patients with HCM from Singapore (median age, 59 years; 11% women) and a reference population from the UK Biobank (n=16 691; mean age, 55 years; 52.5% women) were also recruited. We used machine learning to analyze the 3-dimensional structure of the left ventricle from cardiac magnetic resonance imaging and build a tree-based classification of HCM phenotypes. Genotype and mortality risk distributions were projected on the tree.
RESULTS: Carriers of pathogenic or likely pathogenic variants for HCM had lower left ventricular mass, but greater basal septal hypertrophy, with reduced life span (mean follow-up, 9.9 years) compared with genotype-negative individuals (hazard ratio, 2.66 [95% CI, 1.42-4.96]; P<0.002). Four main phenotypic branches were identified using unsupervised learning of 3-dimensional shape: (1) nonsarcomeric hypertrophy with coexisting hypertension; (2) diffuse and basal asymmetrical hypertrophy associated with outflow tract obstruction; (3) isolated basal hypertrophy; and (4) milder nonobstructive hypertrophy enriched for familial sarcomeric HCM (odds ratio for pathogenic or likely pathogenic variants, 2.18 [95% CI, 1.93-2.28]; P=0.0001). Polygenic risk for HCM was also associated with different patterns and degrees of disease expression. The model was generalizable to an independent cohort (trustworthiness, M1: 0.86-0.88).
CONCLUSIONS: We report a data-driven taxonomy of HCM for identifying groups of patients with similar morphology while preserving a continuum of disease severity, genetic risk, and outcomes. This approach will be of value in understanding the causes and consequences of disease diversity.
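The trustworthiness score quoted in the results can be illustrated with a minimal sketch. It assumes that M1 denotes the standard neighborhood-preservation measure for low-dimensional embeddings, which penalizes points that appear among the k nearest neighbors in the embedding but not in the original space. The abstract does not state the neighborhood size, the distance metric, or the software used, so the function name, the Euclidean metric, and the toy data below are illustrative assumptions, not the authors' pipeline.

```python
import math


def _neighbor_ranks(points, i):
    """Map each other index j to its distance rank from point i (1 = nearest)."""
    order = sorted(
        (j for j in range(len(points)) if j != i),
        key=lambda j: math.dist(points[i], points[j]),
    )
    return {j: rank for rank, j in enumerate(order, start=1)}


def trustworthiness(original, embedded, k):
    """Trustworthiness of an embedding for neighborhood size k (hypothetical helper).

    Returns 1.0 when every k-nearest neighbor in the embedding is also a
    k-nearest neighbor in the original space; lower values indicate that
    the embedding introduces false neighbors.
    """
    n = len(original)
    if len(embedded) != n:
        raise ValueError("original and embedded must have the same length")
    if not 0 < k < n / 2:
        raise ValueError("k must satisfy 0 < k < n / 2")

    penalty = 0
    for i in range(n):
        rank_original = _neighbor_ranks(original, i)
        rank_embedded = _neighbor_ranks(embedded, i)
        for j, r_emb in rank_embedded.items():
            # j is a neighbor in the embedding but not in the original space
            if r_emb <= k and rank_original[j] > k:
                penalty += rank_original[j] - k

    return 1.0 - 2.0 * penalty / (n * k * (2 * n - 3 * k - 1))


# Toy data (assumed for illustration): six points on a line.
original = [(0.0,), (1.0,), (2.0,), (3.0,), (4.0,), (5.0,)]
faithful = [(0.0,), (1.0,), (2.0,), (3.0,), (4.0,), (5.0,)]
scrambled = [(0.0,), (5.0,), (1.0,), (4.0,), (2.0,), (3.0,)]

score_faithful = trustworthiness(original, faithful, k=2)
score_scrambled = trustworthiness(original, scrambled, k=2)
```

Under this definition, a score near 1 for the independent cohort would indicate that patients placed close together by the model were also close in the original shape space.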