Three-dimensional Cardiovascular Imaging-Genetics: A Mass Univariate Framework
Carlo Biffi, Antonio de Marvao, Mark I. Attard, Timothy J.W. Dawes,, Nicola Whiffin, Wenjia Bai, Wenzhe Shi, Catherine Francis, Hannah Meyer,, Rachel Buchan, Stuart A. Cook, Daniel Rueckert, Declan P. O'Regan

TL;DR
This paper introduces a 3D imaging-genetics framework that uses high-resolution cardiac MRI and mass univariate regression to map genetic influences on heart structure, enhancing statistical power and automation for large cohorts.
Contribution
It presents a novel 3D shape modeling and analysis framework for genotype-phenotype mapping in cardiac imaging, with improved power and false discovery control.
Findings
Enhanced statistical power for genotype-phenotype associations
Automatic analysis pipeline for large cohorts
Good control of false discovery rate
Abstract
MOTIVATION: Left ventricular (LV) hypertrophy is a strong predictor of cardiovascular outcomes, but its genetic regulation remains largely unexplained. Conventional phenotyping relies on manual calculation of LV mass and wall thickness, but advanced cardiac image analysis presents an opportunity for high-throughput mapping of genotype-phenotype associations in three dimensions (3D). RESULTS: High-resolution cardiac magnetic resonance images were automatically segmented in 1,124 healthy volunteers to create a 3D shape model of the heart. Mass univariate regression was used to plot a 3D effect-size map for the association between wall thickness and a set of predictors at each vertex in the mesh. The vertices where a significant effect exists were determined by applying threshold-free cluster enhancement to boost areas of signal with spatial contiguity. Experiments on simulated phenotypic…
| Full Cohort () | Males () | Females() | |
|---|---|---|---|
| Age [] | 43.4 13.3 (19-77) | 43.2 13.0 (19-77) | 43.5 13.2 (20-77) |
| BSA [] | 1.84 0.2 | 1.98 0.16 | 1.72 0.14 |
| SBP [] | 119.3 14 | 125.0 12.7 | 114.65 13.2 |
| beta | p-value | |
| rs409045 | 0.06 | 0.17 |
| rs6450415 | 0.01 | 0.75 |
| rs1833534 | -0.05 | 0.43 |
| rs6961069 | -0.01 | 0.96 |
| rs10499859 | 0.01 | 0.84 |
| rs10483186 | 0.01 | 0.74 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Three-dimensional Cardiovascular Imaging-Genetics:
A Mass Univariate Framework
Carlo Biffi
Department of Computing, Imperial College London, South Kensington Campus, London, UK.
MRC London Institute of Medical Sciences, Faculty of Medicine, Imperial College London, Hammersmith Hospital Campus, London, UK.
Antonio de Marvao
MRC London Institute of Medical Sciences, Faculty of Medicine, Imperial College London, Hammersmith Hospital Campus, London, UK.
Mark I. Attard
MRC London Institute of Medical Sciences, Faculty of Medicine, Imperial College London, Hammersmith Hospital Campus, London, UK.
Timothy J.W. Dawes
MRC London Institute of Medical Sciences, Faculty of Medicine, Imperial College London, Hammersmith Hospital Campus, London, UK.
Nicola Whiffin
MRC London Institute of Medical Sciences, Faculty of Medicine, Imperial College London, Hammersmith Hospital Campus, London, UK.
National Heart and Lung Institute, Imperial College London, London, UK.
NIHR Royal Brompton Cardiovascular Biomedical Research Unit, London, UK.
Wenjia Bai
Department of Computing, Imperial College London, South Kensington Campus, London, UK.
Wenzhe Shi
Department of Computing, Imperial College London, South Kensington Campus, London, UK.
Catherine Francis
National Heart and Lung Institute, Imperial College London, London, UK.
NIHR Royal Brompton Cardiovascular Biomedical Research Unit, London, UK.
Hannah Meyer
European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK.
Rachel Buchan
National Heart and Lung Institute, Imperial College London, London, UK.
NIHR Royal Brompton Cardiovascular Biomedical Research Unit, London, UK.
Stuart A. Cook
MRC London Institute of Medical Sciences, Faculty of Medicine, Imperial College London, Hammersmith Hospital Campus, London, UK.
National Heart and Lung Institute, Imperial College London, London, UK.
NIHR Royal Brompton Cardiovascular Biomedical Research Unit, London, UK.
National Heart Centre Singapore, Singapore.
Duke National University Singapore, Singapore.
Daniel Rueckert
Department of Computing, Imperial College London, South Kensington Campus, London, UK.
Declan P. O’Regan
MRC London Institute of Medical Sciences, Faculty of Medicine, Imperial College London, Hammersmith Hospital Campus, London, UK.
Abstract
Background: Left ventricular (LV) hypertrophy is a strong predictor of cardiovascular outcomes, but its genetic regulation remains largely unexplained. Conventional phenotyping relies on manual calculation of LV mass and wall thickness, but advanced cardiac image analysis presents an opportunity for high-throughput mapping of genotype-phenotype associations in three dimensions (3D). Methods: High-resolution cardiac magnetic resonance images were automatically segmented in 1,124 healthy volunteers to create a 3D shape model of the heart. Mass univariate regression was used to plot a 3D effect-size map for the association between wall thickness and a set of predictors at each vertex in the mesh. The vertices where a significant effect exists were determined by applying threshold-free cluster enhancement to boost areas of signal with spatial contiguity. Results: Experiments on simulated phenotypic signals and SNP replication show that this approach offers a substantial gain in statistical power for cardiac genotype-phenotype associations while providing good control of the false discovery rate. Conclusion: This framework models the effects of genetic variation throughout the heart and can be automatically applied to large population cohorts. Availability: The proposed approach has been coded in an R package freely available at https://doi.org/10.5281/zenodo.834610 together with the clinical data used in this work.
imaging-genetics, statistical parametric mapping, computational cardiac atlas.
††preprint: AIP/123-QED
I Introduction
One of the most complex unanswered questions in cardiovascular biology is how genetic and environmental factors influence the structure and function of the heart as a three-dimensional (3D) structure (Li et al., 2016). This is relevant for understanding the penetrance and expressivity of variants associated with inherited cardiac conditions as well as exploring the biology of heart development and within-population variation. Cardiac magnetic resonance (CMR) is the gold-standard for quantitative imaging (Hundley et al., 2010), providing a rich source of anatomic and motion-based data, however conventional phenotyping relies on manual analysis reducing the variables of interest to global volumes and mass. Computational image analysis, by which machine learning is used to annotate and segment the images, is gaining traction as a means of representing detailed 3D phenotypic variation at thousands of vertices in a standardized coordinate space (FIG. 1) (Young and Frangi, 2009; Fonseca et al., 2011).
One approach to inference is to transform the spatially-correlated data into a smaller number of uncorrelated principal components (Medrano-Gracia et al., 2015), however these modes would not provide an explicit model relating genotype to phenotype. A more powerful approach may be to derive a statistic expressing evidence of a given effect at each vertex of the 3D model, hence creating a so-called statistical parametric map - a concept widely used in functional neuroimaging (Friston, 2007). In this paper we extend techniques developed in the neuroscience domain to cardiovascular imaging-genetics by implementing a mass univariate framework to map associations between genetic variation and a 3D phenotype. Such an approach would provide overly-conservative inferences without considering spatial dependencies in the underlying data and so we validated the translation of threshold-free cluster-enhancement (TFCE) to cardiovascular phenotypes for the sensitive detection of coherent signal (Smith and Nichols, 2009) as well as implementing robust control for multiple testing. The feasibility of the proposed methodology to derive computationally-efficient inferences on imaging-genetics datasets has been tested through experiments on clinical and synthetic data using an R package developed for this purpose.
II Methods
II.1 Study population
The healthy volunteers dataset used in this study is part of the UK Digital Heart Project at Imperial College London (Bai et al., 2015) (see Appendix A for full cohort characteristic and acquisition details). To capture the whole-heart phenotype, a high-spatial resolution 3D balanced steady-state free precession cine sequence was performed on a 1.5-T Philips Achieva system (Best, the Netherlands). Images were stored on an open-source database (MRIdb, Imperial College London, UK) (Woodbridge et al., 2013). Conventional volumetric analysis of the cine images was performed using CMRtools (Cardiovascular Imaging Solutions, London, UK) following a standard protocol (Schulz-Menger et al., 2013).
Genotyping of common variants was carried out using an Illumina HumanOmniExpress-12v1-1 single nucleotide polymorphism (SNP) array (Sanger Institute, Cambridge). Clustering, calling and scoring of SNPs was performed using Illumina GenCall software. Samples were pre-phased with SHAPEIT (Delaneau et al., 2013) and imputation was performed using IMPUTE2 (Howie et al., 2009) with the UK10K dataset as a reference (www.uk10k.org). Quality of the genotypes was evaluated both on a per-individual and per-marker level using in-house Perl scripts. SNPs were removed if they had a Impute Information (INFO) score , missing call rate in more than 1 of samples, minor allele frequency of less than 1 or deviated significantly from Hardy-Weinberg equilibrium (). Only non-related individuals with ”CEU” ethnicity were retained. The total genotyping rate in these individuals was 0.997 and the total number of variants available was 9.4 million.
II.2 Atlas-based segmentation and co-registration
All image processing was performed with Matlab (MathWorks, Natick, Mass). A validated cardiac segmentation and co-registration framework was used which has previously been described in detail (Bai et al., 2015; de Marvao et al., 2015). A 3D shape model (at end-diastole and end-systole) was created encoding phenotypic variation in our study population at 49,876 epicardial vertices and visualised in a standard coordinate space (FIG. 1) (Bai et al., 2015). Wall thickness (WT) was measured by computing the distance between respective vertices on the endocardial and epicardial surfaces at end-diastole.
II.3 Overview of the approach
In the following sections we introduce a framework for deriving associations between clinical/genetic parameters and a 3D cardiac phenotype which is outlined in FIG. 2. Briefly, a general linear model is fitted at each vertex to extract the regression coefficient associated with the variable of interest (mass univariate regression). Threshold-free cluster enhancement (TFCE) is then applied to boost belief in extended areas of coherent signal in the derived vertex-wise statistical map. The points where a significant effect exists are determined by applying TFCE on the obtained t-statistic map and on t-statistic maps obtained through permutation testing, derived under the null hypothesis of no effect. Then, at each vertex the frequentist probability of having obtained a higher TFCE score by chance is regarded as the p-value related to the regression coefficient . Finally, the derived p-values are adjusted for multiple testing. The permutation testing procedure employed by this approach is the Freedman-Lane procedure (Freedman and Lane, 1983), whilst a false discovery rate (FDR) correction using the Benjamini-Hochberg procedure (Benjamin and Hochberg, 1995) is applied to correct for multiple testing.
II.4 Mass univariate analysis
The association between a ventricular phenotype mapped onto a 3D mesh and one or more clinical variables can be described using a general linear model of the form , where is a vector of, for example, WT values at each vertex and is a design matrix that can be used to model the effect of interest and contains in each column the subject’s values of clinical co-variates as well as the intercept term. These variables can be numerical (such as age or weight), categorical or expressing interaction between them. In particular, categorical variables can be exploited to express either different categories (such as gender or ethnicity) or the presence/absence in a binary form of a clinical condition (such as the presence of genetic mutation or a specific disease). is the regression coefficient vector to be estimated and represents the noise or error term, which is assumed to be a zero-mean Gaussian process and represents the variability of not explained by the model. The regression coefficient can be standardized by normalizing to mean 0 and unit-variance the columns of and . As a result, will represent the amount of variation of in units of its standard deviation when is increased by one standard deviation, allowing comparisons between variables.
The same model can be fitted at each ventricular vertex independently (mass univariate regression) and statistics can be extracted and corrected for multiple testing in order to test one or more statistical hypotheses. In a parametric framework, the t-statistic computed as is typically used in the neuroimaging literature to contrast the null hypothesis (no association between the predictor and the phenotype under study), where is the standard error of the estimator of (Friston, 2007). The regression coefficients and their related p-value thus obtained can be plotted to display, at high resolution on the whole 3D ventricle, the magnitude and spatial distribution of a given association. However, this approach underestimates associations where the signal is more spatially correlated than noise coherence. For this reason non-parametric statistics such as TFCE are valuable to increase the statistical power of the approach.
II.5 Threshold-free cluster enhancement on a cardiac atlas
The value of a statistic h obtained through mass univariate regression at a vertex p - a t-statistic in our context - is transformed by TFCE using the following integral:
[TABLE]
In the equation is the value of the vertex statistic, is the extent of the cluster with cluster-forming threshold that contains , and and are two parameters usually set to 0.5 and 2 for empirical and analytical motivations (Smith and Nichols, 2009). In computational algorithms the integral is estimated using a discrete sum. The computational model of the heart is defined as a 3D mesh composed of non-congruent triangles where at each vertex pointwise phenotypic variables are stored for each subject. The translation of TFCE to a cardiac model is not straightforward as the model is not composed of a regular grid of voxels (as in brain imaging applications) but instead forms a mesh of vertices. We addressed this problem by associating an area to each vertex i equivalent to the mesh area closest to that vertex. In computing Eq. 1 at each vertex, the most time consuming part is deriving - the area of all the elements connected to p that have a statistic value greater or equal to - as a different needs to be computed for each vertex of the mesh and for each term of the sum. However, the TFCE score associated to a vertex of a specific in the summation consists of the same score that should be associated with all the vertices which contribute to . Therefore, the computational time of the TFCE method can be significantly reduced by sampling the interval between the maximum and minimum statistic in the statistic map so as to use each sampled value as a threshold for the selection of vertices with a greater statistic value in the case of positive threshold, or less than in the case of a negative threshold. The edges of the graph are defined from a list containing the nearest neighbours of each vertex, and which is filtered at each iteration to contain only the vertices selected by the thresholding operation, resulting in one or more graphs of connected vertices. In this way, all the possible patterns of signal on the ventricle can be discovered without relying on assumptions about the geometry of cluster shapes. For all the obtained graphs including more than two vertices the TFCE score can be computed and associated to all the vertices that belong to them. The final TFCE score is the sum of all the TFCE scores thus obtained.
II.6 Permutation testing
The p-value associated with the regression coefficient computed at each atlas vertex can be derived via permutation testing. In particular, by permuting times the input data, TFCE scores maps can be obtained. It is important that the adopted permutation strategy guarantees the exchangeability assumption, i.e. permutations of given do not alter the joint distribution of the dependent variables under the null hypothesis. In the proposed context, the values themselves are not exchangeable under the null hypothesis, as the predictors included in the model together with the variable under study are nuisance variables that could explain some portion of variability of . In order to address this problem, among a number of available techniques, the Freedman-Lane procedure (Freedman and Lane, 1983) has proved to provide the best control of statistical power and false positives (type 1 error) (Winkler et al., 2014). This procedure proceeds as follows. If contains all the nuisance variables previously contained in , the general linear model can be rewritten as . Then, instead of permuting and extracting , the procedure computes the residual-forming matrix and performs different permutations by computing the model at each point, where is the permutation operator. For a full derivation of the Freedman-Lane strategy see Winkler et al. (Winkler et al., 2014).
II.7 False discovery rate correction for multiple comparisons
A multiple testing problem arises by testing tens of thousands of statistical hypotheses simultaneously. Control of the family wise error rate at 5% could be derived by extracting the maximum score from each map derived via permutation testing and by using the 95th percentile as a threshold for significance. However, in this context such a correction could be overly conservative as we are rarely interested in the exact number of vertices that reach significance. The main goal is to detect extended areas of coherent signal and therefore we can accept a maximum fixed percentage of false discoveries as provided by false discovery rate (FDR) procedures. In particular, these procedures can be applied to adjust the voxelwise p-values obtained at each vertex by computing the ratio between the number of times in which a TFCE score greater than the measured one has been obtained and the number of permutations . We have found adaptive procedures such as the two-stage Benjamini-Hochberg (Benjamini et al., 2006) not suitable for our dataset, since it led to lower p-values and increased areas of significance, as also reported in the neuroimaging literature (Reiss et al., 2012). For this reason, the original Benjamini-Hochberg (BH) (Benjamin and Hochberg, 1995) procedure has been employed for this work. It is important to underline that both FDR correction procedures are valid when the tested hypotheses are independent or satisfy a technical definition of dependence called positive regression dependency on a subset (Benjamini and Yekutieli, 2001). This condition for Gaussian data is translated into the requirement that the correlation between null voxels or between null and signal voxels is zero or positive, and for smoothed image data as those that compose a cardiac atlas, this assumption is generally considered satisfied (Friston, 2007).
II.8 Software
The proposed mass univariate framework has been coded as an R package (mutools3D) which benefits from the use of vectorized operations. Matrices containing the phenotypic data and templates to visualise the 3D models are also available with the software. Linear regression assumptions must be met in order to obtain reliable inferences (for a review of them and their importance in a mass univariate setting see Appendix B). Particularly important in this context are multicollinearity and heteroscedasticity problems which should be checked and solved for each model definition. For this latter, the R package implements mass univariate functions exploiting HC4m heteroscedascity consistent estimators (Cribari-Neto and da Silva, 2011).
III Results
III.1 GWAS Replication Study
As an exemplar application, 6 out of 9 exonic SNPs which have previously shown an association with LV mass in a case-control genome wide association study (GWAS), using echocardiography for phenotyping (Arnett et al., 2009), were also identified in the UK Digital Heart Project genotypes and were assessed for replication. For each SNP, WT at each vertex in the 3D model in 1,124 healthy Caucasian subjects was tested for association with the posterior estimate of the allele frequency by a regression model adjusted for age, gender, body surface area (BSA) and systolic blood pressure (SBP). The tested SNPs are rs409045, rs6450415, rs1833534, rs6961069, rs10499859 and rs10483186 and cohort characteristics are reported in Appendix E. Regression diagnosis through Breush-Pagan and White’s test showed how the homoscedasticity assumption was violated at a large number of vertices, therefore mass univariate regression was corrected using HC4m heteroscedascity consistent estimators (Cribari-Neto and da Silva, 2011). Regarding the assumption of multicollinearity, the condition number of the model matrix was 2.19 while the variance inflation factor was equal to 1.06, suggesting a very low degree of multicollinearity. All the simulations were executed on a high performance computer (Intel Xeon Quad-Core Processor (30M Cache, 2.40 GHz), 36Gb RAM), using the analysis pipeline and R package proposed in this paper (FIG. 2). A multiple comparisons procedure correcting for the number of vertices and the number of SNPs tested was applied by simultaneously testing in a BH FDR-controlling procedure all the TFCE-derived p-values from all the models as suggested in (Benjamini and Yekutieli, 2005). The number of permutations was fixed to 10,000 and simulations required less than 3 hours each. Finally, as a result of a preliminary study we conducted (full details in Appendix C), TFCE parameters E and H were set to 0.5 and 2, as suggested in the original TFCE paper (Smith and Nichols, 2009), since this choice provides good sensitivity and specificity on a range of synthetic signals.
Four SNPs showed a significant association with WT as reported in FIG. 3. These are rs409045 (maximum regression coefficient , percentage of the LV area significant ), rs6450415 (, ), rs6961069 (, ) and rs10499859 (, ). Conventional linear regression analysis using LV mass and the same model for all the SNPs did not reach significance (Appendix E).
III.2 Assessment of Sensitivity, Specificity and False Discovery Rate on Synthetic Data
Sensitivity, specificity and the rate of false discoveries of the proposed pipeline were estimated using synthetic data. A 3D model showing no correlation between WT and the posterior estimate of the allele frequency of an non-associated SNP (rs4288653) adjusted for age, gender, BSA and SBP was used to generate background noise. A synthetic data signal was generated by summing to the WT values of each subject a term at each vertex, where is the signal intensity and is a map of regression coefficients. Two contrasting maps (signal A and B) obtained from real clinical data were chosen and are available in *Appendix *. Signal A was characterized by non-null coefficients covering the 10% of total area of the LV and scaled to the (0,1] range, while signal B presented non-null regression coefficients scaled to the [-1,0) range in a more extended area covering the 60% of the LV surface. By subsampling the number of subjects and the signal intensity , different signals to be detected by the proposed standard mass univariate pipeline were obtained. The number of permutations for each simulation was fixed to 5,000 and results were linearly interpolated and plotted on the contour plots shown in FIG. 4.
Sensitivity increased at larger sample sizes and signal intensities , reaching the greatest values with the most extended signal (signal B) as expected. Given the sample size of our GWAS replication study and the intensity of the associations found, these results would assign a sensitivity of 70% for the first two discovered SNPs and more than 90% for the other two. Moreover, the rate of false discoveries was 0 for all the results of signal A and below 5% except for few simulations involving signal B and sample sizes greater than 1,600 (results reported in Appendix F). This effect is due to the large synthetic signal extension, which causes TFCE to extend its support to vertices near the true signal which show the same direction of effect. This is not considered a major limitation as TFCE will not enhance clusters that originate only from noise. Finally, sensitivity, specificity and the rate of false discoveries were also computed using our pipeline without TFCE - which showed how application of TFCE provides a relevant increase of up to 50% in sensitivity which only comes at the expense of a small decrease in specificity on large extended signals (Appendix F).
Finally, we have performed a comparative study between TFCE and a standard cluster-based thresholding method as reported in the original TFCE paper on brain imaging data (Smith and Nichols, 2009), which has been implemented in the proposed R package (Appendix D). Overall, in agreement with the neuroimaging literature, the sensitivity of the cluster-extent based thresholding method was lower than TFCE and proved to be very dependent on the cluster-forming threshold. Moreover, higher false discovery rates and lower specificities characterised cluster-extent based thresholding results in all cases when their sensitivity was comparable or greater than TFCE.
IV Discussion
The environmental and genetic determinants of cardiac physiology and function, especially in the earliest stages of disease, remain poorly characterized and morphological classification relies on one-dimensional metrics derived by manual image segmentation (Khouri et al., 2010). In contrast, computational cardiac analysis provides precise 3D quantification of shape and motion differences between disease groups and normal subjects (Medrano-Gracia et al., 2013). We have extended the application of these techniques by designing a general linear model framework that provides a powerful approach for modelling the relationship between phenotypic traits, genetic variation and environmental factors using high-fidelity 3D representations of the heart. By translating statistical parametric mapping techniques originally developed for brain mapping to the cardiovascular domain we exploit spatial dependencies in the data to identify coherent areas of biological effect in the myocardium. This framework also accounts for multiple testing correction at tens of thousands of vertices which is the main drawback of this class of techniques. In particular, the application of TFCE leads to a notable increase in power of the mass univariate approach at the expense of only a slight increase of the false discovery rate in large extended signals.
Genetic association studies using conventional 2D imaging leave much of the moderate heritability of LV mass unexplained (Vasan et al., 2009; Fox et al., 2013; Semsarian et al., 2015). One contribution may be the lack of phenotyping power of conventional imaging metrics, which require manual analysis and are insensitive to regional patterns of hypertrophy. Our simulations on synthetic data show that our approach has the power to detect anatomical regions associated with even small genetic effect sizes. In the reported exemplar application, we replicated the effect of four SNPs discovered in a GWAS for LV mass using a 3D WT phenotype with TFCE applied, while none of the SNPs replicated with conventional LV mass analysis. The genotype-phenotype associations that we report reflect that cardiac geometry is a complex phenotype with a highly polygenic architecture dependent on anatomical patterns of gene expression and spatially-varying adaptations to haemodynamic conditions (Wild et al., 2017; Srivastava and Olson, 2000).
One of the main limitations of the presented framework is that high-spatial resolution CMR is not available in all cohorts, although conventional two-dimensional images may be super-resolved to provide similar shape models (Oktay et al., 2017). A second limitation is that the true association may not be linear in the model parameters and nonlinear models could better fit the data. However, the advantages favouring a general linear model are its simplicity, the ability to easily design and adjust the results for multiple factors and its wide use in biomedical statistics. A third limitation of this work is with regards to the experiments using synthetic data as we only assessed noise in our single centre population and did not generalise this to other cohorts. A general limitation of these approaches is that they do not establish causal relationships, such as the interaction between genetic variants, blood pressure and LV mass, although this may be addressed in future work by Mendelian randomisation.
Mass univariate approaches do not directly consider the local covariance structure of the data, however this is accounted for when Random Field Theory or permutation tests define a threshold for significant activation (Bronzino and Peterson, 2014). In the neuroimaging literature, in the context of brain-wide candidate-SNP analyses, mass univariate approaches are used more extensively than multivariate approaches as the latter are less sensitive to regional effects and they require more observations than the dimension of the response variable (i.e. number of vertices) or the use of dimensionality reduction techniques (Friston, 2007).
As the methods are computationally-efficient and require no human input for phenotypic analysis, it is feasible to scale up the pipeline to larger population cohorts such as UK Biobank, which aims to investigate up to 100,000 participants using MR imaging (Petersen et al., 2013). Applying these concepts to revealing the effect of rare variants on LV geometry in participants without overt cardiomyopathy (Schafer et al., 2017) and to vertex-wise genome-wide analyses also represent an interesting area of future work. In this latter context, multivariate approaches may show promise for modelling high-dimensional imaging and genetic data (Vounou et al., 2012; Liu and Calhoun, 2014). Finally, while we have focused on LV geometry and shape, the same approach can be applied to time-resolved vertex-wise data to create a functional phenotype for regression modelling.
V Conclusion
We report a powerful and flexible framework for statistical parametric modelling of 3D cardiac atlases, encoding multiple phenotypic traits, which offers a substantial gain in power with robust inferences. We have implemented and validated the approach on both synthetic and clinical datasets, showing its suitability for detecting genotype-phenotype interactions on LV geometry. More generally, the proposed method can be applied to population-based studies to increase our understanding of the physiological, genetic and environmental effects on cardiac structure and function.
Acknowledgements.
The authors thank Dr. James Ware for advice on GWAS replication; our radiographers Ben Statton, Marina Quinlan and Alaine Berry; our research nurses Tamara Diamond and Laura Monje Garcia; and the study participants.
Funding
The study was supported by the Medical Research Council, UK; National Institute for Health Research (NIHR) Biomedical Research Centre based at Imperial College Healthcare NHS Trust and Imperial College London, UK; British Heart Foundation grants PG/12/27/29489, NH/17/1/32725, SP/10/10/28431 and RE/13/4/30184; Academy of Medical Sciences Grant (SGL015/1006) and a Wellcome Trust-GSK Fellowship Grant (100211/Z/12/Z).
Appendices
Appendix A UK Digital Heart Project protocols
Participants who had known cardiovascular or metabolic disease or were taking prescription medicines were excluded but simple analgesics, antihistamines, and oral contraceptives were acceptable. Female subjects were excluded if they were pregnant or breastfeeding. Standard safety contraindications to magnetic resonance imaging were applied including a weight limit of 120 kg. All subjects provided written informed consent for participation in the study, which was approved by a research ethics committee. Anthropometric measurements were collected by trained research nurses. Subjects fasted for four hours before the visit took place. Blood pressure was acquired in accordance with the guidelines of the European Society of Hypertension (O’Brien et al., 2002) using a calibrated oscillometric device (Omron M7, Omron Corporation, Kyoto, Japan). Five measures were taken, the first three were discarded and the last two were averaged. Subjects height (Ht) and weight (Wt) were assessed without shoes and body surface area (BSA) was calculated by the Mosteller formula: .
Cardiac MR (CMR) was performed on a 1.5-T Philips Achieva system (Best, the Netherlands). To capture the whole-heart phenotype, a high-spatial resolution 3D balanced steady-state free precession cine sequence was used that assessed the left and right ventricles in their entirety in a single breath-hold (60 sections, repetition time 3.0 ms, echo time 1.5 ms, flip angle 50*∘*, field of view 320 320 112 mm, matrix 160 95, reconstructed voxel size 1.2 1.2 2 mm, 20 cardiac phases, temporal resolution 100 ms, typical breath-hold 20 s). Images were stored on an open-source database (MRIdb, Imperial College London, UK) (Woodbridge et al., 2013). Conventional volumetric analysis of the cine images was performed using CMRtools (Cardiovascular Imaging Solutions, London, UK) following a standard protocol (Schulz-Menger et al., 2013).
Appendix B Gauss-Markov assumptions
In this section we review the statistical assumptions of the general linear model and their importance in a mass univariate context, so as to clarify under which conditions reliable inference can be obtained.
The regression coefficients in the general linear model under study can be obtained via ordinary least squares (OLS): and , where is the OLS estimate of the variance of the observations. According to the Gauss-Markov theorem, the ordinary least square method will be the best linear unbiased estimator (BLUE) of the regression coefficients if its five assumptions are satisfied. Here we report the importance of these assumptions for the approach proposed in this work.
No multicollinearity in X. Multicollinearity is present when a covariate is a linear combination (perfect) or is highly correlated (imperfect) with other covariates. The absence of perfect collinearity is necessary to guarantee that is full rank so that can be inverted when deriving the regression coefficients, assuring their uniqueness. Imperfect multicollinearity results in a reduction of the statistical efficiency of the derived regression coefficients, causing a reduction of power and ambiguous effects (Andrade et al., 1999). In order to diagnose this, low pairwise correlations between predictor variables are not sufficient to exclude multicollinearity with more than two explanatory variables, but they are a necessary condition. The variance inflation factor and the condition number of the design matrix can be instead employed to estimate the amount of variance of each regression coefficient increased because of collinearity and to detect the amount of collinearity respectively (Hair, 2010). Taking together these three latter indices, modifications of the design matrix via omission or orthogonalization of explanatory variables can be considered to correct for multicollinearity. 2. 2.
Random sampling of the population. This assumption is required in order to not introduce bias in the estimates and it is guaranteed when candidates have been randomly selected to participate to the study. 3. 3.
No endogeneity in X. Endogeneity happens when an explanatory variable is correlated with the error term, i.e. . In general, this problem can be due to a model misspecification problem where one or more predictors have not been included into the model, to a measurement error in X that will add bias to the estimation of the regression coefficients, or to a simultaneity error when one column of X is jointly determined with Y. As endogeneity is a conceptual problem, there are no direct statistical tests to verify this assumption, hence the results should be questioned each time. In the context under study, the last source of endogeneity can be considered negligible as imaging and clinical data comes from different sources. Bias due to measurement errors should be considered and addressed in the experimental design definition. Finally, the first assumption implies that the proposed approach can only prove association and not cause-effect relationships. 4. 4.
Linearity and additivity. The relation between dependent and independent variables is required to be linear in the parameters , therefore this assumption requires the model covariates to be correctly specified. More importantly it is required that the independent variables are additive, i.e. the amount of change in Y associated with the increase of one predictor is independent of the values of the other covariates. This latter assumption guarantees the correct interpretation of the obtained regression coefficients, avoiding their overestimation. In our context, non-additivity can be addressed by defining interaction terms of the predictors. Moreover, when interpreting the derived regression coefficients, this assumption highlights again that this approach only shows the presence of associations and not cause-effect relationships. 5. 5.
Homoscedasticity. This assumption requires that the error variance is identical across observations. It can be tested by computing a heteroscedasticity test such as the Breusch-Pagan (for linear heteroscedasticity) or the White’s test (for non-linear heteroscedasticity). Heteroscedasticity causes too wide or too narrow regression coefficient standard errors, giving too much weight to certain subsets of the data when estimating the s. Important sources of this effect rely in physiological or artifactual effects that underlie the measurements and they can be reduced by using either using weighted least squares, log transformations of the data or heteroscedasticity-consistent robust standard errors.
If assumptions 1-4 are valid the estimation of the regression coefficients are unbiased and consistent, and if 5 is also valid it becomes efficient.
Appendix C A study on the effect of the TFCE parameters E and H
The sensitivity of the proposed pipeline using different values of the TFCE parameters E and H were assessed using synthetic data. A 3D model showing no correlation between WT and the posterior estimate of the allele frequency of a SNP (rs4288653) adjusted for age, gender, BSA and SBP was used to generate background noise. A synthetic data signal was generated by summing to the WT values of each subject a term at each vertex, where is the signal intensity and is a map of regression coefficients. Three distinct maps (signal 1, 2 and 3) obtained from real clinical data and characterized by non-null coefficients scaled to the (0,1] range were employed (FIG. 5). These covered the 25%, 50% and 75% of total area of the LV respectively were employed together with three distinct values (0.2, 0.3, 0.4) of signal intensity I. For a given value of the signal intensity I and of the spatial extension S of the synthetic signal, five values of the parameter E and five values of the parameter H were employed by the proposed framework to detect the synthetic signal generated (a total of 25 simulations for each (I,S) couple). The number of subjects N was fixed to 80, the number of permutations for each simulation to 5,000. Sensitivity results were linearly interpolated and normalized for the maximum sensitivity obtained for each (I,S) couple and plotted on the colour plots reported in FIG. 6.
The first row of FIG. 6 shows the sensitivity results obtained at a fixed spatial extension of the generated synthetic signal () and different signal intensities I. It can be noticed how at higher signal intensities the framework sensitivity increases and how on a small extended signal better sensitivity values are obtained when H is higher than E. The second row of FIG. 6 shows the results obtained at a same signal intensity I when increasing the spatial extension S of the generated signal. In particular, in the bottom left figure it can be noticed how the importance of the signal intensity I is still predominant, while with the increase of the signal extension S the relative importance of the parameter E increases. In all the studied cases, the false discovery rate of the framework was always below the and often equal to 0, while the sensitivity was always above .
Overall, in this preliminary study the values of and suggested in the TFCE original paper (Smith and Nichols, 2009) by theoretical and empirical reasons achieved good sensitivity values. However, the performance of other combinations of E and H such as () show promise and will be investigated in future work.
Appendix D A comparison between standard cluster-extent based thresholding and TFCE
A comparison between the proposed framework using TFCE and the proposed framework using a standard cluster-extend based thresholding was performed on the same synthetic data used in the previous section. The latter procedure as proposed in (Friston and others., 1994) has been implemented in the R package developed for this work. This procedure consists of two steps. In the first one, a cluster in a statistical map obtained by mass univariate regression is defined as the group of connected vertices that have a t-statistic value greater than a user-defined threshold . Then, a second threshold is computed via permutation testing as the 95th percentile of the distribution of largest cluster in each permuted map and used to to declare significant the clusters in the original statistical map that are more spatially extended than this threshold . Hence this method depends on the user-defined initial cluster-forming threshold . For this reason, the sensitivity, specificity and FDR of the proposed approach with TFCE parameters and were compared against the results obtained by the same approach using cluster-extent based thresholding with five distinct cluster-forming thresholds () instead of TFCE on the five distinct signals studied in the previous section (FIG. 6). The number of subjects N was again fixed to 80, the number of permutations for each simulation to 5,000 and the graphs obtained are reported in FIG. 7.
The sensitivity of the cluster-extent based thresholding method proved to be very dependent on the cluster-forming threshold and its choice had a large impact on the results. Moreover, higher FDR and lower specificity characterised cluster-extent based thresholding results when their sensitivity was comparable or greater than TFCE. These results therefore favour the use of TFCE over cluster-extent based thresholding as also proved in the brain image analysis literature (Smith and Nichols, 2009).
Appendix E GWAS replication supplementary data
Table 1 contains a summary of the covariates employed in the regression model adopted fr GWAS replication. Table 2 reports the results of the linear regression models used for conventional 2D association analysis between left ventricular mass (LVM) and the posterior estimate of the allele frequency of one SNP adjusted for age, gender, body surface area (BSA) and systolic blood pressure (SBP). Even without multiple testing correction, none of the models reached significance.
Appendix F Sensitivity, Specificity and False Discovery Rate on Synthetic Data
The two regression coefficients () maps used to generate the synthetic data employed in this experiment are reported in FIG. 8. Signal A covers the 10% of the left ventricle and has coefficients scaled between 0 and 1, while signal B covers the 60% of the left ventricle and has coefficients scaled between -1 and 0.
FIG. 9 reports the two maps obtained on signal A and B at different signal intensities (I) and sample sizes (N) for the difference between the sensitivity scored by the proposed pipeline using TFCE and the sensitivity scored by the same pipeline without TFCE. The increase of sensitivity provided by TFCE was higher for signal B due to its larger spatial extension as expected. The difference of sensitivities converged to zero at high I and N as also the sensitivity of the pipeline without TFCE reached 100% sensitivity.
FIG. 10 and FIG. 11 report the detected false discovery rate (FDR) and specificity of the proposed pipeline with or without TFCE. In FIG. 10 the FDR of the proposed pipeline was zero for all the simulated I and N, while it increased for the second signal (signal B - covering the 60% of the ventricle) and exceeded 5% only for few simulations having a sample size N greater than 1,600 (one example is reported in FIG. 11). This effect is due to the large synthetic signal extension, which causes TFCE to reward also the neighbour vertices around the true signal and it is not considered an issue since it cannot cause to declare cluster arisen from noise to be significant (as also shown in FIG. 10).
Overall, the application of TFCE provides a relevant increase in sensitivity which only comes at the expenses of a little decrease of specificity on largely extended signals.
References
- Li et al. (2016) G. Li et al., “Transcriptomic profiling maps anatomically patterned subpopulations among single embryonic cardiac cells,” Dev Cell 39, 491–507 (2016).
- Hundley et al. (2010) W. G. Hundley et al., “ACCF/ACR/AHA/NASCI/SCMR 2010 expert consensus document on cardiovascular magnetic resonance: a report of the American College of Cardiology Foundation Task Force on Expert Consensus Documents,” J Am Coll Cardiol 55, 2614–2662 (2010).
- Young and Frangi (2009) A. A. Young and A. F. Frangi, “Computational cardiac atlases: from patient to population and back,” Exp Physiol 94, 578–596 (2009).
- Fonseca et al. (2011) C. G. Fonseca et al., “The Cardiac Atlas Project–an imaging database for computational modeling and statistical atlases of the heart,” Bioinformatics 27, 2288–2295 (2011).
- Medrano-Gracia et al. (2015) P. Medrano-Gracia et al., “Challenges of cardiac image analysis in large-scale population-based studies,” Curr Cardiol Rep 17, 1–7 (2015).
- Friston (2007) K. J. Friston, Statistical parametric mapping : the analysis of functional brain images (Elsevier/Academic Press, Amsterdam Boston, 2007).
- Smith and Nichols (2009) S. M. Smith and T. E. Nichols, “Threshold-free cluster enhancement: addressing problems of smoothing, threshold dependence and localisation in cluster inference,” NeuroImage 44, 83–98 (2009).
- Bai et al. (2015) W. Bai et al., “A bi-ventricular cardiac atlas built from 1000+ high resolution MR images of healthy subjects and an analysis of shape and motion,” Med Image Anal 26, 133–145 (2015).
- Woodbridge et al. (2013) M. Woodbridge et al., “MRIdb: medical image management for biobank research,” J Digit Imaging 26, 886–890 (2013).
- Schulz-Menger et al. (2013) J. Schulz-Menger et al., “Standardized image interpretation and post processing in cardiovascular magnetic resonance: Society for Cardiovascular Magnetic Resonance (SCMR) board of trustees task force on standardized post processing,” J Cardiovasc Magn Reson 15, 35 (2013).
- Delaneau et al. (2013) O. Delaneau et al., “Improved whole-chromosome phasing for disease and population genetic studies,” Nat Methods 10, 5–6 (2013).
- Howie et al. (2009) B. N. Howie et al., “A flexible and accurate genotype imputation method for the next generation of genome-wide association studies,” PLoS Genet 5, e1000529 (2009).
- de Marvao et al. (2015) A. de Marvao et al., “Precursors of hypertensive heart phenotype develop in healthy adults: a high-resolution 3D MRI study,” JACC Cardiovasc Imaging 8, 1260–1269 (2015).
- Freedman and Lane (1983) D. Freedman and D. Lane, “A nonstochastic interpretation of reported significance levels,” Journal of Business & Economic Statistics 1, 292–298 (1983).
- Benjamin and Hochberg (1995) Y. Benjamin and Y. Hochberg, “Controlling the false discovery rate: a practical and powerful approach to multiple testing,” J Royal Statistical Society, Series B (57) 289 (1995).
- Winkler et al. (2014) A. M. Winkler et al., “Permutation inference for the general linear model,” NeuroImage 92, 381–397 (2014).
- Benjamini et al. (2006) Y. Benjamini et al., “Adaptive linear step-up procedures that control the false discovery rate,” Biometrika 93, 491–507 (2006).
- Reiss et al. (2012) P. T. Reiss et al., “Paradoxical results of adaptive false discovery rate procedures in neuroimaging studies.” NeuroImage 63, 1833–1840 (2012).
- Benjamini and Yekutieli (2001) Y. Benjamini and D. Yekutieli, “The control of the false discovery rate in multiple testing under dependency,” Ann Stat , 1165–1188 (2001).
- Cribari-Neto and da Silva (2011) F. Cribari-Neto and W. B. da Silva, “A new heteroskedasticity-consistent covariance matrix estimator for the linear regression model,” AStA Adv Stat Anal 95, 129–146 (2011).
- Arnett et al. (2009) D. K. Arnett et al., “Genome-wide association study identifies single-nucleotide polymorphism in kcnb1 associated with left ventricular mass in humans: the hypergen study,” BMC medical genetics 10, 43 (2009).
- Benjamini and Yekutieli (2005) Y. Benjamini and D. Yekutieli, “Quantitative trait loci analysis using the false discovery rate,” Genetics 171, 783–790 (2005).
- Khouri et al. (2010) M. G. Khouri et al., “A 4-tiered classification of left ventricular hypertrophy based on left ventricular geometry the Dallas heart study,” Circ Cardiovasc Imaging 3, 164–171 (2010).
- Medrano-Gracia et al. (2013) P. Medrano-Gracia et al., “Atlas-based analysis of cardiac shape and function: correction of regional shape bias due to imaging protocol for population studies,” J Cardiovasc Magn Reson 15, 80 (2013).
- Vasan et al. (2009) R. S. Vasan et al., “Genetic variants associated with cardiac structure and function: a meta-analysis and replication of genome-wide association data,” JAMA 302, 168–178 (2009).
- Fox et al. (2013) E. R. Fox et al., “Genome-wide association study of cardiac structure and systolic function in African Americans: the Candidate Gene Association Resource (CARe) study,” Circ Cardiovasc Genet 6, 37–46 (2013).
- Semsarian et al. (2015) C. Semsarian et al., “New perspectives on the prevalence of hypertrophic cardiomyopathy,” J Am Coll Cardiol 65, 1249–1254 (2015).
- Wild et al. (2017) P. S. Wild et al., “Large-scale genome-wide analysis identifies genetic variants associated with cardiac structure and function,” J Clin Invest 127, 1798–1812 (2017).
- Srivastava and Olson (2000) D. Srivastava and E. N. Olson, “A genetic blueprint for cardiac development,” Nature 407, 221 (2000).
- Oktay et al. (2017) O. Oktay et al., “Anatomically constrained neural networks (ACNN): Application to cardiac image enhancement and segmentation,” arXiv preprint arXiv:1705.08302 (2017).
- Bronzino and Peterson (2014) J. D. Bronzino and D. R. Peterson, Biomedical signals, imaging, and informatics (CRC Press, 2014).
- Petersen et al. (2013) S. E. Petersen et al., “Imaging in population science: cardiovascular magnetic resonance in 100,000 participants of UK Biobank-rationale, challenges and approaches,” J Cardiovasc Magn Reson 15, 1 (2013).
- Schafer et al. (2017) S. Schafer et al., “Titin-truncating variants affect heart function in disease cohorts and the general population,” Nat Genet 49, 46–53 (2017).
- Vounou et al. (2012) M. Vounou et al., “Sparse reduced-rank regression detects genetic associations with voxel-wise longitudinal phenotypes in alzheimer’s disease,” Neuroimage 60, 700–716 (2012).
- Liu and Calhoun (2014) J. Liu and V. Calhoun, “A review of multivariate analyses in imaging genetics,” Frontiers in neuroinformatics 8 (2014).
- O’Brien et al. (2002) O’Brien et al., “Working Group on Blood Pressure Monitoring of the European Society of Hypertension International Protocol for validation of blood pressure measuring devices in adults.” Blood Press Monit 7, 3–17 (2002).
- Andrade et al. (1999) A. Andrade et al., “Ambiguous results in functional neuroimaging data analysis due to covariate correlation,” NeuroImage 10, 483–486 (1999).
- Hair (2010) J. F. Hair, ed., Multivariate data analysis, 7th ed. (Prentice Hall, 2010) OCLC: ocn227921762.
- Friston and others. (1994) K. Friston and others., “Assessing the significance of focal activations using their spatial extent,” Human brain mapping 1, 210–220 (1994).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Li et al. (2016) G. Li et al. , “Transcriptomic profiling maps anatomically patterned subpopulations among single embryonic cardiac cells,” Dev Cell 39 , 491–507 (2016).
- 2Hundley et al. (2010) W. G. Hundley et al. , “ACCF/ACR/AHA/NASCI/SCMR 2010 expert consensus document on cardiovascular magnetic resonance: a report of the American College of Cardiology Foundation Task Force on Expert Consensus Documents,” J Am Coll Cardiol 55 , 2614–2662 (2010).
- 3Young and Frangi (2009) A. A. Young and A. F. Frangi, “Computational cardiac atlases: from patient to population and back,” Exp Physiol 94 , 578–596 (2009).
- 4Fonseca et al. (2011) C. G. Fonseca et al. , “The Cardiac Atlas Project–an imaging database for computational modeling and statistical atlases of the heart,” Bioinformatics 27 , 2288–2295 (2011).
- 5Medrano-Gracia et al. (2015) P. Medrano-Gracia et al. , “Challenges of cardiac image analysis in large-scale population-based studies,” Curr Cardiol Rep 17 , 1–7 (2015).
- 6Friston (2007) K. J. Friston, Statistical parametric mapping : the analysis of functional brain images (Elsevier/Academic Press, Amsterdam Boston, 2007).
- 7Smith and Nichols (2009) S. M. Smith and T. E. Nichols, “Threshold-free cluster enhancement: addressing problems of smoothing, threshold dependence and localisation in cluster inference,” Neuro Image 44 , 83–98 (2009).
- 8Bai et al. (2015) W. Bai et al. , “A bi-ventricular cardiac atlas built from 1000+ high resolution MR images of healthy subjects and an analysis of shape and motion,” Med Image Anal 26 , 133–145 (2015).
