Skip to main content

Assessment of right ventricular size and function from cardiovascular magnetic resonance images using artificial intelligence

Abstract

Background

Theoretically, artificial intelligence can provide an accurate automatic solution to measure right ventricular (RV) ejection fraction (RVEF) from cardiovascular magnetic resonance (CMR) images, despite the complex RV geometry. However, in our recent study, commercially available deep learning (DL) algorithms for RVEF quantification performed poorly in some patients. The current study was designed to test the hypothesis that quantification of RV function could be improved in these patients by using more diverse CMR datasets in addition to domain-specific quantitative performance evaluation metrics during the cross-validation phase of DL algorithm development.

Methods

We identified 100 patients from our prior study who had the largest differences between manually measured and automated RVEF values. Automated RVEF measurements were performed using the original version of the algorithm (DL1), an updated version (DL2) developed from a dataset that included a wider range of RV pathology and validated using multiple domain-specific quantitative performance evaluation metrics, and conventional methodology performed by a core laboratory (CORE). Each of the DL-RVEF approaches was compared against CORE-RVEF reference values using linear regression and Bland–Altman analyses. Additionally, RVEF values were classified into 3 categories: ≤ 35%, 35–50%, and ≥ 50%. Agreement between RVEF classifications made by the DL approaches and the CORE measurements was tested.

Results

CORE-RVEF and DL-RVEFs were obtained in all patients (feasibility of 100%). DL2-RVEF correlated with CORE-RVEF better than DL1-RVEF (r = 0.87 vs. r = 0.42), with narrower limits of agreement. As a result, DL2 algorithm also showed increasing accuracy from 0.53 to 0.80 for categorizing RV function.

Conclusions

The use of a new DL algorithm cross-validated on a dataset with a wide range of RV pathology using multiple domain-specific metrics resulted in a considerable improvement in the accuracy of automated RVEF measurements. This improvement was demonstrated in patients whose images were the most challenging and resulted in the largest RVEF errors. These findings underscore the critical importance of this strategy in the development of DL approaches for automated CMR measurements.

Introduction

Right ventricular (RV) function is an important predictor of outcomes in patients with heart disease [1,2,3]. However, the non-invasive assessment of RV size and function is significantly limited by its complex geometry. Cardiovascular magnetic resonance (CMR) quantification of RV volume using the method of disks is currently considered the reference standard for RV size and RVejection fraction (RVEF), because the entire chamber is imaged, the boundary between the RV myocardium and blood pool is clearly delineated, and no geometric assumptions about the RV shape are needed [4]. Indeed, the assessment of RV size and function using CMR has been shown to be of prognostic significance in patients with ischemic [5] and non-ischemic cardiomyopathy [6], valvular heart disease [7], congenital heart disease [8], and pulmonary hypertension [9]. Furthermore, quantification of RV size is an important determinant for guiding interventions, such as pulmonary valve replacement in patients with tetralogy of Fallot [10]. In patients with ischemic or non-ischemic cardiomyopathy, the presence of reduced RVEF in addition to reduced left ventricular (LV) function can better identify patients who may benefit from implantable cardioverter defibrillator and other interventions [1, 2].

Nevertheless, manual segmentation of the RV cavity is time-consuming and has greater inter-reader variability than is seen for the LV cavity [11,12,13,14]. The availability of an image analysis tool that can accurately and automatically quantify RV size and function from CMR images would not only help standardize image interpretation, but also improve clinical workflow. While several commercial tools are capable of accurately assessing LV volumes and ejection fraction, it is well recognized that these tools do not perform as well for the RV assessment. Indeed, we recently demonstrated that the RVEF measured using several commercial software tools resulted in clinically significant errors when compared to conventional measurements performed by an expert [15].

Most currently available software that automatically quantifies RV size and function from CMR images are complex algorithms that use a form of artificial intelligence referred to as deep learning (DL) [16, 17]. Such algorithms are typically trained and tested using thousands of CMR images with the key anatomy annotated by expert physicians. Due to the scarcity of well annotated CMR images, publicly available datasets which are fairly homogenous in nature, are often utilized to develop these DL algorithms. Although a homogenous dataset may be a good way to build the foundations of a DL algorithm, as it often implies standardized labelling or contouring procedures, the absence of a heterogeneous dataset during the testing phase of development may result in an algorithm that functions poorly in clinical practice, where more diverse disease states and image quality are encountered that were not adequately reflected by the training dataset. In this study, we hypothesized that the use of more diverse CMR datasets, which incorporate a wider range of RV pathology, scanner vendors, and imaging quality in addition to the use of multiple domain-specific quantitative performance evaluation metrics during cross-validation phase of DL algorithm development would result in a more generalized model and thus more accurate quantification of RV size and function in a subset of patients in whom the previous version of the software did not perform well.

Methods

Data and materials used in this study will not be made publicly available.

Population and study design

From our prior study [15], in which we compared the ability of several commercial DL-algorithms to automatically calculate RVEF against reference measurements made by a clinical expert. For this study, we selected 100 cases (of the 200 total) with the largest discrepancies between the DL-generated and reference RVEFs (Group 1). Automated RVEF measurements were performed using an updated version of the DL algorithm (DL2) cross-validated on a diverse dataset of CMR images. To assess the improvement in algorithm performance afforded by this strategy, these measurements were compared side-by-side with the measurements performed using the original software (DL1) against the same reference standard. These comparisons included actual RVEF values and also classification of RV function based on these values, namely normal, mildly to moderately reduced and severely reduced function. In addition, similar comparisons were performed on the remaining 100 patients in whom the original DL1 algorithm performed well, i.e. with the least discrepancies with the reference RVEFs (Group 2), in order to verify that the algorithm modifications did not have detrimental effects on RVEF measurements in these patients.

All subjects had previously signed informed consent to be included in a CMR registry permitting their images, clinical data, and outcomes to be used for future research. Patient demographics and clinical history were extracted from the electronic medical records. The protocol was approved by the Institution Review Board.

Deep learning algorithm development

DL algorithm 1

The DL1 segmentation model, included in cvi42 (version 5.11, Circle Cardiovascular Imaging, Calgary, Alberta, Canada), is founded on a convolutional neural network (CNN) based on the U-net architecture. The CNN is trained to associate pixel intensities of a CMR image to segmentation maps corresponding to the desired ventricular contours. During the training stage, the model parameters of CNN were optimized to reduce an energy function computed using the pixel-wise cross-entropy loss function, which penalizes the CNN when it does not correctly predict the segmentation label of a given pixel.

The initial machine learning-based RV contouring CNN was trained on multiple datasets summarized in Table 1. Standard image augmentation techniques were employed during the training phase. The training hyper-parameters of the DL1 model were selected by training multiple models with varying hyperparameters and selecting the model that performed the best on the cross-validation data presented in Table 1, which comprised solely of United Kingdom Biobank (UKBB) studies. The candidate models were compared against one another by computing the mean dice similarity coefficient (DSC) between predicted contours and manual annotations for each contour (RV, LV endocardium, and LV epicardium) independently.

Table 1 Summary of data (total number of cardiovascular magnetic resonance (CMR) images) used during the training and cross-validation phase of the development of the deep learning (DL) algorithms described herein

DL algorithm 2

DL2, included in cvi42 (Version 5.13, Circle Cardiovascular Imaging) was trained using the exact same CNN architecture as DL1. Prior to training DL2, 9745 new images were added to the Clinical dataset and the dataset was resampled such that roughly 20% of the cases were used in training and the remainder were kept for cross-validation. 460 additional tetralogy of Fallot images and 9202 additional images from UKBB were also added to the cross-validation dataset. Importantly, the cross-validation process was performed on this dataset and model selection was done in a manner that was blind to the cases on which it was tested and reported on in this study.

Additional quantitative performance metrics were used to assess the performance of the algorithm during the cross-validation phase as a spatial overlap metric, such as DSC fails to capture potential biases in the predicted contours [18]. In addition to DSC, the contour spatial distance based Hausdorff distance metric and domain specific performance metrics were computed, including absolute errors in RV volume prediction, LV volume and LV ejection fraction (LVEF) prediction, myocardial mass consistency and contour consistency. The total number of discontinuous predictions in the midventricular region was also computed to favor the selection of models with highly consistent predictions. The mean and standard deviation of all performance metrics were considered independently to select the most performant model. The change in the cross-validation process resulted in a different choice in hyperparameters. Specifically, there were changes in how each training batch was sampled, the frequency of the augmentations, and the point in time at which the training algorithm is stopped.

Image acquisition

Vasodilator contrast-enhanced stress perfusion CMR imaging was performed using a 1.5-T scanner (Achieva, Philips Healthcare, Best, The Netherlands) with a five-element phased array cardiac coil. Retrospectively gated cine images were obtained following stress perfusion imaging using a balanced steady-state free precession (bSSFP) sequence, during approximately 5-s breath holds (repetition time 2.9 ms; echo time 1.5 ms; flip angle 60°; temporal resolution 30–40 ms). Standard long-axis views were obtained, including four-chamber, two-chamber, and three-chamber images. In addition, six to ten short-axis slices were obtained from the LV and RV base to the apex (slice thickness 8 mm; gap 2 mm).

Conventional image analysis

The cine-CMR images were analyzed semi-automatically using commercial software (CardioAI, Arterys, San Francisco, California, USA) by an experienced expert reader in our core lab, who was blinded to clinical information. Using the short-axis cine images, the RV endocardial boundary was automatically detected by the software and then corrected by the expert in end-diastolic and end-systolic frames. RV papillary muscles and trabecular tissue were included in the blood pool volume. The RV outflow tract was included in the RV volume to the level of the pulmonary valve. When the boundary between the atrium and ventricle was unclear, long axis views and other frames of the cardiac cycle were reviewed to differentiate between the two chambers. Simpson’s method of disks was used to calculate RV end-diastolic volume (RVEDV) and end-systolic volume (RVESV), and RVEF. All volume measurements were indexed to body surface area. The RV measurements made by the core lab expert were considered as the reference standard (CORE-RVEF, CORE-RVEDV, CORE-RVESV). CORE-RVEF were used to divide the patient cohort into three clinically relevant groups [19,20,21]: severely reduced RVEF (≤ 35%), mildly to moderately reduced RVEF (35–50%), and normal RVEF (≥ 50%).

DL image analysis

The RVEF was determined using two different fully automated DL-algorithms DL1 and DL2, described above. Cine images were uploaded into the software and fully automated segmentation was then performed without any further user input. A plane demarcating the ventricular base was identified automatically. Using the entire short axis stack and standard long axis views, both DL1 and DL2 generated time-volume curves, from which end-diastole and end-systole were automatically determined and used to calculate RVEDV, RVESV, and RVEF.

Statistics

Continuous variables were tested for normal distribution. Continuous variables that were not normally distributed, were presented as the median with interquartile range (IQR). Categorical variables were presented as absolute numbers with percentages. Inter-technique comparisons included linear regression analysis with Pearson correlation coefficients and Bland–Altman analyses of biases and limits of agreement. This included the agreement between each of the DL techniques (DL1 and DL2) against the CORE reference. Histograms displaying the difference in RVEF measurements between the DL algorithms and the core lab were generated. Confusion matrices were generated for each DL-RVEF algorithm to display the concordance/discordance with the CORE-RVEF for each of the RVEF categories (≤ 35%, 35–50%, and ≥ 50%). The sensitivity, specificity, and accuracy of DL1-RVEF and DL2-RVEF algorithms’ ability to correctly categorize RV function were also calculated. Reproducibility was tested for the 2 DL algorithms on 20 randomly selected patients and by manual analyses by two expert readers, including inter- and intra-observer reproducibility. Inter- and intra- observer variability was assessed using intraclass correlation coefficients (ICC) and coefficients of variation (CoV). P-values < 0.05 were considered significant. Analyses were performed using SPSS software (version 23.0, Statistical Package for the Social Sciences, International Business Machines, Inc., Armonk, New York, llinois, USA).

Results

Patient demographics

Patient characteristics for Group 1 are shown in Table 2 along with the relevant imaging findings. Figure 1 shows examples of RV boundaries as determined by the core-lab and by the 2 DL algorithms.

Table 2 Population baseline characteristics and CMR parameters in the clinical setting
Fig. 1
figure 1

Examples of images of contours detected by the core lab (CORE), original deep learning (DL1) and updated deep learning (DL2) (shown from top to bottom). Non-ischemic cardiomyopathy at basal slice

Relationship between CORE-RVEF and DL-RVEF

Compared to CORE-RVEF, the correlations of DL1-RVEF and DL2-RVEF were 0.42 and 0.87, respectively (Fig. 2). The results of Bland–Altman analyses for the two DL algorithms in comparison to the core lab analysis are depicted in Fig. 2. Algorithm DL2 performed significantly better than DL1, as reflected by considerably narrower limits of agreement: − 1 ± 11% for DL2 versus − 1 ± 28% for DL1. Figure 3 shows histograms that demonstrate the frequency of absolute differences between CORE-RVEF and each of the DL-RVEF algorithms for Group 1 (top) and Group 2 (bottom). For DL1, no case in Group 1 had ≤ 5% error as a consequence of the study design, while errors > 10% were noted in 35(35%) of cases and errors exceeding 20% were present in 17(17%). In contrast, for algorithm DL2, 44 (44%) cases in Group 1 were within 3% absolute error and 70(70%) cases were within 5% error, while only 10 (10%) cases showed errors > 10% when compared to CORE-RVEF. In group 2, no case had > 5% error according to DL1, as a consequence of the study design, and DL2 resulted in a relatively small number of cases with larger errors.

Fig. 2
figure 2

Linear regression plots (top) and Bland–Altman plots (bottom) comparing CORE and DL1 (left), CORE and DL2 (right). Red lines represent the regression lines, and green lines represent perfect agreement (unity lines)

Fig. 3
figure 3

Histograms showing the distribution of absolute differences between CORE right ventricular ejection fraction (RVEF) versus DL1 RVEF (left) and CORE RVEF versus DL2 RVEF (right) for Group 1 (top) and Group 2 (bottom). See text for details

Relationship between CORE-RV size and DL-RV size

For RVEDV and RVESV, algorithm DL1 was less accurate than DL2 when compared to CORE lab reference values (Fig. 4). For RVEDV, the correlations between CORE-RVEDV and DL-RVEDV were 0.65 (DL1) and 0.92 (DL2). For RVESV, the correlations between CORE-RVESV and DL-RVESV were 0.54 (DL1) and 0.94 (DL2). Bland–Altman analyses of comparison between CORE-RVEDV and DL-RVEDV resulted in smaller biases and narrower limits of agreement for DL2: − 32 ± 103 ml (DL1), − 13 ± 37 ml (DL2). For RVESV, the bias and limits of agreement were − 15 ± 93 ml (DL1), − 4 ± 24 ml (DL2).

Fig. 4
figure 4

Linear regression plots (left) comparing CORE RVEDV and DL1 RVEDV (A), CORE RVEDV and DL2 RVEDV (B), CORE RVESV and DL1 RVESV (E), CORE RVESV and DL2 RVESV (F). Bland–Altman plots (right) comparing CORE RVEDV and DL1 RVEDV (C), CORE RVEDV and DL2 RVEDV (D), CORE RVESV and DL1 RVESV (G), CORE RVESV and DL2 RVESV (H). Red lines represent the regression lines, and green lines represent perfect agreement (unity lines). RVEDV, right ventricular end-diastolic volume; RVESV, right ventricular end-systolic volume

Reproducibility

The intraclass correlations for the inter- and intra-observer variability of manual measurements were 0.90 and 0.94 for the RVEF, 0.94 and 0.96 for the RVEDV, 0.94 and 0.97 for the RVESV, respectively. In contrast, DL algorithms showed zero variability in all repeated measurements, due to their fully-automated, deterministic nature.

Classification into ejection fraction categories

Based on CORE-RVEF, 11 of the 100 patients had severely reduced RVEF (≤ 35%), 21 had mildly to moderately reduced RVEF (35–50%), and 68 had preserved RVEF (≥ 50%) (Table 3). The accuracy of the DL1 and DL2 algorithms for classifying RV function into appropriate EF group was 0.53 and 0.80, respectively. The lowest rates of accurate classifications were noted in the middle RVEF category of 35–50% (Fig. 5).

Table 3 Population characteristics and imaging parameters in subgroups
Fig. 5
figure 5

Confusion matrices showing accuracy of DL-RVEF to correctly categorize into clinically meaningful RVEF groups as defined by the RVEF by core lab (CORE-RVEF). Across true label rows, the numbers in the boxes represent the number and percentage of labels classified for each group. Color intensity corresponds to percentage, see heat map on the right

Discussion

In this study, we aimed to determine the impact of including CMR images with diverse RV pathology during the cross-validation stage on the performance of a DL algorithm developed to automatically quantify RV size and function. We found that such an approach resulted in substantial and clinically important improvements in the performance of the algorithm in patients whose images were difficult to handle for the previous version of the algorithm. These findings were in agreement with several previous studies [22,23,24].

RV dysfunction is associated with poor clinical outcomes in a wide range of cardiovascular disorders including but not limited to acute myocardial infarction, LV heart failure, congenital heart disease, arrhythmogenic cardiomyopathy, and pulmonary hypertension [25, 26]. RV systolic function is highly sensitive to changes in the afterload with small increases in afterload causing large decreases in stroke volume. Importantly, left heart disease can affect RV function through abnormal motion of the ventricular septum. Thus, quantification of RVEF, as a biomarker of RV systolic function, can be used to guide therapeutic decision making and to assess prognosis [27,28,29]. Although RVEF is a continuous variable, RVEF cutoff values have been shown to have prognostic value in patients with dilated cardiomyopathy [28] and in other disease states [9]. The importance of RVEF for prediction of cardiovascular outcomes was demonstrated in a cohort comprised of a broad range of cardiovascular diseases by demonstrating that each 10% drop in RVEF was associated with a 1.33-fold increased risk [3]. In fact, RVEF is associated with poor outcomes even in individuals with a preserved LVEF [3, 30].

In recognition of the importance of RV function, CMR is increasingly being used to assess the RV due to its ability to accurately quantify chamber size and function without being limited by the complex RV geometry. However, the process remains time consuming and even fully automated, commercially available DL techniques used to quantify RVEF may perform poorly in some patients, and as a result may fail to predict clinical events [31], suggesting that the current approach to developing these algorithms needs to be refined. In our study, we investigated a new approach to selecting a candidate algorithm to be used in clinical practice. The biggest source of improvement in this study is from the increased variability in the cross-validation dataset and addition of multiple performance metrics employed to guide model selection. The addition of 10,161 images to include RV pathologies, such as pulmonary arterial hypertension, repaired tetralogy of Fallot, etc. were used to revamp the cross-validation strategy. Contour overlap metrics and the ability of accurate clinical biomarkers and results were factored into the exploration of DL2 algorithm. Additionally, the strict enforcement of a spatiotemporal consistency in the predicted outputs was adopted to post-processing in all stacks, which ensured the removal of outliers in the event that contours are predicted above the basal slice or below the apex.

Although other studies have examined the accuracy of DL algorithms to quantify RVEF in comparison to a human expert [12, 32, 33], they did not examine the clinical implications of any specific sources of error. Rahman et al., reported high levels of agreement between the DL algorithm and manual measurement of RVEF, but the focus was limited to the healthy population [34]. In a study by Hakim et al., when comparing automated RVEF measurements to a human expert generated reference values, the DL algorithm showed an r-value of 0.76. The major source of error has typically been explained by poor automated segmentation at the basal slice, as was observed in prior studies [35]. In our prior study [15], we compared manual RVEF by a clinician to automated RVEF measurements made by three DL methods developed using publicly available CMR datasets and found that all three DL methods performed relatively poorly in a significant proportion of the patients.

In this current study, we selected from our prior study [15] the 100 cases, in which the DL algorithm performed the worst, and found that an algorithm developed using a new approach with enhanced cross-validation resulted in considerable improvement in its clinical performance. Indeed, measurements of RVEDV, RVESV, and RVEF made using the improved algorithm correlated better with the reference measurements. This increased correlation translated into more accurate classification of RVEF function. However, it is important to note that the correlation between the core laboratory measurement and the DL algorithm was not perfect, 10% of cases still had a > 10% absolute error in the RVEF measurement, and 20% of patients were still categorized into an incorrect RVEF group. This emphasizes the need for additional algorithmic improvements in the future. Detecting the RV endocardial contour is challenging for both the clinician and automated DL algorithms. This can be attributed to the unique morphology, thinner myocardium, and higher trabeculation burden of the RV chamber. The basal short-axis slice is most often inaccurately segmented by both an expert and by the DL algorithms, particularly at the boundary of the pulmonary artery and right atrium. Furthermore, the RV apex is frequently incorrectly identified by the DL algorithms in cases of pathological RV morphology. Another challenge for the DL-algorithms for calculating RVEF is the accurate identification of the ED and ES frames (Fig. 6). In fact, some algorithms require end-diastole and end-systole to be pre-selected to overcome this challenge [36].

Fig. 6
figure 6

Example of images with contours detected by CORE, DL1 and DL2 algorithms. CORE and DL2 identified the correct ED and ES phases, but DL1 identified an incorrect ES phase. The CORE-RVEF was 52%, DL1-RVEF was 24%, and DL2-RVEF was 50%

Limitations

This was a single-center study performed in a relatively small number of patients. Although we demonstrated considerable improvement of algorithm performance in our cohort, it is possible that it may not perform as well on images acquired using other scanners or other CMR image acquisition techniques. Additionally, our study cohort was selected from patients referred for a CMR stress test and was not enriched with patients known to have more complex RV anatomy. Thus, it is unknown how well this current algorithm would perform when it encounters more complex clinical cases, such as cyanotic congenital heart disease and other distorted anatomies. Also, one could view as a limitation the fact that this study was performed on stress rather than resting CMR images. There were three reasons for this choice: (1) vasodilator stress is a common indication for CMR imaging; (2) we suspected that the algorithms may not perform as well in this situation, since the images used to quantify EF are usually acquired after the administration of a gadolinium-based contrast agent; and (3) the previous study, in which the original algorithm did not perform well in this subset of patients, was performed on their stress CMR images, and therefore testing the hypothesis that the retrained algorithm would perform better had to be done on the same images.

Conclusions

The use of a diverse dataset during the cross-validation phase of DL algorithm development resulted in a considerable improvement in the accuracy of the automated analysis of RV volumes and RVEF in patients in whom the previous version of the software did not perform well. However, despite the improvement, the current fully automated algorithm is still prone to errors when compared to a core laboratory measurement due to intrinsic anatomical challenges and therefore requires verification by an experienced reader.

Availability of data and materials

Data supporting the results reported in the manuscript can be found in a computer workstation in the Cardiac Imaging Laboratory at the University of Chicago.

Abbreviations

bSSFP:

Steady-state free precession

CMR:

Cardiovascular magnetic resonance

CNN:

Convolutional neural network

CORE:

Core lab

CoV:

Coefficient of variation

DL:

Deep learning

DSC:

Dice similarity coefficient

EDV:

End-diastolic volume

EF:

Ejection fraction

ESV:

End-systolic volume

ICC:

Intraclass correlation coefficients

LV:

Left ventricle/left ventricular

LVEF:

Left ventricular ejection fraction

RV:

Right ventricle/right ventricular

RVEDV:

Right ventricular end-diastolic volume

RVEF:

Right ventricular ejection fraction

RVESV:

Right ventricular end-systolic volume

UKBB:

United Kingdom Biobank

References

  1. Elming MB, Hammer-Hansen S, Voges I, Nyktari E, Raja AA, Svendsen JH, Pehrson S, Signorovitch J, Kober LV, Prasad SK, Thune JJ. Right ventricular dysfunction and the effect of defibrillator implantation in patients with nonischemic systolic heart failure. Circ Arrhythm Electrophysiol. 2019;12:e007022.

    Article  Google Scholar 

  2. Mikami Y, Jolly U, Heydari B, Peng M, Almehmadi F, Zahrani M, Bokhari M, Stirrat J, Lydell CP, Howarth AG, Yee R, White JA. Right ventricular ejection fraction is incremental to left ventricular ejection fraction for the prediction of future arrhythmic events in patients with systolic dysfunction. Circ Arrhythm Electrophysiol. 2017;10.

  3. Purmah Y, Lei LY, Dykstra S, Mikami Y, Cornhill A, Satriano A, Flewitt J, Rivest S, Sandonato R, Seib M, Lydell CP, Howarth AG, Heydari B, Merchant N, Bristow M, Fine N, Gaztanaga J, White JA. Right ventricular ejection fraction for the prediction of major adverse cardiovascular and heart failure-related events: a cardiac MRI based study of 7131 patients with known or suspected cardiovascular disease. Circ Cardiovasc Imaging. 2021;14:e011337.

    Article  Google Scholar 

  4. Addetia K, Bhave NM, Tabit CE, Gomberg-Maitland M, Freed BH, Dill KE, Lang RM, Mor-Avi V, Patel AR. Sample size and cost analysis for pulmonary arterial hypertension drug trials using various imaging modalities to assess right ventricular size and function end points. Circ Cardiovasc Imaging. 2014;7:115–24.

    Article  Google Scholar 

  5. Larose E, Ganz P, Reynolds HG, Dorbala S, Di Carli MF, Brown KA, Kwong RY. Right ventricular dysfunction assessed by cardiovascular magnetic resonance imaging predicts poor prognosis late after myocardial infarction. J Am Coll Cardiol. 2007;49:855–62.

    Article  Google Scholar 

  6. Pueschner A, Chattranukulchai P, Heitner JF, Shah DJ, Hayes B, Rehwald W, Parker MA, Kim HW, Judd RM, Kim RJ, Klem I. The prevalence, correlates, and impact on cardiac mortality of right ventricular dysfunction in nonischemic cardiomyopathy. JACC Cardiovasc Imaging. 2017;10:1225–36.

    Article  Google Scholar 

  7. Khalique OK, Cavalcante JL, Shah D, Guta AC, Zhan Y, Piazza N, Muraru D. Multimodality imaging of the tricuspid valve and right heart anatomy. JACC Cardiovasc Imaging. 2019;12:516–31.

    Article  Google Scholar 

  8. Geva T. MRI is the preferred method for evaluating right ventricular size and function in patients with congenital heart disease. Circ Cardiovasc Imaging. 2014;7:190–7.

    Article  Google Scholar 

  9. Freed BH, Gomberg-Maitland M, Chandra S, Mor-Avi V, Rich S, Archer SL, Jamison EB Jr, Lang RM, Patel AR. Late gadolinium enhancement cardiovascular magnetic resonance predicts clinical worsening in patients with pulmonary hypertension. J Cardiovasc Magn Reson. 2012;14:11.

    Article  Google Scholar 

  10. Wald RM, Valente AM, Gauvreau K, Babu-Narayan SV, Assenza GE, Schreier J, Gatzoulis MA, Kilner PJ, Koyak Z, Mulder B, Powell AJ, Geva T. Cardiac magnetic resonance markers of progressive RV dilation and dysfunction after tetralogy of Fallot repair. Heart. 2015;101:1724–30.

    Article  CAS  Google Scholar 

  11. Abouzeid CM, Shah T, Johri A, Weinsaft JW, Kim J. Multimodality imaging of the right ventricle. Curr Treat Options Cardiovasc Med. 2017;19:82.

    Article  Google Scholar 

  12. Backhaus SJ, Staab W, Steinmetz M, Ritter CO, Lotz J, Hasenfuss G, Schuster A, Kowallick JT. Fully automated quantification of biventricular volumes and function in cardiovascular magnetic resonance: applicability to clinical routine settings. J Cardiovasc Magn Reson. 2019;21:24.

    Article  Google Scholar 

  13. Clarke CJ, Gurka MJ, Norton PT, Kramer CM, Hoyer AW. Assessment of the accuracy and reproducibility of RV volume measurements by CMR in congenital heart disease. JACC Cardiovasc Imaging. 2012;5:28–37.

    Article  Google Scholar 

  14. Genovese D, Rashedi N, Weinert L, Narang A, Addetia K, Patel AR, Prater D, Goncalves A, Mor-Avi V, Lang RM. Machine learning-based three-dimensional echocardiographic quantification of right ventricular size and function: validation against cardiac magnetic resonance. J Am Soc Echocardiogr. 2019;32:969–77.

    Article  Google Scholar 

  15. Wang S, Patel H, Miller T, Ameyaw K, Narang A, Chauhan D, Anand S, Anyanwu E, Besser SA, Kawaji K, Liu XP, Lang RM, Mor-Avi V, Patel AR. AI based CMR assessment of biventricular function: clinical significance of intervendor variability and measurement errors. JACC Cardiovasc Imaging. 2021;15:413–27.

    Article  Google Scholar 

  16. Avendi MR, Kheradvar A, Jafarkhani H. Automatic segmentation of the right ventricle from cardiac MRI using a learning-based approach. Magn Reson Med. 2017;78:2439–48.

    Article  CAS  Google Scholar 

  17. Zhang L, Karanikolas GV, Akcakaya M, Giannakis GB. Fully automatic segmentation of the right ventricle via multi-task deep neural networks. Proc IEEE Int Conf Acoust Speech Signal Process. 2018;2018:6677–81.

    PubMed  PubMed Central  Google Scholar 

  18. Taha AA, Hanbury A. Metrics for evaluating 3D medical image segmentation: analysis, selection, and tool. BMC Med Imaging. 2015;15:29.

    Article  Google Scholar 

  19. Al-Khatib SM, Stevenson WG, Ackerman MJ, Bryant WJ, Callans DJ, Curtis AB, Deal BJ, Dickfeld T, Field ME, Fonarow GC, Gillis AM, Granger CB, Hammill SC, Hlatky MA, Joglar JA, Kay GN, Matlock DD, Myerburg RJ, Page RL. 2017 AHA/ACC/HRS guideline for management of patients with ventricular arrhythmias and the prevention of sudden cardiac death: a report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines and the Heart Rhythm Society. J Am Coll Cardiol. 2018;72:e91–220.

    Article  Google Scholar 

  20. Corrado D, Wichter T, Link MS, Hauer R, Marchlinski F, Anastasakis A, Bauce B, Basso C, Brunckhorst C, Tsatsopoulou A, Tandri H, Paul M, Schmied C, Pelliccia A, Duru F, Protonotarios N, Estes NA 3rd, McKenna WJ, Thiene G, Marcus FI, Calkins H. Treatment of arrhythmogenic right ventricular cardiomyopathy/dysplasia: an international task force consensus statement. Eur Heart J. 2015;36:3227–37.

    Article  Google Scholar 

  21. Ponikowski P, Voors AA, Anker SD, Bueno H, Cleland JGF, Coats AJS, Falk V, Gonzalez-Juanatey JR, Harjola VP, Jankowska EA, Jessup M, Linde C, Nihoyannopoulos P, Parissis JT, Pieske B, Riley JP, Rosano GMC, Ruilope LM, Ruschitzka F, Rutten FH, van der Meer P, Group ESCSD. 2016 ESC Guidelines for the diagnosis and treatment of acute and chronic heart failure: The Task Force for the diagnosis and treatment of acute and chronic heart failure of the European Society of Cardiology (ESC)Developed with the special contribution of the Heart Failure Association (HFA) of the ESC. Eur Heart J. 2016;37:2129–200.

    Article  Google Scholar 

  22. Chen C, Bai W, Davies RH, Bhuva AN, Manisty CH, Augusto JB, Moon JC, Aung N, Lee AM, Sanghvi MM, Fung K, Paiva JM, Petersen SE, Lukaschuk E, Piechnik SK, Neubauer S, Rueckert D. Improving the generalizability of convolutional neural network-based segmentation on CMR images. Front Cardiovasc Med. 2020;7:105.

    Article  Google Scholar 

  23. Campello VM, Gkontra P, Izquierdo C, Martin-Isla C, Sojoudi A, Full PM, Maier-Hein K, Zhang Y, He Z, Ma J, Parreno M, Albiol A, Kong F, Shadden SC, Acero JC, Sundaresan V, Saber M, Elattar M, Li H, Menze B, Khader F, Haarburger C, Scannell CM, Veta M, Carscadden A, Punithakumar K, Liu X, Tsaftaris SA, Huang X, Yang X, Li L, Zhuang X, Vilades D, Descalzo ML, Guala A, Mura L, Friedrich MG, Garg R, Lebel J, Henriques F, Karakas M, Cavus E, Petersen SE, Escalera S, Segui S, Rodriguez-Palomares JF, Lekadir K. Multi-centre, multi-vendor and multi-disease cardiac segmentation: the M&Ms challenge. IEEE Trans Med Imaging. 2021;40:3543–54.

    Article  Google Scholar 

  24. Mariscal Harana J, Vergani V, Ascher C, Razavi R, King A, Ruijsink B, Puyol-Anton E. Large-scale, multi-vendor, multi-protocol, quality-controlled analysis of clinical cine CMR using artificial intelligence. Eur Heart J Caediovasc Imag. 2021;22:jeab090046.

    Article  Google Scholar 

  25. Alabed S, Shahin Y, Garg P, Alandejani F, Johns CS, Lewis RA, Condliffe R, Wild JM, Kiely DG, Swift AJ. Cardiac-MRI predicts clinical worsening and mortality in pulmonary arterial hypertension: a systematic review and meta-analysis. JACC Cardiovasc Imaging. 2021;14:931–42.

    Article  Google Scholar 

  26. Meyer P, Filippatos GS, Ahmed MI, Iskandrian AE, Bittner V, Perry GJ, White M, Aban IB, Mujib M, Dell’Italia LJ, Ahmed A. Effects of right ventricular ejection fraction on outcomes in chronic systolic heart failure. Circulation. 2010;121:252–8.

    Article  Google Scholar 

  27. Guazzi M, Dixon D, Labate V, Beussink-Nelson L, Bandera F, Cuttica MJ, Shah SJ. RV contractile function and its coupling to pulmonary circulation in heart failure with preserved ejection fraction: stratification of clinical phenotypes and outcomes. JACC Cardiovasc Imaging. 2017;10:1211–21.

    Article  Google Scholar 

  28. Gulati A, Ismail TF, Jabbour A, Alpendurada F, Guha K, Ismail NA, Raza S, Khwaja J, Brown TD, Morarji K, Liodakis E, Roughton M, Wage R, Pakrashi TC, Sharma R, Carpenter JP, Cook SA, Cowie MR, Assomull RG, Pennell DJ, Prasad SK. The prevalence and prognostic significance of right ventricular systolic dysfunction in nonischemic dilated cardiomyopathy. Circulation. 2013;128:1623–33.

    Article  Google Scholar 

  29. Patel RB, Li E, Benefield BC, Swat SA, Polsinelli VB, Carr JC, Shah SJ, Markl M, Collins JD, Freed BH. Diffuse right ventricular fibrosis in heart failure with preserved ejection fraction and pulmonary hypertension. ESC Heart Fail. 2020;7:253–63.

    PubMed  PubMed Central  Google Scholar 

  30. Guazzi M, Naeije R. Right heart phenotype in heart failure with preserved ejection fraction. Circ Heart Fail. 2021;14:e007840.

    Article  Google Scholar 

  31. Dawes TJW, de Marvao A, Shi W, Fletcher T, Watson GMJ, Wharton J, Rhodes CJ, Howard L, Gibbs JSR, Rueckert D, Cook SA, Wilkins MR, O’Regan DP. Machine learning of three-dimensional right ventricular motion enables outcome prediction in pulmonary hypertension: a cardiac MR imaging study. Radiology. 2017;283:381–90.

    Article  Google Scholar 

  32. Backhaus SJ, Schuster A, Lange T, Stehning C, Billing M, Lotz J, Pieske B, Hasenfuss G, Kelle S, Kowallick JT. Impact of fully automated assessment on interstudy reproducibility of biventricular volumes and function in cardiac magnetic resonance imaging. Sci Rep. 2021;11:11648.

    Article  CAS  Google Scholar 

  33. Schuster A, Lange T, Backhaus SJ, Strohmeyer C, Boom PC, Matz J, Kowallick JT, Lotz J, Steinmetz M, Kutty S, Bigalke B, Gutberlet M, de Waha-Thiele S, Desch S, Hasenfuss G, Thiele H, Stiermaier T, Eitel I. Fully automated cardiac assessment for diagnostic and prognostic stratification following myocardial infarction. J Am Heart Assoc. 2020;9:e016612.

    Article  Google Scholar 

  34. Attar R, Pereanez M, Gooya A, Alba X, Zhang L, de Vila MH, Lee AM, Aung N, Lukaschuk E, Sanghvi MM, Fung K, Paiva JM, Piechnik SK, Neubauer S, Petersen SE, Frangi AF. Quantitative CMR population imaging on 20,000 subjects of the UK Biobank imaging study: LV/RV quantification pipeline and its evaluation. Med Image Anal. 2019;56:26–42.

    Article  Google Scholar 

  35. Fadil H, Totman JJ, Hausenloy DJ, Ho HH, Joseph P, Low AF, Richards AM, Chan MY, Marchesseau S. A deep learning pipeline for automatic analysis of multi-scan cardiovascular magnetic resonance. J Cardiovasc Magn Reson. 2021;23:47.

    Article  Google Scholar 

  36. Bernard O, Lalande A, Zotti C, Cervenansky F, Yang X, Heng PA, Cetin I, Lekadir K, Camara O, Gonzalez Ballester MA, Sanroma G, Napel S, Petersen S, Tziritas G, Grinias E, Khened M, Kollerathu VA, Krishnamurthi G, Rohe MM, Pennec X, Sermesant M, Isensee F, Jager P, Maier-Hein KH, Full PM, Wolf I, Engelhardt S, Baumgartner CF, Koch LM, Wolterink JM, Isgum I, Jang Y, Hong Y, Patravali J, Jain S, Humbert O, Jodoin PM. Deep learning techniques for automatic MRI cardiac multi-structures segmentation and diagnosis: is the problem solved? IEEE Trans Med Imaging. 2018;37:2514–25.

    Article  Google Scholar 

Download references

Acknowledgements

Not applicable.

Disclosures

In addition to employment relationships between several authors as listed above, ARP has received research support from Philips, General Electric, Arterys, CircleCVI, and Neosoft.

Funding

This project was supported by the National Center for Advancing Translational Sciences (NCATS) of the National Institutes of Health (NIH) through Grant Number 5UL1TR002389-02 that funds the Institute for Translational Medicine (ITM). HP was funded by a T32 Cardiovascular Sciences Training Grant (5T32HL7381). The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.

Author information

Authors and Affiliations

Authors

Contributions

SW: enrolled patients, acquired and analyzed images, drafted the manuscript. DS: concept development, analyzed images. HP: enrolled patients, statistical analysis and writing the manuscript. AAK, IFS, AS, SF: developed and implemented in software the AI algorithm. AS, LL: critically reviewed the manuscript. TM, KA, AN: enrolled patients and data collection. KK, QT: concept development and critical review of the manuscript. VM: concept development, statistical analysis and writing the manuscript. ARP: all aspects of study. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Amit R. Patel.

Ethics declarations

Ethics approval and consent to participate

The study was approved by the Institutional Review Board of the University of Chicago with a written informed consent.

Consent for publication

Not applicable.

Competing interests

In addition to employment relationships between several authors as listed above, ARP has received research support from Philips, General Electric, Arterys, CircleCVI, and Neosoft.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Dr. Robert Judd served as a JCMR Guest Editor for this manuscript.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, S., Chauhan, D., Patel, H. et al. Assessment of right ventricular size and function from cardiovascular magnetic resonance images using artificial intelligence. J Cardiovasc Magn Reson 24, 27 (2022). https://doi.org/10.1186/s12968-022-00861-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12968-022-00861-5

Keywords