Sources of variability in quantification of CMR infarct size and their impact on sample size calculations - reproducibility among three core laboratories
Journal of Cardiovascular Magnetic Resonance volume 17, Article number: P84 (2015)
Infarct size is increasingly used as an efficacy endpoint in randomized trials comparing acute myocardial infarct (AMI) therapies. Infarct size, depicted by delayed-enhancement-CMR, is quantified using manual planimetry (MANUAL), visual scoring (VISUAL), or automated techniques using signal-intensity thresholding to define infarct borders (AUTO). Although AUTO is considered the most reproducible, prior studies did not account for the subjective determination of endocardial/epicardial borders, which all methods require. For MANUAL and VISUAL, prior studies have not explicitly defined how to treat intermediate signal-intensities due to partial volume. We wanted to assess sources of variability among 6 methods in quantification of AMI size, and illustrate the significance of these findings on sample size calculations for clinical trials.
Scans of 30 AMI patients and 12 controls were sent to 3 core-laboratories. Infarct size was measured using 6 methods, each separated by >2-months time, as follows (n=540 evaluations):  AUTO;  AUTO-UC (user correction for endocardial border pixels, no-reflow, etc.);  MANUAL;  MANUAL-ISI (adjustment for intermediate signal-intensities);  VISUAL;  VISUAL-ISI. Reproducibility was assessed by calculating the coefficient of variation (CV) and intraclass correlation coefficient (ICC). Using standard variance components analysis, we calculated the variance between-patients and within-patients separately.
Mean infarct size varied between 16.8% and 27.2% of LV mass depending on the method. Even AUTO (no user interaction for infarct borders) resulted in significant within-patient variability given the need to delineate endocardial/epicardial contours (CV=10.6%). Adding user input to correct computer generated infarct borders resulted in a mild improvement in reproducibility (AUTO-UC: CV=8.3%; p=0.045 for comparison with AUTO). For manual and visual categories, explicitly adjusting for intermediate signal-intensities led to improved reproducibility (MANUAL-ISI vs MANUAL: CV=8.3% vs 14.4%; p=0.03; VISUAL-ISI vs VISUAL: CV=8.4% vs 10.9%; p=0.01). When the best techniques in each category were compared, reproducibility was similar (AUTO-UC, MANUAL-ISI, and VISUAL-ISI: CV=8.3%, 8.3%, 8.4%, respectively). For these 3 techniques the within-patient variability due to the quantification method was less than 10% of the total variability. Hence, there were minimal differences between these methods in the calculated sample sizes needed to detect a 3%, 5%, and 7% absolute reduction in acute infarct size.
Among CMR core-laboratories, an important source of variability in infarct size quantification is the subjective delineation of endocardial/epicardial borders. When intermediate signal intensities are considered in manual planimetry and visual scoring, reproducibility and impact on sample size are similar to automated techniques.
About this article
Cite this article
Klem, I., Heiberg, E., van Assche, L. et al. Sources of variability in quantification of CMR infarct size and their impact on sample size calculations - reproducibility among three core laboratories. J Cardiovasc Magn Reson 17 (Suppl 1), P84 (2015). https://doi.org/10.1186/1532-429X-17-S1-P84