In Vivo Size Error Measurements Of Potential Liver Lesions Detected On MRI
D Nir, B Davison, M Kwong, C Roth, L Anastasia-Rubino, R Tello
Keywords
detection, growth, liver, medicine, mri, radiology, sensitvity
Citation
D Nir, B Davison, M Kwong, C Roth, L Anastasia-Rubino, R Tello. In Vivo Size Error Measurements Of Potential Liver Lesions Detected On MRI. The Internet Journal of Radiology. 2000 Volume 2 Number 1.
Abstract
Background: With the increasing use of MR to evaluate liver lesions the ability to accurately characterize individual liver lesions becomes critical if diagnostic certainty regarding malignant versus benign etiology is to supplant biopsy in many of these lesions. Our aims were to establish the error margin for lesion size characterization in vivo so that it becomes possible to ascertain when there has been a true interval size change in a lesion detected on MRI of the liver.
Methods: A phantom study using 8mm chambers filled with water attached onto the abdomen of 2 volunteers during each undergoing 4 abdominal T2W MR exams using typical clinical parameters were measured twice each by 2 blinded observers and the size measurements (320 total) were analyzed for size consistency and variation.
Results: The mean measurement was 8.5 mm with a standard deviation of 0.5mm. Indicating that at least 1mm size change (on the computer monitor) must be detected prior to concluding a true change in a lesion detected on abdominal MRI. (P<0.05).
Conclusion: We recommend that at least a 1 mm change in size be used as a criteria to diagnose true lesion growth or regression when measured on T2W abdominal MRI.
Introduction
Liver lesion characterization has relied on the tendency for malignant lesions to be less bright on sequential echoes than cysts or hemangiomas with morphology and enhancement patterns used as adjuncts due to lack of certainty and consistency in lesion brightness characterization. Quantitative analysis has implemented dual echo T2 MR imaging with quantitation of T2 relaxation time showing an accuracy of 97% with consistent performance [1]. Unfortunately, prior work has demonstrated significant overlap of benign and malignant lesions [2] which makes use of T2 relaxation a weak predictor of lesion malignancy when in the range of 90-120msec.
It is our hypothesis that measurements of serial lesion size for equivocal liver lesions detected on T2W MRI may be the only means of ascertaining if a lesion is malignant based on interval growth. Our aims were to establish the error margin for lesion size characterization in-vivo to ascertain when there has been a true interval size change in a lesion detected on MRI of the liver. Though phantom studies in isolation can establish the reliability and repeatability of measurements on MRI, in-vivo liver lesion evaluation suffer from respiratory and patient body motion, phase encoding artifact and signal to noise effects due to body habitus. It was the purpose of this project to simulate these phenomena by placing a phantom onto the abdominal wall of volunteers adjacent to the liver so that in-vivo error measurements could be quantitiated with a known ‘lesion size’.
Materials and Methods
A phantom study using 8mm chambers filled with water attached onto the abdomen of 2 volunteers (males ages 25 [55Kg] and 32 [80Kg]) each imaged twice with a 30 day period between studies for a total of 4 abdominal T2W MR sequences during each examination. The phantom size was measured by 2 different observers on the computer monitor and on the film generated and the size measurements were analyzed for size consistency and variation.
MR Imaging
All MR images were acquired on a superconducting 1.5-T MR imager (Gyroscan ACS NT; Philips Medical Systems, Best, The Netherlands) using conventional spin-echo sequences, and Fast spin-echo sequences. On the conventional spin echo (SE) sequences to obtain both moderately and heavily T2-weighted images the following standard scan parameters were used; a repetition time (TR) of 3600 msec, a first echo time (TE) of 50 msec and a second TE of 160 msec (3600/ 50, 160) with two excitations. The matrix size was 256 x 128, with a field of view of 32 to 37cm, yielding an effective in-plane resolution of 1.25 to 1.44mm by 2.5 to 2.9mm. A body coil was used for both transmission and reception. Section thickness was 8mm with a 1mm gap. Respiratory and flow compensation was used routinely. The average imaging time for double-echo T2 weighted sequences was 15 minutes and 30 seconds. Images were obtained in the axial and sagital planes with phase encode direction anterior to posterior.
On the Fast spin echo (FSE) sequences to obtain both moderately and heavily T2-weighted images the following standard scan parameters were used; a repetition time (TR) of 3000 msec, a first echo time (TE) of 40 msec and a second TE of 110 msec (3000/ 40, 110) with two excitations, and a turbo factor of 16. The matrix size was 256 x 256, with a field of view of 32 to 37cm, yielding an effective in-plane resolution of 1.25 to 1.44mm. A body coil was used for both transmission and reception. Section thickness was 8mm with a 1mm gap. Respiratory and flow compensation was used routinely. The average imaging time for double-echo FSE T2 weighted sequences was 5 minutes. Images were obtained in the axial and sagital planes. Images were obtained with phase encoding in both the right to left and anterior to posterior directions on the axial plane and in the anterior to posterior direction in the sagital plane. Table 1 summarizes all sequences examined.
Image Analysis
The phantom size was measured by 2 different observers on the computer monitor and on film blinded to each others results and at a date 3 months later these measure were repeated and the size measurements were analyzed for size consistency and variation. The phantom used consisted of a single width Lego block (Lego, Denmark) filled with distilled water and taped to the abdomen over the liver of each subject (figure 1). The depth (AP direction) on each image was measured by two blinded observers. The largest possible line located within the confines of the ‘lesion’ was selected. These measurements were recorded for each image from the computer monitor and the film hard copy on both moderately and heavily T2-weighted images on all cross-sections. Two measurements were recorded of the phantom at the short and long echo times and mean values and standard deviation were calculated and entered into a computer database (Microsoft Excel version 4.0, Microsoft, Bothel, WA). A total of 2 measures at 2 echo times for 5 sequences (SE in 2 planes and FSE in 2 planes and alternate phase encode direction) in 2 subjects imaged 2 times was done by 2 observers on 2 media (film and computer) yielding a total of 320 measures of the phantom were obtained. Table 2 shows the measures for the later echo of all the sequences with phase encoding in the anterior to posterior direction.
Figure 2
Statistical Analysis
All measurements were collected and analyzed for inter and intra reader variation, and variation between sequences, imaging planes, and between the phase encoding directions with paired t-test analysis [3]. Analysis was performed between; observers, media, time, imaging plane, sequence type, echo time, phase encode direction and subject.
Results
Analysis of variation between readers demonstrated no significant difference in size measurements based on film or monitor measurements, between sequences, or imaging planes. The mean measurement for phantom size was 8.5 mm with a standard deviation of 0.5mm. Measurements were reproducible in that there was no statistically significant difference between values calculated on two separate occasions with a standard error of the mean of less than 0.1mm. Paired t-test analysis demonstrated no significant difference between; observers, media, time, imaging plane, sequence type, echo time, phase encode direction and subject (p>0.05). Results are summarized in table 3. Note that low p values were noted between observers on the film measures on sagital SE and axial TSE sequences-suggesting a meaningful difference or bias on film measures, or a spurious statistical phenomena.
Discussion
It is well recognized that the combined use of moderately and heavily T2-weighted MR images (double-echo T2-weighted images) is valuable for liver lesion characterization [4,5, 6, 7, 8, 9]. Compared with hemangiomas and cysts, malignant hepatic tumors retain less signal intensity on heavily T2-weighted images than on moderately T2-weighted MR images. The use of double-echo T2-weighted MR images for lesion characterization has been described, not only with conventional spin-echo imaging, but with breathing averaged and breath-hold fast spin echo sequences, half-fourier single-shot turbo spin-echo (HASTE) sequences and echo-planar imaging [10, 11,12,13, 14].
Visual analyses of MR signal characteristics are inherently subjective, and are prone to uncertainty and inconsistency as recent work has documented [2]. Lesion characterization using calculated T2 relaxation times derived from double-echo T2-weighted sequences has been shown to be more accurate than other quantitative methods (lesion-to-liver contrast-to-noise ratios, lesion-liver, lesion-fat, lesion-muscle signal-intensity ratios) at 1.5-T field strengths [1, 15]. Furthermore, such measurements have been shown to be superior to subjective evaluations of both lesion brightness and morphology [1, 2, 16].
When attempting to differentiate benign from malignant liver lesions on MR imaging, observers combine a visual assessment of many variables including lesion brightness, lesion morphology, and patterns of contrast enhancement. The value of quantitative measures of T2 are that they provide objective evidence of lesion type and help reduce the uncertainty and inconsistency associated with visual inspection. In equivocal cases with T2 values between 90 and 130 msec serial studies may be needed to more accurately classify a given liver lesion. Until the accuracy of lesion size change is obtained certainty about tumor growth can be limited, hence the motivation for this study.
It is in this void that we attempt to establish measurement error so that certainty about lesion growth is improved for liver lesions identified on MRI. In this study, double-echo T2-weighted images were acquired using SE and FSE sequences alone and lesion size variation was measured. In summary we recommend that at least 1 mm change (that is 2 standard deviations) in size be used as a criteria to diagnose lesion growth or regression when measured on T2W abdominal MRI. It is worthy to note that low p values between observers on the film measures on sagital SE and axial TSE sequences may imply an element of bias on film measures, consequently lesion measures are more reproducible and reliable when done on the computer monitor.
Correspondence to
Richard Tello MD, MSME,MPH
Professor of Radiology, Epidemiology and Biostatistics
Boston University
Department of Radiology
88 East Newton St, Atrium 2
Boston, MA 02118
617-414-3736
F:617-638-6616
Email:tello@alum.mit.edu