Skip to main content
Advertisement

Main menu

  • Home
  • Content
    • Current Issue
    • Accepted Manuscripts
    • Article Preview
    • Past Issue Archive
    • AJNR Case Collection
    • Case of the Week Archive
    • Classic Case Archive
    • Case of the Month Archive
  • Special Collections
    • Spinal CSF Leak Articles (Jan 2020-June 2024)
    • 2024 AJNR Journal Awards
    • Most Impactful AJNR Articles
  • Multimedia
    • AJNR Podcast
    • AJNR Scantastics
    • Video Articles
  • For Authors
    • Submit a Manuscript
    • Author Policies
    • Fast publishing of Accepted Manuscripts
    • Graphical Abstract Preparation
    • Manuscript Submission Guidelines
    • Imaging Protocol Submission
    • Submit a Case for the Case Collection
  • About Us
    • About AJNR
    • Editorial Board
  • More
    • Become a Reviewer/Academy of Reviewers
    • Subscribers
    • Permissions
    • Alerts
    • Feedback
    • Advertisers
    • ASNR Home
  • Other Publications
    • ajnr

User menu

  • Alerts
  • Log in

Search

  • Advanced search
American Journal of Neuroradiology
American Journal of Neuroradiology

American Journal of Neuroradiology

ASHNR American Society of Functional Neuroradiology ASHNR American Society of Pediatric Neuroradiology ASSR
  • Alerts
  • Log in

Advanced Search

  • Home
  • Content
    • Current Issue
    • Accepted Manuscripts
    • Article Preview
    • Past Issue Archive
    • AJNR Case Collection
    • Case of the Week Archive
    • Classic Case Archive
    • Case of the Month Archive
  • Special Collections
    • Spinal CSF Leak Articles (Jan 2020-June 2024)
    • 2024 AJNR Journal Awards
    • Most Impactful AJNR Articles
  • Multimedia
    • AJNR Podcast
    • AJNR Scantastics
    • Video Articles
  • For Authors
    • Submit a Manuscript
    • Author Policies
    • Fast publishing of Accepted Manuscripts
    • Graphical Abstract Preparation
    • Manuscript Submission Guidelines
    • Imaging Protocol Submission
    • Submit a Case for the Case Collection
  • About Us
    • About AJNR
    • Editorial Board
  • More
    • Become a Reviewer/Academy of Reviewers
    • Subscribers
    • Permissions
    • Alerts
    • Feedback
    • Advertisers
    • ASNR Home
  • Follow AJNR on Twitter
  • Visit AJNR on Facebook
  • Follow AJNR on Instagram
  • Join AJNR on LinkedIn
  • RSS Feeds

Welcome to the new AJNR, Updated Hall of Fame, and more. Read the full announcements.


AJNR is seeking candidates for the position of Associate Section Editor, AJNR Case Collection. Read the full announcement.

 

Research ArticleBrain
Open Access

Automated White Matter Total Lesion Volume Segmentation in Diabetes

J.A. Maldjian, C.T. Whitlow, B.N. Saha, G. Kota, C. Vandergriff, E.M. Davenport, J. Divers, B.I. Freedman and D.W. Bowden
American Journal of Neuroradiology December 2013, 34 (12) 2265-2270; DOI: https://doi.org/10.3174/ajnr.A3590
J.A. Maldjian
aFrom Advanced Neuroscience Imaging Research (ANSIR) Laboratory (J.A.M., C.T.W., B.N.S., G.K., C.V., E.M.D.)
bDepartment of Radiology (J.A.M., C.T.W., B.N.S., G.K., C.V.)
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
C.T. Whitlow
aFrom Advanced Neuroscience Imaging Research (ANSIR) Laboratory (J.A.M., C.T.W., B.N.S., G.K., C.V., E.M.D.)
bDepartment of Radiology (J.A.M., C.T.W., B.N.S., G.K., C.V.)
cTranslational Science Institute (C.T.W., B.N.S.)
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
B.N. Saha
aFrom Advanced Neuroscience Imaging Research (ANSIR) Laboratory (J.A.M., C.T.W., B.N.S., G.K., C.V., E.M.D.)
bDepartment of Radiology (J.A.M., C.T.W., B.N.S., G.K., C.V.)
cTranslational Science Institute (C.T.W., B.N.S.)
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
G. Kota
aFrom Advanced Neuroscience Imaging Research (ANSIR) Laboratory (J.A.M., C.T.W., B.N.S., G.K., C.V., E.M.D.)
bDepartment of Radiology (J.A.M., C.T.W., B.N.S., G.K., C.V.)
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
C. Vandergriff
aFrom Advanced Neuroscience Imaging Research (ANSIR) Laboratory (J.A.M., C.T.W., B.N.S., G.K., C.V., E.M.D.)
bDepartment of Radiology (J.A.M., C.T.W., B.N.S., G.K., C.V.)
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
E.M. Davenport
aFrom Advanced Neuroscience Imaging Research (ANSIR) Laboratory (J.A.M., C.T.W., B.N.S., G.K., C.V., E.M.D.)
dDepartment of Biomedical Engineering (E.M.D.)
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
J. Divers
ePublic Health Sciences (J.D.)
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
B.I. Freedman
fDepartment of Internal Medicine, Section on Nephrology (B.I.F.)
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
D.W. Bowden
gCenter for Diabetes Research and Human Genomics (D.W.B.), Wake Forest School of Medicine, Winston-Salem, North Carolina.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Article
  • Supplemental
  • Info & Metrics
  • Responses
  • References
  • PDF
Loading

Abstract

BACKGROUND AND PURPOSE: WM lesion segmentation is often performed with the use of subjective rating scales because manual methods are laborious and tedious; however, automated methods are now available. We compared the performance of total lesion volume grading computed by use of an automated WM lesion segmentation algorithm with that of subjective rating scales and expert manual segmentation in a cohort of subjects with type 2 diabetes.

MATERIALS AND METHODS: Structural T1 and FLAIR MR imaging data from 50 subjects with diabetes (age, 67.7 ± 7.2 years) and 50 nondiabetic sibling pairs (age, 67.5 ± 9.4 years) were evaluated in an institutional review board–approved study. WM lesion segmentation maps and total lesion volume were generated for each subject by means of the Statistical Parametric Mapping (SPM8) Lesion Segmentation Toolbox. Subjective WM lesion grade was determined by means of a 0–9 rating scale by 2 readers. Ground-truth total lesion volume was determined by means of manual segmentation by experienced readers. Correlation analyses compared manual segmentation total lesion volume with automated and subjective evaluation methods.

RESULTS: Correlation between average lesion segmentation and ground-truth total lesion volume was 0.84. Maximum correlation between the Lesion Segmentation Toolbox and ground-truth total lesion volume (ρ = 0.87) occurred at the segmentation threshold of k = 0.25, whereas maximum correlation between subjective lesion segmentation and the Lesion Segmentation Toolbox (ρ = 0.73) occurred at k = 0.15. The difference between the 2 correlation estimates with ground-truth was not statistically significant. The lower segmentation threshold (0.15 versus 0.25) suggests that subjective raters overestimate WM lesion burden.

CONCLUSIONS: We validate the Lesion Segmentation Toolbox for determining total lesion volume in diabetes-enriched populations and compare it with a common subjective WM lesion rating scale. The Lesion Segmentation Toolbox is a readily available substitute for subjective WM lesion scoring in studies of diabetes and other populations with changes of leukoaraiosis.

ABBREVIATIONS:

DM
type 2 diabetes mellitus
LST
Lesion Segmentation Toolbox
SPM8
Statistical Parametric Mapping
TLV
total lesion volume
GT
ground-truth

Leukoariaosis is a common WM pathologic lesion in older adults, characterized histologically by demyelination, loss of oligodendrocytes, and vacuolization resulting from small-vessel ischemia of the WM.1 On brain MR imaging, these lesions are commonly termed WM hyperintensities and appear as regions of increased signal on T2-weighted and FLAIR sequences. Increases in WM disease burden have been associated with risk factors, such as hypertension, type 2 diabetes mellitus (DM), and tobacco use.2 Quantifying WM disease burden in the brain is important because it is an accurate and sensitive predictor of future stroke, dementia, and cognitive decline.3⇓⇓–6

Clinical evaluation of WM disease has been limited to subjective interpretation of disease burden, with typical modifiers including “few scattered lesions” and “mild,” “moderate,” or “severe” applied in radiologic reporting. This is because of the onerous and time-consuming task of manual delineation of WM lesion burden and the lack of robust automated tools for quantitative WM lesion grading. A semi-quantitative visual rating scheme was developed for use with large epidemiologic studies involving brain MR imaging. This WM hyperintensity grading scale is commonly used for research studies and is based on visual assessment by an experienced reader with the use of a semi-quantitative 10-point (0–9) scale with predefined methodology.7,8

A variety of automated methods for WM lesion quantification, involving combinations of thresholding, segmentation, prior information, lesion growing algorithms, and, most recently, machine learning algorithms, have been used in research studies of multiple sclerosis. It is beyond the scope of this work to describe all the developments in WM lesion segmentation. Rather, we focus on a recently described software tool for automated WM lesion segmentation, the Lesion Segmentation Toolbox (LST); it was developed for use in the Statistical Parametric Mapping (SPM8) environment, which is familiar to most neuroimaging researchers, is freely distributed open source code written in Matlab (MathWorks, Natick, Massachusetts), and is easily implemented/fully automated. Additionally, the LST uses widely available structural T1-weighted and FLAIR images for performing WM lesion segmentation. The LST was developed for use in multiple sclerosis and originally evaluated in a group of 52 subjects with multiple sclerosis and 18 control subjects, achieving excellent agreement with manual tracing (R2 values of 0.93).9 A critical user-determined parameter in the LST procedure is the k-threshold, which was determined to be 0.3 in a multiple sclerosis population.

The purpose of our study is to compare the performance of total lesion volume (TLV) grading computed by means of the LST automated WM lesion segmentation algorithm to expert manual segmentation in a cohort of subjects with DM and to determine the optimum k-threshold in this population. A secondary objective is to compare the LST TLV with semi-quantitative subjective rating scales. Our hypothesis is that TLV computed with the LST will perform at least as well as subjective rating scales when compared with ground-truth (GT) TLV in DM. This report answers the important question of whether an automated toolbox developed for use in multiple sclerosis can be used reliably for grading WM lesion burden in populations with a different pathophysiologic mechanism for development of WM disease.

Materials and Methods

Subjects

The Diabetes Heart Study is a genetic and epidemiologic study of 1443 European American and African American participants from 564 families with multiple cases of DM.10,11 The Diabetes Heart Study–Mind is an extension of the Diabetes Heart Study family of studies and examines the genetic and brain imaging contributors to cognitive changes associated with DM. The study includes diabetes- and nondiabetes-affected siblings. All subjects provided written informed consent, and study protocols were approved by the blinded institutional review board. MR imaging studies from 100 subjects were randomly selected from the Diabetes Heart Study-Mind. These included 50 subjects with DM (27 women, 23 men), 52% smokers, mean ± standard deviation (SD) age of 67.7 ± 7.2 (age range, 52–84 years), body mass index of 32.3 ± 7.1, and hemoglobin A1C of 7.6 ± 1.48; and 50 siblings without DM (35 women, 15 men), 46% smokers, mean ± SD age of 67.5 ± 9.4 (age range, 43–89 years), body mass index of 28.5 ± 6.5, and hemoglobin A1C of 5.9 ± 0.31.

MR Imaging

Participants from the Diabetes Heart Study-Mind were scanned on a 1.5T scanner with twin-speed gradients, with the use of an 8-channel neurovascular head coil Twin Speed EXCITE; GE Healthcare, Milwaukee, Wisconsin. High-resolution T1 anatomic images were obtained by means of a 3D spoiled gradient-echo sequence (matrix, 256 × 256; field of view, 20 cm; section thickness, 1.5 mm with no gap; number of sections, 124; in-plane resolution, 0.781 × 0.781 mm) aligned parallel to the anterior/posterior commissures (anterior/posterior commisure line). FLAIR images were acquired in the axial plane for the purpose of evaluating WM hyperintensities (TR = 8002, TE = 108.5, TI = 2000, flip angle = 90, 24 cm FOV, matrix size = 256 × 256 [0.94 × 0.94 mm], 3-mm section thickness).

Semi-Quantitative WM Rating Scale

WM hyperintensity signal changes of each individual were assessed independently by 2 board-certified neuroradiologists by means of a semi-quantitative 10-point (0–9) scale with predefined methodology.7,8 WM hyperintensity burden was estimated as the total extent of periventricular and subcortical white matter FLAIR signal hyperintensity that successively increases from no or barely detectable changes (grades 0 and 1, respectively) to almost all WM involved (grade 9). This scale has an inter-reader reliability agreement within 1 grade of 85.7%, with relaxed κ of 0.8, and intrareader reliability for agreement within 1 grade of 96.9%, with relaxed κ of 0.96.8

Image Preprocessing

The structural T1-weighted images were segmented into gray matter, WM, and CSF, normalized to Montreal Neurological Institute imaging space, and modulated with the Jacobian determinants of the normalization procedure to obtain tissue volume maps by use of the Dartel high-dimensional warping and the SPM8 (Wellcome Department of Imaging Neuroscience, London, UK)12 new segment procedure, as implemented in the VBM8 toolbox (http://dbm.neuro.uni-jena.de/vbm.html). In addition to normalized images in Montreal Neurological Institute imaging space, the procedure outputs native space segmentations and a native space partial volume estimate label image of the most likely tissue class for each voxel. The quality of the segmentation and normalization for all subjects was confirmed by visual inspection.

WM Lesion Segmentation

WM lesion segmentation and TLV maps were generated by use of the LST9 for SPM8 at 20 thresholds (k), ranging from 0–1 at 0.05 increments. The algorithm operates in native space and initially coregisters the FLAIR images to the space of the native T1. Each voxel in the T1 image is assigned to 1 of 3 classes (gray matter, WM, CSF) by use of the VBM8 toolbox as described above (partial volume estimate label map in native space). The FLAIR intensity distribution is calculated for each of the 3 classes to determine outliers, weighted according to the spatial probability of being WM, resulting in 3 classes of belief maps, and summed to generate a single belief map. A binarized version of the gray matter lesion map is used to seed a region growing algorithm with the summed belief map as the target. The user-selected k-threshold is used as the cutoff to generate the initial gray matter seed binarized belief map. The algorithm outputs WM lesion segmentations for each k threshold, as well as a table of total lesion volume (20 TLV values corresponding to each of the 20 k-thresholds).

GT Segmentation

A multi-tiered approach was used to generate the reference GT lesion volume segmentation. All stages involved expert manual segmentation by board-certified neuroradiologists. Two raters (each with 1–2 years of neuroradiology experience) initially manually segmented all the white matter lesions independently by use of in-house software. This generated 2 independent lesion volume datasets. The 2 datasets were merged by use of a binary union procedure to generate a single dataset representing the combination of all lesions identified by both raters. Two highly experienced neuroradiologists (with >15 years and >3 years of experience, respectively) then together performed a consensus reading, reviewing all lesions in the combination dataset. The consensus reading session was performed by use of MRIcron (http://www.sph.sc.edu/comd/rorden/mricro.html)13 with direct overlay onto the original FLAIR images and included the ability to manually add, remove, and edit the borders of all lesions. This final consensus lesion volume dataset served as the GT for the study.

Statistical Analysis

Correlation and regression analyses were performed for 3 main comparisons: subjective WM lesion scores versus GT, LST versus GT for each k-threshold, and subjective WM lesion scores versus LST for each k threshold. These analyses were repeated by use of log (TLV) to account for nonlinearity in the relationship between TLV and subjective scores. The optimum k-threshold was determined as that providing the maximum correlation value compared with manual segmentation. Between-group and inter-reader comparisons were also performed for the subjective WM scores and correlations with GT. Fisher r-to-z transformation14 was used to test whether observed correlations were statistically different from zero. Steiger Z-test of correlated correlations15 was used to determine if there was a significant difference in correlation values between subjective rating with GT and LST with GT. Bland-Altman plots were also computed between LST versus GT and LST versus subjective WM lesion scores to determine the mean difference between methods.

Results

Subjective WM Lesion Scores

Mean WM score for the group was 2.0, with an SD of 1.5. The distribution of WM scores ranged from 0–7, with most values falling in the lower range of 0–3 (On-line Fig 1). Between-reader agreement was 88% within 1 grade, similar to that reported in the literature for this method.8 There was no statistically significant difference in subjective WM scores between groups, with mean ± SD of 2.0 ± 1.3 for DM-affected and 2.0 ± 1.6 for non-DM groups.

Subjective WM Lesion Scores Versus GT

The Pearson correlation between average WM scores and GT was 0.84, with reader 1 having a correlation of 0.79 and reader 2 a correlation of 0.82. The correlation between average WM scores and GT was 0.82 and 0.85 for DM-affected and non-DM groups, respectively. With the use of the logarithm of GT, the correlations with average WM scores improved to 0.85 for the entire cohort (0.77 for reader 1 and 0.86 for reader 2) and 0.845 for the DM-affected group and 0.86 for the non-DM group. All reported correlation values were statistically significant (P ≤ .0001).

LST Versus GT

On-line Fig 2 is a graph of the LST k-threshold versus correlation with GT for the entire cohort. Correlation values ranged from 0.62–0.81 for the thresholds considered. Maximum correlation for the full sample was 0.87, corresponding to k = 0.25. For the non-DM group, maximum correlation was 0.94 observed at k = 0.15. For the DM-affected group, the maximum correlation was 0.83, corresponding to k = 0.25. The GT correlation performance of the LST was not significantly different from subjective WM scores (Steiger Z = 0.8, P < .4 for raw GT and Z = 1.4, P < .15 for the log-transformed data).

LST Versus Subjective WM Lesion Scores

Correlation values ranged from 0.59–0.65 for the threshold values considered. Maximum correlation was 0.74, which was observed when k = 0.15 in the full sample (On-line Fig 2). For the non-DM group, maximum correlation was 0.76, which was observed at the same threshold of k = 0.15. For the DM-affected group, maximum correlation was 0.74, corresponding to a k-threshold of 0.2. The threshold value of 0.25 provided correlations of 0.73, 0.74, and 0.74 for the full sample, the non-DM group, and the DM-affected group, respectively. Correlation values were similar with the use of log-transformed values.

GT Manual Segmentation

Paired Spearman correlation between manual segmentation raters, the final consensus GT volume, and the LST were all very high (ρ = 0.91 between first and second rater, ρ = 0.96 between first rater and GT, ρ = 0.98 between second rater and GT, ρ = 0.92 between first rater and LST, ρ = 0.8 between second rater and LST, and ρ = 0.87 between GT and LST), with P < .001 for all comparisons. Although the inter-rater correlations for manual segmentation were high, mean ± SD (median) of lesion volumes reported in units of milliliters were more widely spaced between raters: GT and LST with 2.43 ± 3.97 (0.86) for rater 1, 3.65 ± 5.69 (1.2) for rater 2, 4.47 ± 6.48 (1.83) for GT, and 2.45 ± 4.59 (0.38) for the LST, which suggests high inter-rater variability. The GT mean ± SD TLV for the full sample was 4.47 ± 6.48. The distribution of WM TLV was heavily skewed toward values <1 mL (On-line Fig 3), with a minor secondary peak at 10 mL. The mean ± SD TLV for the DM-affected group was 4.42 ± 4.62. Mean ± SD TLV for the non-DM group was 4.52 ± 7.13. There was no statistically significant difference in TLV between groups. On-line Fig 4 is a plot of GT TLV versus LST TLV at a k-threshold of 0.25, demonstrating a strong linear relationship (R2 = 0.76). On-line Fig 5 is a plot of GT TLV versus average WM scores demonstrating a nonlinear relationship for the subjective scale. To perform nonlinear fitting of these data, the WM scores were remapped from 0–9 to 1–10, and 2 data points with zero GT TLV were excluded. Logarithmic, polynomial, and power law relationships all provided better fits than a linear fit. The highest R2 (0.72) was provided by a logarithmic fit. Plotting WM scores versus log(GT TLV) demonstrated a strong linear relationship (R2 = 0.73).

Discussion

The LST was originally developed and evaluated for multiple sclerosis. Here, we validate its use for TLV measurement in subjects with DM, relative to subjective rating scales. The pathophysiologic mechanisms leading to visible MR imaging WM changes in DM are very different from those in multiple sclerosis. In multiple sclerosis, the prototypic hallmark is focal demyelination with varying degrees of gliosis and inflammation.16 The MR imaging FLAIR appearance is typically that of focal well-demarcated round or ovoid lesions. In contrast, WM lesions in the elderly, or leukoaraiosis, tend to be more diffuse, with pathophysiology related to endothelial dysfunction and development of small-vessel ischemia.1,17⇓–19 In this regard, diabetic populations provide an important validation of the LST methodology that can potentially be extended to other populations in which WM lesion burden relates to microvascular disease. Additionally, we examined a relatively large cohort of non-DM siblings, providing a validation in a normal elderly population.

LST Versus Subjective WM Lesion Scores

We demonstrate the LST to be comparable to subjective WM scores for determining severity of WM lesion load in a population with DM. A high degree of correlation was observed between TLV computed by use of the LST and GT manual segmentation. LST achieved a maximum correlation of 0.87, corresponding to a k-threshold of 0.25, and appeared robust in its segmentation performance, within a range of k-thresholds from 0.2–0.4, all demonstrating similar correlations. In comparison, the subjective WM scoring demonstrated slightly weaker correlation with GT, achieving a 0.84 correlation. This improved slightly to 0.85 by use of the log of the GT TLV. Additionally, the subjective WM scores demonstrated a nonlinear relationship to GT TLV. This is not surprising because the amount of visible WM disease increases substantially over the range of scores. That is, near total involvement of the brain WM for a grade of 9 is much greater in volume than 10 times the few lesions identified for a grade of 1. The use of a logarithmic transformation of the GT TLV provided a clear linear relationship to the subjective WM lesion scores. This finding has important implications for studies that use the subjective WM lesion rating scale. Studies that use the subjective WM lesion rating scale with standard statistical regression models violate assumptions of linearity and potentially affect the validity or significance of the results. The assumptions required for use of parametric testing (eg, Pearson correlation analysis) in this evaluation were fulfilled through log transform of the data. Alternatively, nonparametric testing can be used when these assumptions are not met. Repeating these analyses by use of the Spearman rank correlation test and Kendall τ again demonstrated higher values for the LST than subjective WM scores, but the differences did not achieve significance. The Pearson correlation, however, provides a more complete assessment of the associations between variables when the underlying assumptions are attained.

The k-threshold corresponding to the maximum correlation between subjective scoring and LST was lower than that for LST and GT (0.15 versus 0.25). The lower k value in the LST corresponds to a more relaxed threshold for detection of WM lesions. This suggests that subjective ratings overestimate the true degree of WM disease. The degree of reader bias toward overestimation may be greater at the lower disease burden range, which was typical of our sample.

LST Versus GT

LST achieved a maximum R2 of 0.69 at a k-threshold of 0.25 in the sample with DM. In contrast, the LST achieved a maximum R2 of 0.94 and optimum k-threshold of 0.3 in the recent evaluation of multiple sclerosis.9 The performance difference between DM and multiple sclerosis probably is an effect of disease severity. In the multiple sclerosis evaluation, there was greater disease burden, with TLV extending to >50 mL, compared with <35 mL for our DM population. More importantly, the performance in the multiple sclerosis population improved with increasing lesion volume, ranging from a mean Dice coefficient of 0.67 for lesion volumes <5 mL to 0.85 for lesions volume >15 mL. The multiple sclerosis evaluation did not appear to have a significant number of subjects with TLV <1 mL (if any), whereas for our population, most subjects had TLV <1 mL. Thus, the difference in performance for our group probably reflects disease severity, with the LST performing less optimally at low disease burdens, rather than a difference in lesion detectability between populations.

Automated Segmentation in Diabetes

Although automated WM lesion detection is an area of active investigation, there have been very few studies validating tools in a diabetic population. Jongen et al20 described a k-nearest neighboring clustering algorithm used in a study of subjects with diabetes. de Bresser et al21 used the same algorithm in a separate study of subjects with diabetes and white matter lesion load. Tiehuis et al22 also used the same algorithm in a study of cognitive function, vascular disease and diabetes, and, in another study, showed that it performs favorably compared with subjective rating scales with reference to cognitive assessments.23 For all of these studies, performance of the described algorithm was previously evaluated by using a leave-on-out cross-validation procedure, achieving a similarity index of 0.8.24 This validation, however, was performed only on 10 elderly subjects with a history of vascular disease and not specifically on patients with diabetes. A recently described method with the use of support vector machines was evaluated on a subset of 45 subjects from a larger study on the treatment of diabetes.25 A single rater was used as the reference standard, with 10 subjects used for training and the remaining 35 for testing. This method demonstrated a high sensitivity (>0.9) and a specificity of ∼0.85. Interestingly, this study had a second manual rater but only used the first rater as the reference standard, possibly because of their reported large inter-rater variability. In comparison to these studies, our study provides a more direct evaluation of lesion segmentation in diabetes, including a large sample size, and a more rigorous approach to ground-truth determination.

WM Disease and Diabetes

The LST performance was very similar between DM-affected and unaffected individuals, achieving similar maximum correlations and optimum k-thresholds. There was no significant difference in degree of WM lesion load between affected and unaffected groups, by use of any of the metrics (WM scores, LST, or GT). Whereas there is a convincing relationship between DM and lacunar infarcts and brain atrophy, the association with WM hyperintensities on conventional MR imaging is less clear.26 There has been recent evidence suggesting that DM is an independent risk factor for deep WM lesions in the elderly.27 Additionally, voxelwise analyses of diffusion tensor imaging and fractional anisotropy have demonstrated early changes in white matter microstructural properties that were associated with diabetes duration.28 Our sample may not have been large enough to detect a between-group difference. Alternatively, the effects of aging may be a greater contributor to the presence of visible WM disease than DM. It should be noted that our DM population had very few subjects with WM scores >5, and none were >7. We drew a random sample from the Diabetes Heart Study, with WM disease burdens probably reflecting the distribution of WM disease in the general DM population. Conclusions about diabetes and WM disease should not be made on the basis of this study because we did not control for a variety of confounding effects, including disease duration, co-morbidities, and medications.

Limitations

No subject had WM scores >7 or TLV >35 mL. This limits evaluation at the extreme end of disease burden. Also, our population was skewed toward the lower end of disease burdens, with most having TLV <1 mL. Although this is a limitation in terms of disease distribution, it is also probably reflective of the true disease burden and incidence of WM disease in the general population. One critical limitation is generic to all studies of WM disease burden, which is that there is no accepted reference standard. Manual delineation by experienced readers comes closest to what most would accept as a reference standard. The laborious nature of manual WM segmentation makes it generally impractical to have multiple raters. Semi-automated methodologies to facilitate the procedure can introduce bias through predetermination of potential lesion borders. Even with multiple readers, some readers are typically more accurate (or meticulous). These limitations may be mitigated by machine learning approaches that can weight more accurate readers more heavily in the determination of GT. We attempted to address these potential biases in GT segmentation with the use of a multi-tiered manual segmentation approach with the final consensus manual segmentation performed by experienced neuroradiologists. This is a time-intensive and laborious approach but allows confidence in the quality of the GT determination.

Conclusions

We validate the use of the LST for determination of TLV in a diabetic population and demonstrate that it performs as well compared with GT as a widely used subjective WM lesion rating scale. Additionally, we identify an optimal k-threshold of 0.25, with robust performance between 0.2–0.4 in this population. The LST is a readily available substitute for subjective WM lesion scoring in studies of diabetes and other populations prone to leukoaraiosis. Studies that use subjective WM lesion scores should be cognizant of violations in assumptions of linearity for standard statistical models with the use of this scale.

Footnotes

  • Disclosures: Joseph Maldjian—RELATED: Grant: NIH NS058700 and NS0075107,* Comments: NIH R01 grant funding for data collection. Christopher Whitlow—RELATED: Grant: NIH*; UNRELATED: Grants/Grants Pending: NIH.* Clayton Vandergriff—RELATED: Grant: NIH.* Jasmin Divers—RELATED: Grant: NINDS RO1 NS075107. Barry Freedman—RELATED: Grant: NIH* (*money paid to institution).

  • This study was supported by NIH grants NS058700 (D.W.B.) and NS0075107 (J.A.M., B.I.F., J.D.).

Indicates open access to non-subscribers at www.ajnr.org

REFERENCES

  1. 1.↵
    1. Brown WR,
    2. Moody DM,
    3. Thore CR,
    4. et al
    . Apoptosis in leukoaraiosis. AJNR Am J Neuroradiol 2000;21:79–82
    Abstract/FREE Full Text
  2. 2.↵
    1. Murray AD,
    2. Staff RT,
    3. Shenkin SD,
    4. et al
    . Brain white matter hyperintensities: relative importance of vascular risk factors in nondemented elderly people. Radiology 2005;237:251–57
    CrossRefPubMed
  3. 3.↵
    1. Debette S,
    2. Markus HS
    . The clinical importance of white matter hyperintensities on brain magnetic resonance imaging: systematic review and meta-analysis. BMJ 2010;341:c3666
    Abstract/FREE Full Text
  4. 4.↵
    1. Prins ND,
    2. van Dijk EJ,
    3. den Heijer T,
    4. et al
    . Cerebral white matter lesions and the risk of dementia. Arch Neurol 2004;61:1531–34
    CrossRefPubMed
  5. 5.↵
    1. Kuller LH,
    2. Longstreth WT Jr.,
    3. Arnold AM,
    4. et al
    . White matter hyperintensity on cranial magnetic resonance imaging: a predictor of stroke. Stroke 2004;35:1821–25
    Abstract/FREE Full Text
  6. 6.↵
    1. Debette S,
    2. Beiser A,
    3. DeCarli C,
    4. et al
    . Association of MRI markers of vascular brain injury with incident stroke, mild cognitive impairment, dementia, and mortality: the Framingham Offspring Study. Stroke 2010;41:600–06
    Abstract/FREE Full Text
  7. 7.↵
    1. Manolio TA,
    2. Kronmal RA,
    3. Burke GL,
    4. et al
    . Magnetic resonance abnormalities and cardiovascular disease in older adults: the Cardiovascular Health Study. Stroke 1994;25:318–27
    Abstract/FREE Full Text
  8. 8.↵
    1. Yue NC,
    2. Arnold AM,
    3. Longstreth WT Jr.,
    4. et al
    . Sulcal, ventricular, and white matter changes at MR imaging in the aging brain: data from the Cardiovascular Health Study. Radiology 1997;202:33–39
    PubMed
  9. 9.↵
    1. Schmidt P,
    2. Gaser C,
    3. Arsic M,
    4. et al
    . An automated tool for detection of FLAIR-hyperintense white-matter lesions in multiple sclerosis. Neuroimage 2012;59:3774–83
    CrossRefPubMed
  10. 10.↵
    1. Bowden DW,
    2. Cox AJ,
    3. Freedman BI,
    4. et al
    . Review of the Diabetes Heart Study (DHS) family of studies: a comprehensively examined sample for genetic and epidemiological studies of type 2 diabetes and its complications. Rev Diabet Stud 2010;7:188–201
    PubMed
  11. 11.↵
    1. Divers J,
    2. Hugenschmidt C,
    3. Sink KM,
    4. et al
    . Cerebral white matter hyperintensity in African Americans and European Americans with type 2 diabetes. J Stroke Cerebrovasc Dis 2013;22:e46–52
    CrossRefPubMed
  12. 12.↵
    1. Ashburner J,
    2. Friston KJ
    . Voxel-based morphometry: the methods. NeuroImage 2000;11:805–21
    CrossRefPubMed
  13. 13.↵
    1. Rorden C,
    2. Karnath HO,
    3. Bonilha L
    . Improving lesion-symptom mapping. J Cogn Neurosci 2007;19:1081–88
    CrossRefPubMed
  14. 14.↵
    1. Fisher RA
    . Frequency distribution of the values of the correlation coefficient in samples from an indefinitely large population. Biometrika 1915;10:507–21
    FREE Full Text
  15. 15.↵
    1. Steiger JH
    . Tests for comparing elements of a correlation matrix. Psychol Bull 1980;87:245–51
    CrossRef
  16. 16.↵
    1. Filippi M,
    2. Rocca MA,
    3. Barkhof F,
    4. et al
    . Association between pathological and MRI findings in multiple sclerosis. Lancet Neurol 2012;11:349–60
    CrossRefPubMed
  17. 17.↵
    1. Brown WR,
    2. Moody DM,
    3. Challa VR,
    4. et al
    . Apoptosis in leukoaraiosis lesions. J Neurol Sci 2002;203–204:169–71
  18. 18.↵
    1. Grueter BE,
    2. Schulz UG
    . Age-related cerebral white matter disease (leukoaraiosis): a review. Postgrad Med J 2012;88:79–87
    Abstract/FREE Full Text
  19. 19.↵
    1. Knottnerus IL,
    2. Ten Cate H,
    3. Lodder J,
    4. et al
    . Endothelial dysfunction in lacunar stroke: a systematic review. Cerebrovasc Dis 2009;27:519–26
    CrossRefPubMed
  20. 20.↵
    1. Jongen C,
    2. van der Grond J,
    3. Kappelle LJ,
    4. et al
    . Automated measurement of brain and white matter lesion volume in type 2 diabetes mellitus. Diabetologia 2007;50:1509–16
    CrossRefPubMed
  21. 21.↵
    1. de Bresser J,
    2. Tiehuis AM,
    3. van den Berg E,
    4. et al
    . Progression of cerebral atrophy and white matter hyperintensities in patients with type 2 diabetes. Diabetes Care 2010;33:1309–14
    Abstract/FREE Full Text
  22. 22.↵
    1. Tiehuis AM,
    2. Mali WP,
    3. van Raamt AF,
    4. et al
    . Cognitive dysfunction and its clinical and radiological determinants in patients with symptomatic arterial disease and diabetes. J Neurol Sci 2009;283:170–74
    CrossRefPubMed
  23. 23.↵
    1. Tiehuis AM,
    2. Vincken KL,
    3. Mali WP,
    4. et al
    . Automated and visual scoring methods of cerebral white matter hyperintensities: relation with age and cognitive function. Cerebrovasc Dis 2008;25:59–66
    CrossRefPubMed
  24. 24.↵
    1. Anbeek P,
    2. Vincken KL,
    3. van Bochove GS,
    4. et al
    . Probabilistic segmentation of brain tissue in MR imaging. Neuroimage 2005;27:795–804
    CrossRefPubMed
  25. 25.↵
    1. Lao Z,
    2. Shen D,
    3. Liu D,
    4. et al
    . Computer-assisted segmentation of white matter lesions in 3D MR images using support vector machine. Acad Radiol 2008;15:300–13
    CrossRefPubMed
  26. 26.↵
    1. van Harten B,
    2. de Leeuw FE,
    3. Weinstein HC,
    4. et al
    . Brain imaging in patients with diabetes: a systematic review. Diabetes Care 2006;29:2539–48
    FREE Full Text
  27. 27.↵
    1. van Harten B,
    2. Oosterman JM,
    3. Potter van Loon BJ,
    4. et al
    . Brain lesions on MRI in elderly patients with type 2 diabetes mellitus. Eur Neurol 2007;57:70–74
    CrossRefPubMed
  28. 28.↵
    1. Hsu JL,
    2. Chen YL,
    3. Leu JG,
    4. et al
    . Microstructural white matter abnormalities in type 2 diabetes mellitus: a diffusion tensor imaging study. Neuroimage 2012;59:1098–105
    CrossRefPubMed
  • Received December 22, 2012.
  • Accepted after revision February 24, 2013.
  • © 2013 by American Journal of Neuroradiology
View Abstract
PreviousNext
Back to top

In this issue

American Journal of Neuroradiology: 34 (12)
American Journal of Neuroradiology
Vol. 34, Issue 12
1 Dec 2013
  • Table of Contents
  • Index by author
  • Complete Issue (PDF)
Advertisement
Print
Download PDF
Email Article

Thank you for your interest in spreading the word on American Journal of Neuroradiology.

NOTE: We only request your email address so that the person you are recommending the page to knows that you wanted them to see it, and that it is not junk mail. We do not capture any email address.

Enter multiple addresses on separate lines or separate them with commas.
Automated White Matter Total Lesion Volume Segmentation in Diabetes
(Your Name) has sent you a message from American Journal of Neuroradiology
(Your Name) thought you would like to see the American Journal of Neuroradiology web site.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Cite this article
J.A. Maldjian, C.T. Whitlow, B.N. Saha, G. Kota, C. Vandergriff, E.M. Davenport, J. Divers, B.I. Freedman, D.W. Bowden
Automated White Matter Total Lesion Volume Segmentation in Diabetes
American Journal of Neuroradiology Dec 2013, 34 (12) 2265-2270; DOI: 10.3174/ajnr.A3590

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
0 Responses
Respond to this article
Share
Bookmark this article
Automated White Matter Total Lesion Volume Segmentation in Diabetes
J.A. Maldjian, C.T. Whitlow, B.N. Saha, G. Kota, C. Vandergriff, E.M. Davenport, J. Divers, B.I. Freedman, D.W. Bowden
American Journal of Neuroradiology Dec 2013, 34 (12) 2265-2270; DOI: 10.3174/ajnr.A3590
del.icio.us logo Twitter logo Facebook logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One
Purchase

Jump to section

  • Article
    • Abstract
    • ABBREVIATIONS:
    • Materials and Methods
    • Results
    • Discussion
    • Conclusions
    • Footnotes
    • REFERENCES
  • Supplemental
  • Info & Metrics
  • Responses
  • References
  • PDF

Related Articles

  • PubMed
  • Google Scholar

Cited By...

  • Automatic brain lesion segmentation on standard magnetic resonance images: a scoping review
  • Blood-based bioenergetic profiling is related to differences in brain morphology in African Americans with Type 2 diabetes
  • APOE Genotypes Associate With Cognitive Performance but Not Cerebral Structure: Diabetes Heart Study MIND
  • Subclinical Atherosclerosis Is Inversely Associated With Gray Matter Volume in African Americans With Type 2 Diabetes
  • Effects of Type 2 Diabetes on Brain Structure and Cognitive Function: African American-Diabetes Heart Study MIND
  • Cerebral Structural Changes in Diabetic Kidney Disease: African American-Diabetes Heart Study MIND
  • Crossref (40)
  • Google Scholar

This article has been cited by the following articles in journals that are participating in Crossref Cited-by Linking.

  • White matter hyperintensities, cognitive impairment and dementia: an update
    Niels D. Prins, Philip Scheltens
    Nature Reviews Neurology 2015 11 3
  • Bayesian Model Selection for Pathological Neuroimaging Data Applied to White Matter Lesion Segmentation
    Carole H. Sudre, M. Jorge Cardoso, Willem H. Bouvy, Geert Jan Biessels, Josephine Barnes, Sebastien Ourselin
    IEEE Transactions on Medical Imaging 2015 34 10
  • Automatic Detection of White Matter Hyperintensities in Healthy Aging and Pathology Using Magnetic Resonance Imaging: A Review
    Maria Eugenia Caligiuri, Paolo Perrotta, Antonio Augimeri, Federico Rocca, Aldo Quattrone, Andrea Cherubini
    Neuroinformatics 2015 13 3
  • Association between Type 2 Diabetes Mellitus and Brain Atrophy: A Meta-Analysis
    Tianqi Zhang, Marnie Shaw, Nicolas Cherbuin
    Diabetes & Metabolism Journal 2022 46 5
  • Associations of Early Kidney Disease With Brain Magnetic Resonance Imaging and Cognitive Function in African Americans With Type 2 Diabetes Mellitus
    Barry I. Freedman, Kaycee M. Sink, Christina E. Hugenschmidt, Timothy M. Hughes, Jeff D. Williamson, Christopher T. Whitlow, Nicholette D. Palmer, Michael E. Miller, Laura C. Lovato, Jianzhao Xu, S. Carrie Smith, Lenore J. Launer, Joshua I. Barzilay, Robert M. Cohen, Mark D. Sullivan, R. Nick Bryan, Benjamin C. Wagner, Donald W. Bowden, Joseph A. Maldjian, Jasmin Divers
    American Journal of Kidney Diseases 2017 70 5
  • Cerebral Structural Changes in Diabetic Kidney Disease: African American–Diabetes Heart Study MIND
    Kaycee M. Sink, Jasmin Divers, Christopher T. Whitlow, Nicholette D. Palmer, S. Carrie Smith, Jianzhao Xu, Christina E. Hugenschmidt, Benjamin C. Wagner, Jeff D. Williamson, Donald W. Bowden, Joseph A. Maldjian, Barry I. Freedman
    Diabetes Care 2015 38 2
  • Accuracy and reproducibility of automated white matter hyperintensities segmentation with lesion segmentation tool: A European multi-site 3T study
    Federica Ribaldi, Daniele Altomare, Jorge Jovicich, Clarissa Ferrari, Agnese Picco, Francesca Benedetta Pizzini, Andrea Soricelli, Anna Mega, Antonio Ferretti, Antonios Drevelegas, Beatriz Bosch, Bernhard W. Müller, Camillo Marra, Carlo Cavaliere, David Bartrés-Faz, Flavio Nobili, Franco Alessandrini, Frederik Barkhof, Helene Gros-Dagnac, Jean-Philippe Ranjeva, Jens Wiltfang, Joost Kuijer, Julien Sein, Karl-Titus Hoffmann, Luca Roccatagliata, Lucilla Parnetti, Magda Tsolaki, Manos Constantinidis, Marco Aiello, Marco Salvatore, Martina Montalti, Massimo Caulo, Mira Didic, Núria Bargallo, Olivier Blin, Paolo M Rossini, Peter Schonknecht, Piero Floridi, Pierre Payoux, Pieter Jelle Visser, Régis Bordet, Renaud Lopes, Roberto Tarducci, Stephanie Bombois, Tilman Hensch, Ute Fiedler, Jill C. Richardson, Giovanni B. Frisoni, Moira Marizzoni
    Magnetic Resonance Imaging 2021 76
  • Cognitive impairment and hippocampal atrophy in chronic kidney disease
    C.-Y. Chang, C.-C. Lin, C.-F. Tsai, W.-C. Yang, S.-J. Wang, F.-H. Lin, J.-L. Fuh
    Acta Neurologica Scandinavica 2017 136 5
  • Diagnostic value of MRS-quantified brain tissue lactate level in identifying children with mitochondrial disorders
    Roelineke J. Lunsing, Kim Strating, Tom J. de Koning, Paul E. Sijens
    European Radiology 2017 27 3
  • Analysis of the relationships between type 2 diabetes status, glycemic control, and neuroimaging measures in the Diabetes Heart Study Mind
    Laura M. Raffield, Amanda J. Cox, Barry I. Freedman, Christina E. Hugenschmidt, Fang-Chi Hsu, Benjamin C. Wagner, Jianzhao Xu, Joseph A. Maldjian, Donald W. Bowden
    Acta Diabetologica 2016 53 3

More in this TOC Section

  • Statin Therapy Does Not Affect the Radiographic and Clinical Profile of Patients with TIA and Minor Stroke
  • Optimal MRI Sequence for Identifying Occlusion Location in Acute Stroke: Which Value of Time-Resolved Contrast-Enhanced MRA?
  • SWI or T2*: Which MRI Sequence to Use in the Detection of Cerebral Microbleeds? The Karolinska Imaging Dementia Study
Show more Brain

Similar Articles

Advertisement

Indexed Content

  • Current Issue
  • Accepted Manuscripts
  • Article Preview
  • Past Issues
  • Editorials
  • Editors Choice
  • Fellow Journal Club
  • Letters to the Editor

Cases

  • Case Collection
  • Archive - Case of the Week
  • Archive - Case of the Month
  • Archive - Classic Case

Special Collections

  • Special Collections

Resources

  • News and Updates
  • Turn around Times
  • Submit a Manuscript
  • Author Policies
  • Manuscript Submission Guidelines
  • Evidence-Based Medicine Level Guide
  • Publishing Checklists
  • Graphical Abstract Preparation
  • Imaging Protocol Submission
  • Submit a Case
  • Become a Reviewer/Academy of Reviewers
  • Get Peer Review Credit from Publons

Multimedia

  • AJNR Podcast
  • AJNR SCANtastic
  • Video Articles

About Us

  • About AJNR
  • Editorial Board
  • Not an AJNR Subscriber? Join Now
  • Alerts
  • Feedback
  • Advertise with us
  • Librarian Resources
  • Permissions
  • Terms and Conditions

American Society of Neuroradiology

  • Not an ASNR Member? Join Now

© 2025 by the American Society of Neuroradiology All rights, including for text and data mining, AI training, and similar technologies, are reserved.
Print ISSN: 0195-6108 Online ISSN: 1936-959X

Powered by HighWire