Evaluating hippocampal internal architecture on MRI: Inter-rater reliability of a proposed scoring system

详细信息查看全文

作者：Lawrence W. Ver Hoef ; A. LeBron Paige ; Kristen O. Riley ; Joel Cure ; Mehdi Soltani ; Frank B. Williams ; Richard E. Kennedy ; Jerzy P. Szaflarski ; Robert C. Knowlton
关键词：HIA ; hippocampal internal architecture ; HS ; hippocampal sclerosis ; TLE ; temporal lobe epilepsy ; IRR ; inter-rater reliability
刊名：Epilepsy Research
出版年：2013
出版时间：September, 2013
年：2013
卷：106
期：1-2
页码：146-154
全文大小：1513 K

文摘

| Figures/TablesFigures/Tables | ReferencesReferencesversion=""1.0"" encoding=""UTF-8""?>

Summary

Background

Asymmetry of hippocampal internal architecture (HIA) has been reported to be a frequent imaging finding in epilepsy patients with temporal lobe epilepsy (TLE) who exhibit other signs of hippocampal sclerosis. HIA asymmetry may also be an independent predictor of the side of seizure onset in patients with otherwise normal MRI scans. The study of HIA asymmetry and its relationship to the laterality of TLE would benefit from a reliable method of assessing the clarity of HIA in MRI scans. We propose a visual scoring system that rates HIA clarity from 1 (imperceptible) to 4 (excellent) and report the inter-rater reliability (IRR) of this system.

Methods

In the initial preliminary phase of this study we examined IRR using a kappa statistic (¦Ê) among a mixed group of expert and non-expert reviewers using only a brief description of the scoring system to score single images from a series of patients. In the second phase we explored the effect of training on the use of our HIA scoring system by assessing IRR among neuroimaging experts before and after a brief interactive training session. In this phase, multiple slices from each patient were scored. Separate ¦Ê values and intraclass correlation coefficients (ICC) were calculated from the scores given to each hippocampal image and from the asymmetry of scores between left and right for each slice. In the third phase the effect of training on non-expert reviewers was explored using a similar approach as with the expert reviewers.

Results

In the preliminary phase of the study, HIA scoring of single images showed substantial agreement among expert reviewers (¦Ê_HIA = 0.65), fair agreement among non-expert reviewers (¦Ê_HIA = 0.27), and a fair to moderate degree of agreement among all the reviewers as a whole (¦Ê_HIA = 0.40). In the second phase, prior to training there was substantial agreement among expert reviewers in regard to the individual HIA scores (¦Ê_HIA = 0.62; ICC_HIA = 0.81) but only moderate agreement on the degree of asymmetry (¦Ê_Asym = 0.47; ICC_Asym = 0.71). Training improved agreement on the individual HIA scores (¦Ê_HIA = 0.58-0.72; ICC_HIA = 0.76-0.84) and on the degree of asymmetry (¦Ê_Asym = 0.61-0.67; ICC_Asym = 0.81-0.85). Among non-expert reviewers, scores improved from only a fair degree of agreement pre-training (¦Ê_HIA = 0.25, ¦Ê_Asym = 0.25; ICC_HIA = 0.68, ICC_Asym = 0.66) to a moderate level of agreement after training (¦Ê_HIA = 0.54, ¦Ê_Asym = 0.52; ICC_HIA = 0.78, ICC_Asym = 0.81).

Conclusions

The proposed HIA scoring system has a substantial degree of inter-rater reliability among experienced neuroimaging reviewers. Training improves the detection of asymmetries in HIA score in particular. Non-expert reviewers can employ the system with a moderate degree of reliability, and training has an even greater impact on the improvement of scoring reliability.

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700