The interobserver agreement in the discrimination of nonprogressors from progressors was assessed by Kappa (
κ) statistics. The strength of agreement was categorized according to the method of Landis and Koch
19: 0 = poor, 0 to 0.20 = slight, 0.21 to 0.40 = fair, 0.41 to 0.60 = moderate, 0.61 to 0.80 = substantial, and 0.81 to 1.00 = almost perfect. The intersession RNFL and GCIPL thickness measurements as well as the GCIPL defect's angular width and areal measurements were compared by one-way ANOVA. The long-term reproducibility of macular GCA was assessed by the standard error of measurement, which is a measure of the within-subject variation or SD of values that would have been obtained from a single individual if tested multiple times.
20 This, the within-subject standard deviation (
Sw), is the same as the test–retest standard deviation (TRTSD), and is calculated as the square root of the within-subject mean square for error.
21 Additionally, in the present evaluation, the intraclass correlation coefficient (ICC), coefficient of variation (CV), and tolerance limit were determined to assess the reproducibility. The ICC, the ratio of the subject variance to the total variance (within-subject variance + between-subject variance), summarizes the reproducibility of a measurement process for a given group of subjects.
1 The ICC ranges from theoretical 0 to 1: a higher ICC (close to 1) value indicates smaller fluctuations among repeated measurements for the same subjects. The ICC cutoff value of 0.70 is considered acceptable, but greater values mean better reproducibility. The CV (%) is calculated as 100 ×
Sw /overall mean, and a result less than 10% is considered to represent good reproducibility. The tolerance limit was defined as 1.96 ×
Display Formula × intervisit
Sw.
21 The Mann-Whitney
U test was run to compare the CVs and TRTSDs between the subject groups. To assess any relation of the present reproducibility with the degree of glaucomatous damage, the associations between severity of glaucomatous damage (represented by visual field mean deviation [MD], mean average GCIPL thickness, and mean average RNFL thickness), and TRTSD of GCIPL and RNFL maps were examined by linear regression analysis. The correlation coefficient (
r2) was used to demonstrate the strength of association between variables. In addition, a logistic regression analysis was performed for the same purpose after classifying patients into low and high thickness reproducibility groups according to the median TRTSD values of the average GCIPL and RNFL thicknesses. The statistical analyses were performed using SPSS version 21.0 for Windows (IBM Corp., Armonk, NY, USA). Statistical significance was defined as
P less than 0.05.