Introduction

Intraocular pressure (IOP) is currently the only modifiable risk factor for glaucoma, a disease that is the second leading cause of blindness worldwide.1, 2 As IOP reduction is the mainstay of treatment, accurate IOP assessment is important in monitoring the efficacy of therapy and for assessing the risk of glaucomatous progression.3, 4, 5, 6 Goldmann applanation tonometry (GAT) is currently the clinical gold standard for IOP assessment.7 Despite its widespread use, GAT has several limitations, including confounding related to corneal thickness, calibration errors, and concerns related to contamination and sanitation.8, 9, 10, 11

The non-contact ocular response analyser (ORA: Reichert Ophthalmic Instruments, Buffalo, NY, USA) provides a corneal compensated IOP measurement (IOPcc) that avoids several of the imitations of GAT.7, 8, 10, 12, 13, 14, 15 The ORA has received attention in the areas of glaucoma, corneal disorders, and refractive surgery. It also provides a Goldmann correlated IOP (IOPg) value and a waveform score (WS) designed to guide the clinician in selecting high quality measurements. Although the determination of a more accurate IOP assessment is appealing to eye care professionals, the thought of abandoning GAT is concerning due to issues of continuity of care. As clinicians looking to move towards corneal compensated IOP measurement in glaucoma management will likely desire a period of transition, it is important to first determine how well IOPg approximates GAT. This study sought to evaluate the agreement of GAT and IOPg in a population of patients under clinical evaluation for glaucoma. Furthermore, this study sought to determine the value of the WS and other variables in determining which measurements best approximate GAT.

Methods

Approval for this study was obtained from the Institutional Review Board at Weill Cornell Medical College and New York Presbyterian Hospital. A total of 518 eyes of 260 consecutive patients over 18 years of age undergoing corneal hysteresis (CH) measurement as part of glaucoma evaluation from June 2009 to August 2009 at Weill Cornell Medical College were selected for inclusion in this retrospective cross-sectional study. Using the same criteria, 100 eyes of 52 patients were included in a separate second data set that was used to confirm results. Primary clinical diagnoses were obtained from retrospective chart review.

All subjects underwent ORA measurement directly followed by ophthalmic examination including GAT. For each ORA assessment, measurements were obtained until a WS of 6.5 was obtained or until five measurements were obtained per eye. As no published recommendations exist for choosing measurements based on WS, 6.5 was arbitrarily chosen as a cutoff score in this study based on office protocol. Topical anaesthetic and fluorescein was placed in each eye before GAT, which was performed immediately after ORA evaluation per office protocol. The GAT used for the study was calibrated monthly for errors of ±0.5 mm Hg using the manufacturer's standard check weight and protocol.16 GAT measurements were performed by one of three ophthalmologists. As the ORA assessment was performed primarily to measure CH (which was the only ORA-generated value documented in the patient record), the examiner measuring GAT was masked to IOPg during IOP assessment.

Charts for all study patients were reviewed for demographic and ocular data recorded on the day of ORA assessment. The most recent central corneal thickness (CCT) measurements obtained by ultrasound pachymetry (DGH Technology, Exton, PA, USA) and 24-2 SITA-standard automatic perimetry with the Humphrey Field Analyser II (Carl Zeiss Meditec, Dublin, CA, USA) results were obtained by chart review. The visual field index mean deviation was recorded from the most recent field test for each eye.

Statistical analyses were performed using SPSS statistical package (SPSS Inc., Chicago, IL, USA) and Stata/11 IC (StataCorp LP, College Station, TX, USA). Paired t-tests and simple linear regression analyses were performed. Multiple linear regression was used to validate the modelled relationship between GAT and IOPg on a second data set. To test the null hypothesis that the regression lines from the two data sets were equal, we built a multiple linear model including an indicator variable corresponding to each patient's data set of origin and allowing for interaction between data set of origin and mean IOP in predicting the difference between GAT and IOPg. The indicator variable tests equality of intercepts and the interaction term tests equality of slopes between the two lines.17 A Bland–Altman plot was constructed to determine agreement between GAT and IOPg assessments. Linear regression was used to calculate regression-based limits of agreement. All statistical tests were two-sided, with a 0.05 level for statistical significance.

Results

Patient characteristics, including primary glaucoma diagnoses, are presented in Table 1. To determine whether WS can be used to predict concordance of IOPg and GAT measurements, we performed simple linear regression using all ORA assessments conducted during the study period. Patients underwent ORA measurement 2.3 times per eye on average. Increasing WS was found to predict a smaller absolute value difference between GAT and IOPg (β=−0.2, 95% CI=−0.2, −0.1, P0.001). The highest WS was then selected from among all assessments of each eye. Simple linear regression was again used to model the relationship between highest WS and absolute value difference between GAT and IOPg. When eyes were selected based on highest WS, WS continued to predict concordance between GAT and IOPg (β=−0.2, 95% CI −0.3, 0.0, P=0.006). The mean WS for all ORA assessments was 5.7 (±2.5 (SD)), whereas the mean WS was 7.5 (±1.6 (SD)) when the highest score was selected for each eye. On the basis of the regression equation when all ORA assessments were considered, an increase in WS from 0 (lowest possible score) to 10 (highest possible score) predicts a decrease in variance between GAT and IOPg of only 1.6 mm Hg.

Table 1 Patient characteristics

As WS may be a good predictor of ORA assessment accuracy, we conducted analyses of GAT–IOPg concordance using data from the ORA evaluation with the best WS for each eye examined. The 518 eyes examined had a mean IOP of 15.6 mm Hg (±4.3 (SD)) by GAT and 15.5 mm Hg (±5.4 (SD)) by IOPg. The mean IOP difference between the two methods was 0.1 mm Hg, which was found to be a statistically insignificant difference by paired t-test analysis (P=0.391). Figure 1 uses cutoff values to illustrate the percentage of eyes within defined limits of variation between GAT and IOPg. GAT and IOPg measurements varied by 2 mm Hg or less in 53.9% of eyes and by a maximum of 5 mm Hg in 92.3% of eyes.

Figure 1
figure 1

Percent of eyes in which GAT and IOPg measurements vary by given pressure. GAT and IOPg measurements vary by 2 mm Hg or less in 53.9% of eyes and by 5 mm Hg or less in 92.3% of eyes.

As variation between GAT and IOPg differed significantly as a function of IOP (P<0.001), regression-based limits of agreement were calculated. The mean difference between GAT and IOPg values was 0.1 mm Hg with 95% limits of agreement from −0.2 to 0.4. To determine if differences between GAT and IOPg values were also related to levels of CCT, CH and/or age, the difference between GAT and IOPg was regressed on each of these independent variables. Although mean IOP (P<0.001) and CCT (P<0.001) were significantly associated with the difference between GAT and IOPg measurements, CH (P=0.596) and age (P=0.874) were not (Table 2). A Bland–Altman plot showed moderate variation between GAT and IOPg (Figure 2).

Table 2 Simple linear models to predict GAT–IOPg difference
Figure 2
figure 2

Bland–Altman plot of GAT and IOPg. The Bland–Altman plot assesses agreement of GAT and IOPg for measurement of IOP. Mean difference=0.1 (95% CI −0.2, 0.4); reference range for difference −6.0 to 6.2. Pitman's test P-value 0.001.

IOPg was shown to overestimate GAT at pressures above 16 mm Hg and underestimate GAT at pressures below this cutoff (regression equation: (GAT–IOPg)=3.941−0.246 × (mean IOP measurement), P0.001). Table 3 illustrates the expected bias in IOP measurement at several levels of IOP. Of note, the mean IOP in this study was close to 15 mm Hg and at this pressure there is an estimated bias of approximately 0.3 mm Hg (±0.25 (95% limits of agreement)) between methods.

Table 3 Estimates for differences between GAT and IOPg at several IOP levels

Using a second data set of 100 eyes, we sought to validate the relationship modelled between GAT and IOPg in this study. Simple linear regression using the second data set resulted in the following equation: (GAT−IOPg)=3.667−0.242 × (mean IOP measurement). Multiple linear regression was used to test for coincidence of the regression lines from the two data sets. There was no statistically significant association of the data set indicator variable (equality of line intercepts; P=0.524) or the data set mean IOP interaction term (equality of line slopes; P=0.659) with the difference between GAT and IOPg measurements; this indicates that the regression lines of the two data sets were not significantly different.

Discussion

In this clinical study of IOP measurement in a population of patients under evaluation for glaucoma, IOP assessments with GAT and IOPg values showed good agreement on most measures and were not significantly different upon paired analysis. The difference between GAT and IOPg values, however, was significantly associated with mean IOP. Thus, the magnitude and direction of bias between GAT and IOPg values was found to vary as a function of IOP. We successfully validated the modelled relationship between GAT and IOPg using a second distinct data set.

The WS was a significant predictor of agreement between the two measures; however, the magnitude of the predicted decrease in variance between GAT and IOPg using a higher WS was small. Although this result supports the use of WS in selecting the most accurate ORA measurement during a given measurement session, it does not provide a specific cutoff value for ORA measurement quality, as IOPg and GAT were closely related across the entire spectrum of WS scores.

In order for two measures to show a strong agreement, they need to be reliable. Some past research suggests that IOPg is a reliable measurement of IOP and its repeatability coefficient is comparable with that of GAT.18 The intrasession repeatability of GAT and IOPg measurements have generally been found to be within about 2 mm Hg.18, 19, 20, 21, 22, 23, 24 In addition, inter-examiner variability or bias is an issue that is theoretically eliminated with the use of an objective assessment of IOP such as ORA-generated IOPg that would eliminate the need for masked IOP assessment protocols in the research setting. In a population-based epidemiologic study, Dielemans et al19 investigated GAT test–retest reliability and found a mean intra-observer variation of 1.64 mm Hg and inter-observer variation of 1.79 mm Hg. Thorburn22 investigated inter-examiner variability of GAT measurements by two experienced ophthalmologists and found a difference of 2 mm Hg or more in 40% of eyes and 3 mm Hg or more in 17% of eyes examined. Similarly, Phelps and Phelps21 reported a difference between the GAT measurements of two examiners of at least 2 mm Hg in 50% of eyes and 3 mm Hg in 30% of eyes. Accordingly, the measurement variability between IOPg and GAT in this study is similar to previously reported inter-examiner variability of GAT measurements.

Tonnu et al25 studied the reproducibility of several IOP measurement techniques by comparing GAT with Tono-Pen tonometry, ocular blood flow tonometry, non-contact tonometry (NCT) and a second Goldmann observer. The authors presented estimates of IOP differences between tonometry methods at various levels of IOP, as in the current investigation. Of note, at 15 mm Hg—approximately the mean IOP in the current study—estimated GAT–IOPg agreement in this study was superior to agreement between GAT and all other IOP measurement techniques considered by Tonnu et al. Moreover, this study determined a mean difference of 0.1 mm Hg between GAT and IOPg values, compared with 0.7 mm Hg between GAT and NCT in the study by Tonnu et al.25 In addition, in two separate studies, Jorge et al27 demonstrated a difference of 0.1 mm Hg between GAT and NCT values among normal26 and glaucomatous subjects using the same NCT machine. However, using a different NCT machine (from the same manufacturer) to study the same cohort of normal subjects, the investigators reported a mean GAT–NCT difference of 0.8 mm Hg.26 Thus, it is clear that IOP measurement agreement will depend greatly on the protocol and equipment used.

A growing literature regarding CH measurement has demonstrated a potentially expanding role for the ORA in ophthalmic management. CH is lower in glaucoma damage, acquired optic nerve pitting, and progressive glaucoma.2, 15, 28, 29, 30, 31 In glaucoma patients with low hysteresis, there is greater backward bowing of the lamina cribrosa in response to transient IOP elevation.32 Furthermore, CH measurements may be useful in identifying patients with keratoconus and post-LASIK ectasia, as well as for monitoring biomechanical corneal properties before and after keratorefractive procedures.14, 33, 34, 35, 36 By compensating for CH and other properties, the ORA produces a corneal compensated IOP. Although CH is moderately correlated with CCT, CH and corneal compensated IOP measurements are less affected by corneal oedema than are GAT measurements.33, 37, 38 Furthermore, IOPcc may be one of the most appropriate measures of IOP before and after keratorefractive procedures and in eyes with keratoconus or corneal oedema.14, 33, 34, 35, 36, 39 In light of these and other findings, the ORA is an appealing tool for the measurement of IOP and ocular biomechanical characteristics.

It is important to note that the observed bias between GAT and IOPg could affect clinical management. This may be particularly significant for patients with borderline IOP, as IOPg values tend to exceed GAT at higher pressures in this study. Nevertheless, the bias detected between GAT and IOPg is similar to some previously reported inter- and intra-examiner GAT biases.19, 21, 22 Accordingly, the variation between IOPg and GAT may be acceptable to clinicians who are accustomed to this sort of variability.40 Future work should examine the test–retest repeatability of IOPg to determine if it provides a more repeatable measurement of IOP.

Given the likely redundancy of measuring both IOPg and GAT, the exclusive use of the ORA might offer several advantages to the clinician. In addition to measuring hysteresis and providing several objective measures of IOP, the ORA does not require the use of fluorescein and topical anaesthetic, nor does the instrument contact the patient's cornea or require sterilization (although a forehead guide is used). However, as physicians consider adapting to new technologies several issues arise. Clinical gold standards, such as GAT, have considerable value in the context of clinical history and continuity of care even with known inaccuracies. Furthermore, the majority of clinical research data still uses GAT measurements. By providing both IOPg and IOPcc measurements, the ORA offers clinicians a way to transition to the ORA only while still maintaining an estimate of GAT measurements through IOPg. As the inter-test variability of IOPg–GAT measurements is similar to test–retest variability of GAT, use of IOPg could help to maintain continuity of care during the adoption of this new technology without entirely abandoning the Goldmann standard.