Article Text

## Abstract

**Background/aims** We previously reported that the visual field (VF) prediction model using the variational Bayes linear regression (VBLR) is useful for accurately predicting VF progression in glaucoma (Invest Ophthalmol Vis Sci. 2014, 2018). We constructed a VF measurement algorithm using VBLR, and the purpose of this study was to investigate its usefulness.

**Method** 122 eyes of 73 patients with open-angle glaucoma were included in the current study. VF measurement was performed using the currently proposed VBLR programme with AP-7700 perimetry (KOWA). VF measurements were also conducted using the Swedish interactive thresholding algorithm (SITA) standard programme with Humphrey field analyser. VF measurements were performed using the 24–2 test grid. Visual sensitivities, test–retest reproducibility and measurement duration were compared between the two algorithms.

**Result** Mean mean deviation (MD) values with SITA standard were −7.9 and −8.7 dB (first and second measurements), whereas those with VBLR-VF were −8.2 and −8.0 dB, respectively. There were no significant differences across these values. The correlation coefficient of MD values between the 2 algorithms was 0.97 or 0.98. Test–retest reproducibility did not differ between the two algorithms. Mean measurement duration with SITA standard was 6 min and 02 s or 6 min and 00 s (first or second measurement), whereas a significantly shorter duration was associated with VBLR-VF (5 min and 23 s or 5 min and 30 s).

**Conclusion** VBLR-VF reduced test duration while maintaining the same accuracy as the SITA-standard.

- field of vision
- glaucoma

## Data availability statement

Data are available on reasonable request.

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.

## Statistics from Altmetric.com

## Introduction

Visual field (VF) testing is undoubtedly essential for the management of glaucoma. The Swedish interactive thresholding algorithm (SITA) standard programme1 in the Humphrey field analyser (HFA, Carl Zeiss Meditec, Dublin, USA) has been widely used to measure VF since the 1980s. The main advancement compared with the former Full Threshold programme is the reduction of test duration without losing measurement quality by continuously updating the iterative maximum posterior probability estimation of the threshold using a Bayesian prior model and interrupting testing at each tested location at predetermined levels of test certainty. As a result, it was reported that the measurement duration was by up to 50% shorter2 and test–retest reproducibility was diminished with the SITA standard programme, probably due to the suppression of patient fatigue.3 SITA fast is a further rapid algorithm that presents the starting stimuli closer to the expected thresholds and interrupts stimulus staircases at an earlier stage by increasing error-related factor cut-off.4 However, this allows for accepting poorer accuracy of test results and indeed, a previous study suggested that SITA fast had better test–retest reproducibility3 and poorer measurement accuracy compared with the SIAT standard.5 SITA faster is an even newer algorithm that enables fastest VF measurement in the SITA family by adjusting the initial intensity for age, omitting some reversals in thresholding, not providing false-negative (FN) results and also eliminating blind spot catch trials.6 There is one previous study by the inventors of the SITA family which suggested that SITA faster has a similar level of test–retest reproducibility compared with SITA fast.6 Shortening of the VF measurement duration has obvious merits; it reduces the burden for patients, avoids inaccurate VF measurement due to fatigue, and enables clinical facilities to perform VF measurement in more patients at a given time.7 However, test duration and accuracy are usually in a trade-off relation, and rapid measurement is often achieved by sacrificing accuracy. There is no VF measurement algorithm which is faster than SITA standard without losing test accuracy, to the best of our knowledge.

We previously constructed a model to VF progression prediction model using variational Bayes linear regression (VBLR).8 9 VBLR is a method for predicting VF progression considering the spatial and temporal patterns of VF damage using the variational Bayes statistic. Prediction accuracy was far better compared with the conventional ordinary least squares linear regression model, such as mean deviation (MD) trend analysis. The possible usefulness of this model is not only for VF progression prediction, but also for VF measurement. VBLR can accurately predict VF sensitivities during VF measurement; each time VF sensitivity is decided, the prediction of VF sensitivities at remaining test points can be updated using VBLR. This accurate prediction of VF sensitivity is expected to avoid showing redundant target presentation by optimising initial intensity in bracketing, and by interrupting the thresholding by applying a cut-off value of the likelihood of estimated VF sensitivity value. We constructed this VF test algorithm using VBLR (VBLR-VF) and validated its usefulness in patients with glaucoma. As a result, we achieved a shorter VF measurement time without losing test–retest reproducibility.

## Method

Written consent was given by the patients for their information to be stored in the hospital database and used for research. This study was performed according to the tenets of the Declaration of Helsinki.

### Subjects

This study included the 122 eyes of 73 patients with primary open-angle glaucoma (POAG), followed up at the Tokyo University Hospital. POAG was diagnosed when the following findings were present: (1) presence of typical glaucomatous changes in the optic nerve head (ONH), such as a rim notch with a rim width ≦0.1 disc diameters or vertical cup-to-disc ratio of >0.7 and/or a retinal nerve fibre layer defect with its edge at the ONH margin greater than a major retinal vessel, diverging in an arcuate or wedge shape, confirmed by a panel of glaucoma specialists (HM and RA) after inspecting photographs of the stereofundus; (2) presence of glaucomatous VF defects compatible with glaucomatous ONH changes, thus fulfilling Anderson-Patella’s criterion10 and (3) absence of other systemic or ocular disorders, including cataract, except for clinically insignificant senile cataract, shallow peripheral anterior chamber that could affect ONH and VF, and history of intraocular surgery or refractive surgery except for uneventful intraocular lens implantation. All patients were at least 20 years old.

### VF measurement with SITA standard

The SITA standard measurement was carried out using the HFA 24–2 test and standard Goldmann III stimulus size.

### VF measurement with VBLR-VF

VF measurement algorithm using VBLR was implemented in a perimetry by KOWA(AP-7700).

The VF measurement algorithm procedure is as follows: The prior model and starting stimulus intensities were calculated as the first-time measurement in each measurement in this investigation. In other words, this experiment was conducted as the initial VF measurement, and hence data from each patient’s past VF measurement were not used.

#### Prior model

The Gaussian mixture distribution was set with VBLR using the dataset consisting of the 7268 eyes of 4278 subjects from the University of Tokyo Hospital.8 9 The Gaussian mixture distribution was given by:

where t is the total deviation values of the 52 test points with the 24–2 test grid, K is the number of the Gaussian distribution, is the mixing coefficient in each Gaussian distribution, , , is the average in each Gaussian distribution, and is the precision matrix in each Gaussian distribution.

#### Starting stimulus intensities

In the case of a patient in whom VF measurement was carried out N times before, the VF measurement sequence was begun at by equation 1 calculated with VBLR using the former N times results. In the case of a patient in whom VF measurement was carried out for the first time, the VF measurement sequence was begun at the level decided by the age-corrected normative database.

#### Strategy

Full Threshold strategy was used, that is, 4 dB until the first reversal, followed by 2 dB steps until the second reversal.

#### Model update through measurement

The VF measurement procedure was started by measuring the sensitivity at four primary points, one in each quadrant of the field at 12.7° from the point of fixation. After measuring the primary points, the Gaussian mixture distribution was updated when each point’s sensitivity was decided. The Gaussian mixture distribution was divided into the points which the sensitivity was decided and the points which the sensitivity was not decided. The divided Gaussian mixture distribution was given by:

where a is the points which the sensitivity is not decided, b is the points the sensitivity was decided, is updated the mixing coefficient in each Gaussian distribution, , , Σ is the variance-covariance matrix in each Gaussian distribution.

#### Update starting stimulus intensities

The points which were not begun the VF measurement sequence were updated the starting stimulus intensities with by equation 2. By predicting appropriate sensitivities, the number of stimulus presentations could be reduced at most points.

#### Estimate sensitivities

When the sensitivity at the point was considered to have been estimated to sufficient accuracy with by equation 2, the VF measurement at the point was terminated. The level of accuracy was decided by the age-corrected normative database variance. The appropriate estimation could be reduced the number of stimulus presentations with a marginal decrease of measurement accuracy.

These VF measurements with HFA SITA standard and VBLR were conducted twice within 3 months period. The prior model and the starting stimulus intensities were calculated as the first-time measurement in each measurement in the current study.

Only reliable VFs were used in the analyses, defined as a fixation loss rate <33 %, a false-positive rate <33% and an FN rate <33%.

### Statistical analysis

These values were compared between the VF measured with HFA SITA standard and VBLR-VF; (1) MD, (2) Pattern SD (PSD), (3) pointwise VF sensitivity and (4) measurement duration, using the linear mixed model where the random effect was patients. The linear mixed model is equivalent to ordinary linear regression in that the model describes the relationship between the predictor variables and a single outcome variable. However, standard linear regression analysis makes the assumption that all observations are independent of each other. Measurements were nested within subjects and also test points in this study; hence, dependent of each other. Ignoring this grouping of the measurements will result in the underestimation of standard errors of regression coefficients. The linear mixed model adjusts for the hierarchical structure of the data, modelling in a way in which measurements are grouped within subjects to reduce the possible bias derived from the nested structure of data.11 12

With VBLR-VF, the MD and PSD were calculated as follow, similarly to HFA SITA standard.10

n : The number of the tested points.

: The test result at the tested point i (dB).

： The age-corrected normal value at the tested point i (dB).

: The age-corrected SD at the tested point i (dB).

In addition, the test–retest reproducibility of the two VF measurements were compared using 52 VF sensitivity values through the root mean squared error (RMSE) statistic, defined as follows:

All analyses were performed using the statistical programming language ‘R’ (R V.3.1.3; The Foundation for Statistical Computing, Vienna, Austria).

## Results

Thirty-four of the 73 patients were male. Patients’ age was 62.5±10.5 (39–84) (mean±SD, SD, (range)) years old. 62 in the 122 eyes were right eyes.

The MD value with the SITA standard was −7.9±6.2 (mean±SD, (range: −26.7 to 0.6]) in the first measurement and −8.7±6.2 (−27.3 to 0.3) dB in the second measurement, as shown in table 1. MD values with the VBLR-VF were −8.2±6.4 (−25.6 to 1.4) and −8.0±6.2 (−24.9 to 1.6) dB for the first and second measurements, respectively. There was no significant difference between these four MD values (p>0.05, linear mixed model with the adjustment for multiple comparisons using the Tukey’s test). Figure 1 illustrates the relationship among MD values recorded with the SITA standard and the VBLR-VF. There were strong correlations in MD values between the first and second HFA SITA standard tests (correlation coefficient=0.97, p<0.0001, figure 1A) and also for the first and second VBLR tests (correlation coefficient=0.97, p<0.0001, figure 1B). In addition, there was a significant correlation between the MD values of the first HFA SITA standard tests and the first VBLR tests (correlation coefficient=0.97, p<0.0001, figure 1C) and the MD values between their second tests (correlation coefficient=0.98, p<0.0001, figure 1D).

The PSD value with the HFA SITA standard test was 8.5±4.9 (mean±SD, (range: 1.1–16.0)) in the first measurement and 8.6±4.8 (1.1 to 16.6) dB in the second measurement, as shown in table 1. PSD values with the VBLR-VF were 8.7±4.8 (0.9 to 16.1) and −8.6±4.8 (0.9 to 16.2) dB for the first and second measurements, respectively. There was no significant difference between these four PSD values (p>0.05, linear mixed model). Figure 2 illustrates the relationship among PSD values recorded with the HFA and the VBLR. There were strong correlations in PSD values between the first and second HFA tests (correlation coefficient=0.97, p<0.0001, figure 2A) and also for the first and second VBLR-VF tests (correlation coefficient=0.98, p<0.0001, figure 2B). In addition, there was significant correlation between the PSD values of the first HFA SITA standard test and the first VBLR-VF test (correlation coefficient=0.96, p<0.0001, figure 2C) and the PSD values between their second tests (correlation coefficient=0.97, p<0.0001, figure 2D).

Table 1 shows the pointwise VF sensitivities in each measurement. The pointwise VF sensitivity values with the HFA SITA standard test was 21.8±11.2 (mean±SD, (range: 0–39)) in the first measurement and 21.0±11.2 (0–35) dB in the second measurement. Pointwise VF sensitivity values with the VBLR-VF were 21.0±11.4 (0–41) and 21.0±11.2 (0–40) dB for the first and second measurements, respectively. There was no significant difference in the pointwise VF sensitivity values between the first HFA SITA standard test and the second HFA SITA standard test, and also between the first VBLR-VF test and the second VBLR-VF test (p>0.05, linear mixed model with the adjustment for multiple comparisons using the Tukey’s test), however, there was a significant difference in the pointwise VF sensitivity values between the first HFA SITA standard test and the first VBLR-VF test, and also between the second HFA SITA standard test and the second VBLR-VF test (p>0.05, linear mixed model with the adjustment for multiple comparisons using the Tukey’s test). Figure 3A,B shows the test–retest agreements with SITA standard test and also VBLR-VF test in a fashion similar to that in Artes *et al*.3 With both methods, precision decreased with increasing deficit. All of the pointwise VF sensitivities with SITA standard (first), SITA standard (second), VBLR-VF (first) and VBLR-VF (second) were significantly correlated to each other (all correlation coefficient=0.92, p<0.0001). These relationships are illustrated in figure 3C,D.

The RMSEs as a function of MD values of SITA standard test (first and second measurements were averaged) were illustrated in figure 4A. There was no significant difference in the RMSE values between SITA standard test and VBLR-VF test (p=0.68, linear mixed model). This was also the case when RMSEs were compared between the two methods in each of early (MD with SITA standard ≥6 dB), moderate (MD with SITA standard between −6 and −12 dB) and advanced stage (MD with SITA standard ≤12 dB)13 (p=0.68, 0.66 and 0.53, respectively, linear mixed model).

The measurement duration with the HFA SITA standard test was 6 min and 2 s±1 min 6 s (3 min 57 s to 8 min 34 s) in the first measurement and 6 min 2 s±1 min 2 s (4 min to 8 min 27 s) in the second measurement. A significantly shorter measurement duration was observed for the VBLR-VF test: 5 min 23 s±1 min 32 s (2 min 49 s to 8 min 51 s) and 5 min 30 s±1 min 26 s (2 min 58 s to 8 min 44 s) for the first and second measurements, respectively (p<0.001). This was also the case when the measurement durations were compared between the two methods in each of early (MD with SITA standard ≥6 dB), moderate (MD with SITA standard between −6 and −12 dB) and advanced stage (MD with SITA standard ≤12 dB)13 (all p<0.001 except for eyes in advanced stage in the second measurement: p=0.013, linear mixed model). The measurement duration was illustrated in the relationship to the MD values of SITA standard test (first and second measurements were averaged), figure 4B.

## Discussion

VF measurement was conducted in this study, using a novel algorithm of VBLR-VF, and the measurement results were compared with the SITA standard programme with HFA, in eyes with POAG. As a result, the measured sensitivity values were very closely related between the VBLR-VF and the SITA standard methods. More specifically, MD and PSD values had the correlation coefficient values between 0.97 and 0.99 (figures 1 and 2), and there were no significant differences in the MD and PSD values between SITA standard and VBLR-VF (table 1). The pointwise sensitivities were also very closely related between SITA standard and VBLR-VF with the correlation coefficient of 0.92. There was a significant difference in the pointwise sensitivities between SITA standard and VBLR-VF, however, there was no systematic trend which algorithm is associated with higher sensitivity than another in the repeated measurements (VF sensitivity was higher and lower with the SITA standard in the first and second measurement, respectively). The RMSE value of the test–retest reproducibility was not significantly different between the two methods. In contrast, the measurement duration was significantly shorter with the VBLR method by 30 or 40 s. The reduction of the VF measurement duration was observed across the disease status, as illustrated in figure 4B.

Since the algorithm of SITA was introduced in HFA in 1980s, despite efforts, there has been no method which succeeded to shorten the testing duration without losing the accuracy. The currently proposed VBLR-VF algorithm achieved significantly faster VF measurement in the initial VF measurement, compared with SITA standard. Of note, it was not associated with the increase of the test–retest reproducibility, unlike other SITA families, such as fast and faster.3 5 6 This is probably because SITA fast and faster attempted a faster measurement simply by omitting some reversals in the thresholding and also false negative test, whereas VBLR-VF avoided redundant target presentation by accurately predicting threshold at each test point using VBLR. In the bracketing thresholding method, the time required to determine the threshold becomes long when the luminance of the initial target presentation is far different from the threshold, because a larger number of target presentations are needed. In SITA standard, thresholds of the adjacent test points are predicted using the maximum likelihood estimation derived from a database of glaucomatous and normative VFs, so that the initial luminance level is optimisation. In addition, in SITA standard, thresholding is even terminated earlier using the Bayesian posterior probability of the threshold. The currently proposed VBLR-VF algorithm takes similar approach with SITA standard that the optimisation of the initial luminance level and early termination are conducted, so that VF measurement duration becomes short. In addition, both SITA standard and VBLR predict VF sensitivities using the Bayes statistic. In VBLR, the prediction of VF sensitivity was given, considering the spatial and temporal patterns of the VF damage and also the correlation across the test points.8 9 As a result, we have reported that very accurate prediction of future VF progression can be made; the prediction accuracy when predicting 10th future VF even with the initial 1 or 2 VFs using the VBLR was almost identical to that with 9 VFs using ordinary linear regression. In the current study, this accurate prediction also enabled significantly faster measurement of VF compared with SITA standard.

Attempts have been made to shorten the VF measurement also in perimetries other than HFA. The German Adaptive Thresholding Estimation (GATE) in Octopus perimetry (Haag-Streit, Switzerland) is one of such examples, in which the initial target luminance is decided considering the patient’s past VF.14 15 A previous study suggested that this approach resulted in identical or slightly poorer (where VF sensitivity is high) test–retest reproducibility with GATE compared with SITA standard, with a shorter measurement time (approximately 16% reduction: 4.7 min with GATE and 5.6 min with SITA standard in average). This shortening effect was slightly more than the currently proposed VBLR-VF algorithm shortened approximately 11 or 8% compared with SITA standard. However, a merit of the VBLR-VF algorithm over the GATE programme is that previous VF measurement is not needed, and can be applied to even the initial VF measurement. In addition, the advantage of GATE measurement may be questionable when very long period, such as 5 or 10 years, has passed from the prior VF measurement, because of the possible large discrepancy between the past and current VFs. The current experiment was conducted without using patients’ past VF data, assuming an initial VF measurement in each patient. In other words, more accurate prediction would be given, when the patient’s past VF data is also utilised in the VF sensitivity prediction using VBLR. It would be of interest to investigate in future whether VBLR-VF can shorten VF measurement no less than GATE programme under this condition. There are other programmes in Octopus perimetry, such as Dynamic strategy and Tendency Oriented Perimetry programme. These were developed aiming at fast VF measurement, by varying the luminance interval in the bracketing according to the eccentricity from fovea (Dynamic strategy) or performing target presentation only once at each test point and inferring adjacent test points’ thresholds.16 These measurements are known to be shorter than SITA standard measurement; however, the measurement accuracies have been reported to be optimal.17–20 Inaccurate measurement of VF sensitivity is not only problematic in assessing the disease status of glaucoma, but also in detecting progression, because detection of progression is delayed associated with larger variability of VF measurement, as reported by Jansonius.21 The zippy estimation by sequential testing (ZEST)22–24 is another VF measurement algorithm which has been reported as possibly useful to measurement accuracy compared with SITA, but not the measurement duration.25 A further merit of ZEST is that it is available for use on an Octopus perimeter through the Open Perimetry Initiative, where the source code is also available, unlike VBLR-VF. A study would be needed to compare the measurement accuracy and duration between the currently proposed VBLR-VF and ZEST in future.

Jansonius reported that the detection of progression does not only rely on the accuracy (variability), but also the frequency of VF measurement; progression detection is delayed when less frequently measured.21 SITA-standard algorithm enabled much faster VF measurement compared with its Full Threshold algorithm predecessor, however previous studies have reported that it is difficult enough for busy clinics to carry out central VF testing with sufficient frequency.26 27 Recent studies revealed the importance of VF measurement in the central 10 degrees,28–32 such as HFA 10–2 test, has further postulated this aspect. Faster VF measurement without losing the measurement accuracy, such as with the VBLR-VF algorithm, would be needed to resolve the problem of the burden of the VF measurement to clinical facilities as well. Thus the possible merit of the faster VF measurement with fast VF measurement is not only in the reduction of the patients’ burden and prevention of fatigue effect. As shown in the current study, the measurement duration with VBLR-VF was significantly faster than that of SITA standard, however, the difference was merely less than 1 min which is relatively small compared with the shortening effect with SITA fast or faster, and may not be sufficient to prevent patients’ fatigue considerably. One of the possible reasons would be that the current study was conducted assuming the first VF measurement for each eye. A further reduction of the measurement can be expected when VF measurement is conducted thereafter, because VF prediction in VBLR can be much more accurate when the previous VF record of the patient is used. A further study should be conducted shedding light on this issue.

There are several limitations to this study. As VBLR can predict patients’ VFs accurately when past VF data are used, nonetheless this information was not used in this study. This was because the current study was conducted assuming the initial VF measurement in each patient, however a further faster VF measurement may be achieved through more accurate prediction in VBLR with the patient’s previous VF data, which should be investigated in a future study. In addition, current study was conducted using the HFA 24–2 test grid. The usefulness of this approach should be evaluated with the HFA 10–2 test grid.

In conclusion, we developed a novel VF measurement algorithm of VBLR-VF. This programme shortened VF measurement by 30 or 40 s, compared with the SITA standard programme without losing measurement accuracy.

## Data availability statement

Data are available on reasonable request.

## Ethics statements

### Patient consent for publication

### Ethics approval

This study was approved by the Research Ethics Committee of the Graduate School of Medicine and Faculty of Medicine at the University of Tokyo.

## References

## Footnotes

Contributors RA, HM and SS researched literature and conceived the study. RA, HM and NS were involved in protocol development and gaining ethical approval. All authors were involved in patient recruitment and data analysis. RA wrote the first draft of the manuscript. All authors reviewed and edited the manuscript and approved the final version of the manuscript.

Funding Supported in part by Japan Science and Technology Agency (JST) CREST (RA, HM), Grants 00768351 (MM), 20 768 254 (YF), 25 861 618 (HM) and 19H01114, 18KK0253, and 26 462 679 (RA) from the Ministry of Education, Culture, Sports, Science and Technology of Japan, grant from the University of Tokyo Gap Fund ProgramProgramme (no grant number), and grants from Suzuken Memorial Foundation (no grant number) and Taiju Life Social Welfare Foundation (Mitsui Life Social Welfare Foundation) (no grant number).

Competing interests None declared.

Provenance and peer review Not commissioned; externally peer reviewed.

## Linked Articles

- At a glance