Article Text


Measurement of stray light and glare: comparison of Nyktotest, Mesotest, stray light meter, and computer implemented stray light meter
  1. L J van Rijn1,
  2. C Nischler2,
  3. D Gamer3,
  4. L Franssen4,
  5. G de Wit4,
  6. R Kaper1,
  7. D Vonhoff1,
  8. G Grabner2,
  9. H Wilhelm3,
  10. H J Völker-Dieben1,
  11. T J T P van den Berg4
  1. 1Departement of Ophthalmology, Vrije Universiteit Medical Centre, Amsterdam, Netherlands
  2. 2Landesklinik für Augenheilkunde und Optometrie, Salzburg, Austria
  3. 3Universitäts-Augenklinik, Tübingen, Germany
  4. 4The Netherlands Ophthalmic Research Institute, Amsterdam, Netherlands
  1. Correspondence to: Dr L J van Rijn Vrije Universiteit Medical Centre, PO Box 7057, NL-1007 MB Amsterdam, Netherlands;


Aim: To evaluate the properties of devices for measuring stray light and glare: the Nyktotest, Mesotest, “conventional” stray light meter and a new, computer implemented version of the stray light meter.

Methods: 112 subjects, divided in three groups: (1) young subjects without any eye disease; (2) elderly subjects without any eye disease, and (3) subjects with (early) cataract in at least one eye. All subjects underwent a battery of glare and stray light tests, measurement of visual acuity, contrast sensitivity, refraction, and LOCS III cataract classification. Subjects answered a questionnaire on perceived disability during driving.

Results: Repeatability values were similar for all glare/stray light tests. Validity (correlation with LOCS III and questionnaire scores), discriminative ability (ability to discriminate between the three groups), and added value (to measurement of visual acuity and contrast sensitivity) were all superior for both stray light meters. Results of successive measurements are interrelated for the conventional but not the new stray light meter. This indicates a better resistance to fraud for the latter device.

Conclusions: The new computer implemented stray light meter is the most promising device for future stray light measurements.

  • BCVA, best corrected visual acuity
  • CS, contrast sensitivity
  • CSLM, conventional stray light meter
  • NSLM, new stray light meter
  • RC, repeatability coefficients
  • ROCs, receiver operating characteristics
  • VA, visual acuity
  • vision tests
  • glare
  • driving
  • cataract
  • BCVA, best corrected visual acuity
  • CS, contrast sensitivity
  • CSLM, conventional stray light meter
  • NSLM, new stray light meter
  • RC, repeatability coefficients
  • ROCs, receiver operating characteristics
  • VA, visual acuity
  • vision tests
  • glare
  • driving
  • cataract

Statistics from

Disability glare is the reduction in visual performance, caused by veiling luminance on the retina. It is an effect of intraocular stray light. Measurements of glare and stray light are particularly important for drivers,1–3 cataract,4–6 and refractive surgery.7–14 Glare testing in the elderly may be important in view of the high accident rates in this age group,15 especially at night.16 Moreover, glare measurements may predict future decrease of visual acuity.17

Over the years, many glare testers have been developed. Most of these measure either visual acuity or contrast sensitivity in the presence of a glare source. None of these has evolved into a universally accepted standard.18–20 The stray light meter21–23 provides a direct measure of intraocular stray light, instead of measuring the effect on perception. It is therefore considered the current “gold standard,” but it is, as yet, suited for laboratory use only.24 The design of the equipment does not allow implementation in a setting in which fraud resistance is essential.

Glare measurements for drivers are advocated by the German Ophthalmological Society.2 The guidelines are based on the classic study of Aulhorn and Harms.25 This led to the development of the Nyktometer and the Mesotest. Although widely used,25–30 the properties of these devices (reproducibility, validity, and discriminative ability) have been scarcely investigated.31–33

In the current study, we compared the properties of several stray light and glare test devices. Apart from the stray light meter, the Nyktotest, and Mesotest, a new, computer implemented version of the stray light meter was included that, intentionally, does not have several of the drawbacks of the original device.



A cross sectional cohort of 112 subjects was drawn from the patients and visitors of the three participating clinics. Subjects belonged to one of the following groups:

  • The young group: between 20 and 40 years of age, no ocular disease, corrected visual acuity (VA) equal to or better than 0.1 logMAR (Snellen acuity 20/25) in both eyes (n = 40);

  • The elderly group: 50 years of age and over, corrected VA in both eyes equal to or better than 0.1 logMAR, minimal cataract at most, no other ocular disease (n = 37);

  • The cataract group: binocular VA equal to or better than 0.2 (Snellen acuity 20/32), clinically relevant cataract in at least one eye, no other eye disease (n = 35). Subjects had no more than 6 dioptres of myopia, 5 dioptres of hyperopia, and/or 1.5 dioptres of astigmatism. In half of the subjects, glare and stray light measurements were repeated on a different day, to allow repeatability calculations.


Before testing, subjects were asked to answer a questionnaire, containing questions about visual disability experienced during driving.34

Measurement of visual function

Visual acuity (VA) was measured with the ETDRS chart on a logMAR scale,35,36 using best corrected visual acuity (BCVA), according to the modified ETDRS protocol from the AREDS study37; contrast sensitivity (CS) was determined using the Pelli-Robson chart,38–40 with BCVA and, for subjects of 40 years and over, a near addition of +0.75 and expressed as log(contrast sensitivity). Stray light was measured, using the conventional stray light meter (CSLM).21–23 The right and left eyes were measured, using trial glasses as instructed by the designer (may be obtained from the authors), to correct for refraction errors. In addition, stray light was measured using the new (computer implemented) stray light meter (NSLM). This device differs from the CSLM in that the turning knob does not have an end stop and the luminance of the central detection field is kept constant. These modifications aimed at enhancing fraud resistance by eliminating cues, other than detection of flicker in the measurement fields. In addition, the NSLM may be used binocularly, because the subject looks at a computer screen rather than in a (monocular) test tube. Subjects wore trial glasses with their BCVA, with near addition according to age. Measurements were performed for right and left eyes and binocularly. Stray light measurements consisted of the average of three knob settings and were expressed as log(stray light parameter). CSLM and NSLM were performed with “normal” pupils at room light as well as with pharmacologically “dilated” pupils (one drop of tropicamide 0.5%). Mesopic contrast sensitivity and glare sensitivity were measured using the Mesotest II25 (Oculus, GmbH, Wetzlar, Germany) and the Nyktotest 30030 (with both 501 and 502 test discs, Rodenstock, GmbH, Ottobrunn, Germany) and expressed as level, corresponding to log(percentage contrast). The Nyktotest 300 has a brighter illumination of the background than the Mesotest. According to the instructions of the manufacturer, Mesotest measurements were the average of five subsequent readings. The Nyktotest has only one reading per level, therefore each measurement consisted of one reading only. For the Nyktotest, levels 1–8 correspond to log(percentage contrast) of, respectively, 0.02, 0.1, 0.2, 0.3, 0.4, 0.5, 0.7, and 0.92. For the Mesotest, levels 1–4 correspond to log(percentage contrast) of, respectively, 0.02, 0.1, 0.2, and 0.3.

Definition of impaired

Visual acuity was considered impaired when higher than 0.3 logMAR (Snellen acuity lower than 20/40); stray light when the log(s) parameter was higher than 1.4; contrast sensitivity when the log(contrast sensitivity) was less than 1.25,41 Nyktotest and Nyktotest with glare when more than 40% contrast was needed, corresponding to level 5,30 and Mesotest and Mesotest with glare when more than 20% contrast was needed, corresponding to level 2 (DOG recommendations for class 1 drivers2).

Quantification of cataract

Cataract was quantified using the LOCS III lens classification.42

Statistical analysis

The statistical analysis was performed using SPSS for Windows PC, release 9.0.0.



The level of agreement between repeated measurements can be expressed as repeatability coefficient RC  =  2 × SD of differences between repeated measurements.43 The Nyktotests and Mesotests have a minimum and a maximum score that is being achieved by many subjects. For example, for the right eyes, minimum or maximum scores were reached in 29 and six subjects for the Nyktotests and Mesotests without glare and in 28 and 26 subjects for the Nyktotests and Mesotest with glare, respectively (see also fig 1). Therefore, many scores are “arbitrarily” fixed on the minimum and maximum values. This truncation of scores artificially improves the repeatability.44 To correct for this effect, in the analysis we removed the maximum and minimum scores on the Nyktotest and Mesotest, both with and without glare. (see table 1: “RC, correction for truncations”).

Table 1

 Repeatability coefficients (RC) and related parameters (n = 49–55)

Figure 1

 Distribution of data for each of the tests. (A) Conventional stray light meter. (B) New stray light meter. (C) Nyktotest without glare. (D) Nyktotest with glare. (E) Mesotest without glare. (F) Mesotest with glare. Note that the vertical scaling differs between panels. For the CSLM and NSLM, there are hardly any “normal” young or elderly subjects who display “impaired” test results (low false positives). Some elderly (and young) subjects display “impaired” results on the Nyktotest and Mesotest without glare, and even more so on the Nyktotest and Mesotest with glare (higher false positives). Results are shown for right eyes only. Results for left eyes and binocular tests are similar.

To allow for comparison of values between tests that use different measurement units, we calculated the ratio between RC′ and the range of measurement values that was obtained. The ratio RC′/RNG hardly differs between the tests (table 1). Ratios are lower only for contrast sensitivity, caused by the wide range of scores obtained with the Pelli-Robson chart.

For none of the tests was there a significant relation between repeatability and test score (regression analysis, all p values larger than 0.24). There was no dependency of repeatability coefficients on the subject group (analysis of variance; all p values ranging from 0.20 to 0.84).


The validity of a test indicates the extent to which the test outcome is related to a reference variable in our study: the LOCS III lens classification and the questionnaire. We found the highest correlations for the CSLM and, to a lesser extent, the NSLM (table 2). We studied the correlation between test outcomes and single parameters of both LOCSIII (NO, NC, C, and P) and the questionnaire (single questions); these were similar to those of the overall scores. Therefore, only the overall scores are being reported. Since the CLSM in the literature is considered the gold standard, we also investigated the relation of test outcomes to the CLSM. These correlations were highest for the NSLM.

Table 2

 Validity: Pearson’s correlation coefficients between tests and reference variables: CSLM, LOCSIII, and the questionnaire (n = 112)

Dilated pupils

The LOCS III lens classification evaluates the properties of the entire lens, the effects of glare and stray light are clustered around the optical axis. This difference may have affected the relation between the two. Therefore, we measured stray light with dilated pupils. We found that the correlations between the CSLM/NSLM, as measured with dilated pupils, and the LOCS III lens classification are no better than those for undilated pupils (table 2).

Discriminative ability

We investigated to what extent the devices could discriminate between our groups. For the CLSM and NSLM, there are no young subjects who display impaired test results (fig 1A, B). There is only one elderly subject with an impaired NSLM result. For Nyktotests and Mesotests (fig 1C–F), several normal elderly subjects display impaired test results, for the with glare condition (fig 1E, F), even some young subjects display impaired results (false positives). For all tests, several subjects from the cataract group display non-impaired results These may not be considered false negatives because cataract can, but not necessarily does, lead to increased stray light/glare values.

Receiver operating characteristics

Table 3 shows the area under the curve of the receiver operating characteristics (ROCs) for comparisons between groups. For all comparisons (cataract versus no cataract; cataract versus elderly; young versus elderly), the results were best for the NSLM. Values for the stray light meters were similar to those for visual acuity and contrast sensitivity. Note that values for VA may be artificially high, because this was a selection criterion of the study. The high values for the LOCS III classification may be caused by the fact that lens opacification, as observed at the slit lamp, was one of the criteria for group assignment.

Table 3

 Values* of “area under the curve” of the receiver operating characteristics (n = 112)

Resistance to fraud: the CSLM and NSLM

For the CSLM, within measurement variability was lower than between measurement variability, indicating that successive measurements are related. This makes the device prone to fraud. This difference is not present for the NSLM (table 4).

Table 4

 Within and between measurement repeatability for the CSLM and the NSLM (n = 49–55)

Added value of a test

We investigated to what extent each of the glare tests provide information, additional to that acquired from visual acuity and contrast sensitivity. Furthermore, we studied to what extent visual acuity and contrast sensitivity provide information that is not being provided by the glare tests. We assumed our a priori group assignment as reference.

In the absence of any covariants, all tests provide information about differences between groups (table 5A, B). If VA and CS are covariants, only the CSLM, NSLM, and Nyktotest and Mesotest with glare provide information about differences between groups (table 5C). In all comparisons, F values for the CSLM and NSLM are highest.

Table 5

 ANOVA statistics for the discrimination between the three groups: young/elderly/cataract (n = 112)


Many authors have favoured the measurement of glare sensitivity for assessing the visual capacity of drivers and for evaluating cataract, but currently it is not included in the directive of the European Union.46 A possible inclusion of a glare or stray light test is hampered by the absence of a universally accepted, fraud resistant measurement technique. In this study we systematically investigated various aspects of several glare and stray light tests. Knowledge about these aspects is crucial before a widespread introduction of one or more of these tests may be taken into consideration.

We found that various aspects such as validity, discriminative ability, and added value are superior for the stray light meters. It may be noted that this device provides a measure of intraocular stray light, which is the cause of glare. Other devices, such as Nyktotests and Mesotests, but also other commercially available glare testers, measure the effect of glare on perception (contrast sensitivity). This effect is largely dependent on the specific measurement conditions. It may be that these conditions do not represent the conditions in daily life when glare is perceived. Apparently, this counts even for devices such as Nyktotests and Mesotests, in which traffic conditions are simulated. Notably, our questionnaire contained questions specifically directed at night time driving. Even the results of our questionnaire were better correlated with the stray light meters than with the Nyktotests and Mesotests. The fact that the stray light meters performed best in our experiments also indicates that intraocular stray light is well correlated with the detrimental effects of glare on perception.


We found that repeatability values, in relation to the range of measurement values, were about similar for all tests. Repeatability coefficients in relation to range of values for contrast sensitivity were much lower, owing to the wide range of measurement values that was obtained with this test. Because this wide range is mainly caused by outliers, this does not necessarily increase the reliability of this test. For the stray light meters, repeatability can probably be improved by changing the measurement strategy into a forced choice method. This is a topic of future study.

The Nyktometer is similar in design to the Nyktotest and similar in test conditions to the Mesotest. Hartmann and Wehmeyer26 investigated the repeatability of the Nyktometer in a study that involved nine subjects, who performed 18 measurements each. The authors found that the SD of the mean of these measurements was at the 0.75 level. Rassow,30 evaluating the Nyktotest, found an accuracy at a plus or minus 1 level. These results agree well with our results: we found an SD of 0.67 for the Nyktotest without glare and 0.70 with glare.

We note that for the Nyktotests and Mesotests, there is only a small distance between the average normal score and the recommended cut-off values. This counts for the young, but especially for the elderly group. (Expressed in units RC, distances were generally less than 1, compared to values of about 2 for visual acuity, contrast sensitivity and stray light meters.) With such small distances, misclassification of subjects (normal subjects being classified as impaired and vice versa) could easily occur. Hartmann and Wehmeyer concluded that this repeatability is “good.” Although we agree with the magnitude of the repeatability value, in view of the small difference between “normal” and “impaired” values, we consider this value low.

Discriminative ability

For the assessment of discriminative ability, we assumed our clinical group assignment as the reference variable. This group assignment was based on a constellation of signs and symptoms: the history of the subject, the clinical assessment of cataract by the ophthalmologist, and the visual acuity. Hence, this group assignment provides a clinical aggregation of the cataract assessment and perceived disability, similar to the LOCS III classification and questionnaire. The information of group assignment, LOCS III, and questionnaire therefore displays some overlap. Since group assignment is a discrete variable, it is particularly suited for discrimination studies. The low discriminative ability of the Nyktotests and Mesotests, particularly in the presence of glare, agrees with the literature: the high rate of false positives constitutes a major problem of these tests.32,34,47

Selection bias and misclassification

The subjects/participants in these experiments formed a non-random selection of the visitors to the outpatient departments of the involved clinics. Group assignment was made on the basis of clinical judgment. The primary intention of this study was to provide an analysis of the measurement devices that are currently available. Any selection bias only affects the concept of glare measurements, not the relation between the various measurement devices. The same counts for possible misclassification errors: elderly normal subjects who erroneously have been assigned to the cataract group and vice versa. These would have weakened the relation between test outcomes and reference variables, but not the relations between the various test devices. The inclusion criterion for the cataract group was a binocular visual acuity of at least 0.6. Although unlikely, it may be that relations between test outcomes and reference variables are different in subjects with lower visual acuities. We note that in such subjects, visual acuity itself is the most important parameter of visual function. Stray light and glare measurements are particularly important in subjects with early cataracts and only slightly decreased visual acuities. Our study demonstrates that in these subjects, measurement of stray light/glare does provide additional information and the stray light meter does discriminate between presence and absence of early cataract.

Glare and driving

Many investigators have promoted the use of glare measurement devices for assessing fitness for driving. Our research demonstrates that the stray light meter may be the device of choice for such assessment. However, before introducing any glare or stray light measurement, there should be a more thorough evaluation of the relation between increased stray light measurements and the detrimental effects on perception as well as an investigation of the prevalence of impaired values in the driving population.


This study was supported by grant I TREN E3 200/7/SI2.282.826 of the European Commission. The authors thank Rodenstock GmbH and Oculus GmbH for the provision of Nyktotests and Mesotests. TJTP van den Berg has a proprietary interest in the stray light meter; the remaining authors do not have any proprietary interest in any of the tests.


View Abstract

Request permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.