AIMS/BACKGROUND The construction and validation of an instrument for the assessment of subjective visual disability in the cataract patient is described. This instrument is specifically designed for measuring the outcome of cataract surgery with respect to visual disability.
METHODS Visually related activities thought to be affected by cataract were considered for the questionnaire. These were reduced by pilot study and principal components analysis to 18 items. A patient’s assessment of his/her ability to perform each task was scored on a four point scale. Scores were averaged to create an overall index of visual disability, as well as subscale indices for mobility related disability, distance/lighting/reading related disability, and near and related tasks visual disability. The questionnaire, administered verbally is entitled “The Visual Disability Assessment (VDA)”. Reliability testing included test-retest reliability, interobserver reliability (ρ, the intraclass correlation coefficient), and internal consistency reliability (Cronbach’s α). Construct validation, the process for proving that a test measures what it is supposed to measure, included consideration of content validity, comparison with the established Activities of Daily Vision Scale (ADVS) and empirical support with factor analysis.
RESULTS For the four indices, interobserver reliability varied from 0.92 to 0.94, test-retest reliability varied from 0.96 to 0.98, and internal consistency reliability varied from 0.80 to 0.93. The VDA compared favourably with the ADVS by correlation, but Bland–Altman analysis demonstrated that the two instruments were not clinically interchangeable. Factor analysis suggests that all test items measure a common theme, and the subgroupings reflect common themes.
CONCLUSIONS The VDA is easy to administer because it has a short test time and scoring is straightforward. It has excellent interobserver, test-retest, and internal consistency reliability, and compares favourably with the ADVS, another test of visual disability. Factor analysis demonstrated that the 18 items measure a related theme, which can be assumed to be visual disability. The VDA is a valid instrument which provides a comprehensive assessment of visual disability in cataract patients and is designed to detect changes within a patient over time.
- visual disability assessment
Statistics from Altmetric.com
Cataract is the leading cause of blindness worldwide and the leading cause of reversible blindness in most developed countries.1-3 In developed countries, the prevalence of cataract and the success of surgery are reflected in the high expenditure on cataract surgery by government health services.3-5 However, in many countries resources are limited and expenditure may need to be justified in terms of patient benefit.2 6 The traditional measure of clinical progress, visual acuity, can both be insensitive to the presence of eye disease7-12 and fail to completely capture visual disability.13-19 Other tests of visual function have been proposed to meet this shortfall, such as contrast sensitivity and glare loss.14 19-21 However, these have yet to be proved to relate to preoperative visual disability and postoperative changes in visual disability.2 Some even propose that an assessment of subjective visual disability is sufficient to evaluate the cataract patient and that measures of visual functions, such as contrast sensitivity and glare loss, are unnecessary.22 Since the goal of cataract surgery is to reduce visual disability,2it is necessary first to define and then measure visual disability reliably in order to quantify patient benefit.
Disability is the restriction or lack of ability to perform an activity in a manner or within a range considered normal for a human being.23 Visual disability is disability caused by impairment of vision. Assessment of visual disability can be used as part of any ophthalmological appraisal of patients with sight threatening disease to assess the impact of the disease on the patient as well as the impact of any treatment. The need to rate the status of patients using an index of functional disability has been recognised increasingly in many medical fields for both clinical research and clinical practice.24-26
We have constructed and validated an instrument for the assessment of subjective visual disability in cataract patients. This instrument is designed to look at the outcome of cataract surgery, including the relation between subjective visual disability and objective measures of visual function, how these change with cataract surgery, and which variables influence patient satisfaction. The construction and validation process described demonstrates how to create such an instrument. This may be of value to other investigators who need to create a tool for use in different communities. Since disability connects the individual’s ability to function with the demands of their environment, the system of assessment must reflect these demands. For example, aboriginal hunter gatherers from central Australia will need a different questionnaire from urban dwellers in Adelaide, South Australia. This has recently been considered in a study based in India.27 Similarly, more subtle differences in communities still require differences in assessment. Thus, scoring systems are affected by locality.
In May 1993, when construction and validation of this tool for the assessment of subjective visual disability began, other questionnaires had been described and validated,17 28-33 but only one of these, the Activities of Daily Vision Scale (ADVS), seemed suitable for the assessment of visual disability in cataract patients in Western urban society in the 1990s. The earlier tools focused too much on gross levels of disability to be relevant to a cataract surgery population in the 1990s3 34-36; this was proved in the item reduction phase. However, even the ADVS was considered to have several disadvantages that could have been overcome with the creation of a new instrument. Other groups thought similarly. In the United States, a similar instrument to ours, the VF-14, was being developed at the same time.37
The major disadvantage of the ADVS for use in a study of the outcome of cataract surgery carried out in Adelaide, was the relevance of the items to the local population. There was also concern over the time taken to complete the ADVS and to calculate the scores. The paper describing the ADVS also failed to demonstrate the internal consistency of the subscales.17 One planned purpose for the VDA was to compare subscales of visual disability with various measures of visual function, such as contrast sensitivity, glare loss, and colour vision, so robust subscales were essential. Furthermore, the content of the ADVS was different from some areas we were keen to study. Specifically, mobility related disability was not sufficiently probed, there was extensive examination of driving ability for a population in which many do not drive and there was concern about the specific nature of the near task questions; these two latter content issues may waste time in data collection. Finally, there was a concern as to whether the ADVS was designed to discriminate between cataract patients along a continuum of visual disability or to evaluate change after cataract surgery. Mangione and others’ original paper stressed the former and clearly stated that more research was required to establish the latter.38 Subsequently, the ADVS has been successfully used to measure change in disability after cataract surgery.38
An instrument for quantifying the visual disability of cataract patients in an outcome study needed not only to measure disability but also to be particularly sensitive to clinical progress. An instrument can both discriminate between individuals along a continuum of visual disability and evaluate change in an individual over time, but optimising for the former may impede the latter.39 The responsiveness of an instrument to change is a ratio of the change in subjects after intervention to the variability of stable subjects.40 The intention was to create an instrument which was sensitive to the impact of intervention but was stable to subtle changes in clinical state.41 The obvious step was to consider the gradations of the scale; the ADVS uses a five point scale of patient response, but intuitively a four point scale may be more robust. Fewer points may be more stable again, but this would sacrifice the discrimination of individuals along a continuum of disability.42 This discrimination along a continuum is assisted for the subscales if they include a large number of items.39 Thus, reducing the response choices to four may improve responsiveness (by increasing stability) without a great sacrifice in discriminating between individuals if factor analysis suggests that subscales should include multiple items.
The instrument also needed to be quick and easy to use because the patients in the main study were subjected to a long series of visual function testing, including measures of low contrast visual acuity, contrast sensitivity, glare loss, and colour vision, in addition to verbal questioning. Hence, further lengthy questioning would have been too tiring, especially considering the age distribution of a cataract population. Similarly, a short testing time would also make the instrument more suitable for other studies implemented in busy clinical situations where time limitations preclude the use of more lengthy tests.
SELECTING THE ITEM POOL
To ensure content validity, it was necessary to identify as many visually related tasks as possible that were potentially affected in the cataract patient. All such activities were included in the pool of items considered for the questionnaire. A retrospective examination of 50 cases of cataract yielded a list of many activities cited as impaired by cataract but which were expected to be improved by cataract surgery. Activities listed by other authors in attempts to quantify visual disability were also included,14 17 22 28-32 43 as were those cited in the specific literature on visual impairment and cataract.44-52
Items that were too specific to be relevant to the majority of patients (for example, oil painting or cross stitch) were eliminated or grouped into a general item (for example, hobbies). This left 37 items which were all included in a pilot questionnaire administered to 15 cataract patients. Several items, comprising mainly self care activities such as eating, personal grooming, and use of the telephone, were eliminated after the pilot study because this cataract patient population was not sufficiently impaired to have difficulty with these activities. Such items had come chiefly from early studies of cataract and visual disability which used patient populations more impaired than that typically operated on in the 1990s.28 29 This reflects the increased tendency to operate at an earlier stage of cataract than 15 years ago.3 34-36
Principal components analysis is a multivariate statistical method which can demonstrate how well items relate to each other and how much each item contributes to the overall variance. While it is important that all items measure a common theme, items which are too closely related are redundant. Items contributing to less than 0.4% of the variance of all items were eliminated. Redundant items included reading normal print books and reading newspapers when a more general reading question was already included. Reduction to a minimum number of items improves instrument efficiency, shortens test time, and reduces user and subject burden.39 This reduced the list to 18 items (Table 1). Each remaining item was thought to contribute to the pool of information about visual disability.
Categorical scales are amenable to statistical analysis if they have enough categories; a seven point scale is sufficient.39 Continuous scales such as Rosser lines or visual analogue scales are also excellent but they require the questionnaire to be administered in a written format. The main hurdle for applying this questionnaire to cataract patients is that some will not be able to see well enough to complete a written format, so the questionnaire needs to be administered verbally. This leads to the problem, especially relevant to a geriatric population, that a patient can only remember a limited number of categories on a scale. Pilot work suggested that the maximum number of scaled responses that could be remembered by most subjects was four, so in response to the question: “To what extent, if at all, does your vision interfere with your ability to carry out the following activities?”, the responses of choice were: not at all, a little, quite a bit, and a lot. The patient responses were recoded with the numerical values 1, 2, 3, 4, respectively. Hence, all items are scored in the same direction and in the same units. The use of a short scale also assists uniformity of interpretation.39
In assessing visual disability, patients are also asked to take into account both the degree to which they can perform each task and the extra effort involved. This is because ratings of the magnitude of performance on tasks are misleading if the patient’s effort is not considered.24 Patients are instructed to assess their disability with both eyes open and their habitual spectacle correction worn. The patients receive no further explanation.
The questionnaire is scored by adding all the numerical scores together and dividing by the number of questions answered. Missing data are dealt with by including only answered questions in the index. For example, if all 18 questions are answered then the sum of the answers is divided by 18. If the individual has never driven a car, two questions cannot be answered and the sum of the answers is divided by 16 (18–2). Several subscales are also scored (Table 1); the choice of items in each subscale is justified by factor analysis. All questions relating to the mobility index (seven items), distance/lighting/reading index (eight items), and the near and related tasks index (five items) are aggregated in the same manner. Missing data are treated the same way as for total visual disability index. Thus all scales give a score in a range of 1 (no disability) to 4 (severe disability). The resulting instrument is called the “Visual Disability Assessment” (VDA).
Short integer scales, such as four point scales used here, lack responsiveness to subtle changes in clinical state.42However, this is an advantage for detecting major changes in clinical state, such as the impact of surgery.40 If the instrument is stable to subtle variations in clinical state, then it is more likely to be responsive to larger changes, which enhances instrument utility in outcome studies.39 The creation of indices by combining several four point scale items has the effect of improving the sensitivity of the instrument to clinical change.42For example, the total visual disability index includes 18 items, so the number of steps becomes 4 × 18 = 72. Similarly, the number of steps on the other subscales is extended through item combination. This has the advantage of creating an instrument that is extremely sensitive to major events without completely sacrificing the ability of the tool to discriminate between patients along a continuum of disability.
RELIABILITY AND VALIDITY TESTING
Reliability is the proportion of the total variance which is attributable to true differences among subjects. The remainder of the variance is noise which is considered to result from test-retest variation, interobserver variation, and internal inconsistency.53 The reliability of an instrument is determined by measuring test-retest reliability, interobserver reliability, and internal consistency reliability, usually with ρ, the intraclass correlation coefficient, and Cronbach’s α.53-55
Validity is the extent to which the instrument measures what it is intended to measure.53 This is assessed by comparison with a universally accepted standard (criterion validity), if one exists. However, for disability measurements there are no universally acknowledged standards.24 41 53 56 For this reason, validity is established by ensuring that all relevant aspects of visual disability are included in the instrument (content validity), by comparison with instruments which purport to measure the same thing, and by factorial validity which helps establish how the items in the instrument can be grouped into scales measuring the same thing.53 57 58 This method of validation using these three techniques together is called construct validity.
In order to examine the reliability and validity of the VDA, it was administered to 438 cataract patients attending the outpatients ophthalmology clinic at Flinders Medical Centre, Adelaide, South Australia. All patients consented to participate and had sufficient language skills and cognitive function to complete the questionnaire.
Test-retest reliability was conducted by the same observer administering the questionnaire twice at an interval of one week. This should be sufficient separation time to negate the effects of memory without allowing the condition to change.59 Interobserver variation testing was also conducted at an interval of one week using one other trained observer. These results were compared using ρ, the intraclass correlation coefficient. The VDA was also assessed for internal consistency reliability using the standard Cronbach’s α,55 that tests whether multiple items in an instrument measure the same thing; this is assumed if they are summed to create a single index.
Content validity is the extent to which the items chosen reflect all visually related activities that are potentially affected in the cataract patient. Content validity cannot be formally assessed because it is difficult to prove conclusively that the items chosen were representative of all possible items.60 However, the methods outlined above in item selection are important steps for establishment of content validity.53
Criterion validation of the VDA is not possible as there is no universally accepted standard in visual disability scoring,24 41 53 56 but established instruments can be used to see how close they come to measuring the same thing. However, the error cannot be ascribed to either instrument to declare its departure from reality. Despite these difficulties, an important part of construct validity is to compare the VDA with a surrogate standard. In order to do so, the VDA was compared with the ADVS, which is a validated instrument for measuring visual disability, and although different in construct with the VDA, both purport to measure visual disability.17 The ADVS was administered as described by its authors.17 The correlation of the two instruments was measured with Spearman correlation coefficients and the agreement or interchangeability was assessed with Bland–Altman analysis.61
The final method considered for construct validity was factor analysis which was be used to provide empirical support for the instrument’s scales (factorial validity).53 57 Factor analysis is a multivariate statistical method which when applied to a matrix of variables reduces those variables to a number of factors.62 The grouping of variables into factors depends on how well each variable relates to each factor. The proportion of the variance described by the principal factor indicates whether the instrument tests in one or more content areas.58 However, factor analysis does not provide a unique solution. The analysis can be “rotated” by various techniques such as varimax or oblimin to find items which can have high communality and thus form additional factors.62 This grouping of items into additional factors can be used to justify the creation of subscale indices as it is proof that the items sample the same content area specified by the factor to which they contribute.58
Item reduction utilised principal components analysis. Internal consistency reliability was estimated using Cronbach’s α.55 Test-retest reliability and interobserver reliability were tested with ρ, the intraclass correlation coefficient. Criterion validity was estimated by comparison of the VDA with the ADVS using Spearman correlation coefficients and examination with Bland–Altman analysis. Finally, factor analysis was used to support the construct validity of the VDA, including a Kaiser–Caffrey reliability coefficient.63 All statistical analyses were performed on spss for Windows (SPSS Inc) or done manually.
The VDA was administered to 438 cataract patients. Ages ranged from 40 to 91 years with a mean of 74.1 (SD 7.7) years. Sixty six per cent of subjects were female. Factorial validity testing used data from all 438 patients. Eighty six patients had repeat administration of the VDA for reliability testing and 40 were administered the ADVS for criterion validity testing.
For the purposes of instrument comparison, the time taken to administer the VDA and the ADVS was recorded on 20 patients who received both questionnaires. The VDA took on average 5 minutes to complete (range 3–9 minutes) and the ADVS took 8 minutes to complete (range 5–14 minutes). The VF-14 has been reported to take 4 minutes, on average, to administer (range 2–14 minutes).64 The VF-14 range is not comparable with the other results because the patient group was different. In patients with poor communication skills or poor cognitive ability the VDA may also take longer. The time taken to calculate scores on the VDA varied from 2 to 3 minutes, but for the ADVS score calculation required between 5 and 7 minutes.
The data for all 438 patients gave results across the full range of each index. The total visual disability score ranged from 1.00 to 4.00 (mean 1.66 (SD 0.68)). The near visual disability score ranged from 1.00 to 4.00 (1.73 (0.71)). The distance visual disability score ranged from 1.00 to 4.00 (1.98 (0.85)). The mobility visual disability score ranged from 1.00 to 4.00 (1.40 (0.67)).
Interobserver reliability was estimated with ρ to be 0.94 for the total visual disability score and ranged from 0.92 to 0.93 for the three subscales (Table 2). Test-retest reliability was also estimated with ρ and found to be 0.98 for total visual disability score and to vary from 0.96 to 0.98 for the three subscales (Table 2). Internal consistency reliability as estimated by Cronbach’s α was 0.93 for the total visual disability score and ranged from 0.80 to 0.92 for the three subscales (Table 2).
Construct validity consists of a surrogate criterion validity and factorial validity. Criterion validity, where ADVS scores are used to represent the standard, was estimated by correlation using Spearman correlation coefficients, and by agreement or interchangeability, using Bland–Altman analysis. The VDA correlates well with the ADVS for overall scales −0.83 and distance scales −0.84 but less well for near scales −0.53. These coefficients are negative because increasing disability yields an increased score on VDA, but a decreased ADVS value. Bland–Altman assessment of agreement between the ADVS and the VDA finds limits of agreement of −0.94 to 0.70 difference on the 1–4 VDA scale or −32 to 24 on the 1–100 ADVS scale. Assuming the VDA and the ADVS have identical anchoring and that there was perfect agreement, a score of 2.5 on the VDA would be 50 on the ADVS, but with these limits of agreement it varies from 18–74 (2 SD). These limits are almost equivalent to plus or minus one response category which is clearly too broad for the two instruments to be clinically interchangeable.
Unrotated factor analysis identifies one factor which explains 50% of the variance and shows excellent loadings (0.60–0.84) with all items except appreciating colours, with which it has a reasonable loading (0.30) (Table 3). The factor analysis findings give a Kaiser–Caffrey reliability coefficient of 0.94. Oblimin rotation identifies three other factors which correlate well with only some items. These factors can be categorised as representing the themes of these items. These three subvariables are best interpreted as a mobility factor, a distance/lighting/reading factor, and a near and related tasks factor (Table 4).
The VDA takes less time to administer and score than the ADVS. This is the result of structural and methodological differences in the two questionnaires. All 18 VDA questions use the same format and are answered on the same scale, thus speeding up completion and score calculation. The questions are set up on the ADVS to ask whether the respondent performs the task and then to ask how well they perform the task. This means that for each of the 20 items, two questions are asked instead of one in the case of the VDA. The responses vary on some ADVS questions which lengthens observer explanation and slows patient response. Scoring is also slowed by the additional step of converting scale scores to a 1–100 scale. The VDA scoring method reports on a scale from 1 to 4 which is easily conceptualised. The ADVS is also slower to score because it is set out over seven pages and quite a lot of page turning is required to calculate the subscales. Previous reports suggest the VF-14 to be as quick to implement as the VDA.64 The VDA is a quick and effective method of measuring visual disability. The items cover similar areas as a normal history taking for cataract, although perhaps in more detail.
The reduced list of items seems to be representative of the universe of questions that could have been asked because accepted methodology was followed for arriving at the short list of items: consideration of patients’ preferences on which disabilities they wanted cataract surgery to reverse, consideration of other authors’ disability items, consideration of items in the literature on cataract and disability, consideration of the suggestions of experienced cataract surgeons, and item reduction by pilot study and principal components analysis.24 39 53 65 The other test for content validity is whether the instrument is clinically sensible,24 65 which it seems to be. This procedure for content validity defines the relevance of the tool to the population it was developed for. Since the cataract surgery outcome study that the VDA was developed for uses the same population, this process establishes relevance of the VDA to its target population.
The VDA exhibits very high interobserver reliability. The intraclass correlation coefficients (ρ) were 0.94 for total visual disability, 0.93 for mobility related visual disability score, and 0.92 for both distance/lighting/reading and near and related tasks visual disability. These are excellent reliability scores which suggest that the VDA is stable across different observers.53 This would make the VDA suitable for outcome studies where different individuals may collect preoperative and postoperative data.
The VDA has excellent test-retest reliability. The intraclass test-retest correlation coefficients (ρ) were 0.98 for total and distance/lighting/reading visual disability, 0.97 for mobility related visual disability, and 0.96 for near and related tasks visual disability. These are exceptionally high scores.56 66 The error variance (random fluctuations) between the two performances is minimal.
High test-retest correlation is probably due to the use of a short four point scale. The gaps between responses are quite large in a short scale, so respondents are unlikely to give different responses. This approach gives excellent test-retest reliability but may sacrifice sensitivity to small changes in status over time.40 42Finer scales, such as Rosser lines, may be more likely to give poorer test-retest correlation, but may be more sensitive to small changes in status.40 This would be more suitable for questionnaires principally designed to scale individuals on a continuum of disability. However, this questionnaire was intended for looking at the impact of cataract surgery on visual disability, which should involve large changes in disability status, so good test-retest reliability is more important than sensitivity to subtle changes in disability status.67 The creation of subscales by combining several four point scale items has the effect of improving the sensitivity of the instrument to subtle clinical change by effectively increasing the number of categories on the scale.42 This has the advantage of creating an instrument that is both sensitive to major events without sacrificing the ability of the tool to discriminate between patients along a continuum of disability.
The excellent test-retest reliability implies that the VDA is very stable across time. High test-retest reliability is necessary for studies where the same individual is being retested at different times.53 A reliability of greater than 0.90 allows comparison of differences between individual cases, whereas only comparisons between groups are appropriate for lower reliabilities.68
The VDA has excellent internal consistency reliability. This is important since multiple items were combined into indices and such an approach is only valid if the multiple items measure the same thing, in this case, visual disability. Cronbach’s α is 0.93 for total visual disability, 0.92 for mobility visual disability, 0.89 for distance/lighting/reading visual disability, and 0.80 near and related tasks visual disability. This suggests that the items in the overall index, as well as all three subscales, accurately detect the presence of their theme, which can be assumed to be domains of visual disability. It is generally held that Cronbach’s α should be at least 0.80 to detect accurately the presence of the theme or to detect changes following intervention.57 An α of 0.90 is necessary if results are to facilitate clinical decision making, but an α of 0.70 is acceptable under some circumstances—for example, exploratory research.69
Since there are no standards for visual disability, a surrogate standard is used to provide evidence for construct validity. That standard is the ADVS.17 The correlation coefficients which compare the VDA with the ADVS are negative because increasing disability yields an increasing score on the VDA but a decreasing score on the ADVS. The magnitude of the correlations are adequate at −0.83 for the overall scores, −0.84 for the distance scores, and −0.53 for the near scores. The ADVS does not have a mobility subscale. This does not prove the validity of the VDA, but simply shows that the VDA and the ADVS measure a similar concept, which is probably visual disability. However, it also does not prove that the VDA and the ADVS measure visual disability so similarly that they are interchangeable.
Bland–Altman assessment of agreement between the two measures demonstrates that the limits of agreement are too broad for the two instruments to be clinically interchangeable. This is not to say the VDA and the ADVS do not measure the same concept, but they measure and scale it in different ways. Both questionnaires measure visual disability, and both could be used for research on visual disability, but their scores cannot be compared within a single study because their limits of agreement are too broad. These differences reflect the different structures of the two questionnaires. The VDA includes domains of visual disability which we were interested in quantifying to assess their relation with various measures of visual function, such as contrast sensitivity, glare loss, and colour vision. Therefore, the VDA has more questions on mobility related activities, whereas the ADVS probes vision for driving in more depth. The ADVS has specific questions on near tasks, whereas the VDA has general questions about near tasks which individuals can apply to their own situation. The ADVS has no specific questions about appreciating colour and the two instruments address glare disability in different ways. It should not be surprising that the two instruments do not have sufficient agreement to be clinically interchangeable, even though they both measure visual disability.
The ADVS has some practical differences from the VDA in addition to the content differences. The ADVS takes longer to administer and to calculate scores for the scales, whereas the VDA is easier to score because all items are scored in the same way. This uniformity aids interpretation.39 The ADVS varies a little across questions; moreover, the final scales are converted to 100 point range whereas the individual questions are on a 1 to 5 scale. Although conversion is straightforward, it perhaps requires more thought for interpreting than the VDA. The ADVS is set out over seven pages, whereas the VDA requires only one page. The saving of both paper and time was important in the context of a large study into the outcome of cataract surgery where lots of other time and paper consuming data were collected.
An alternative instrument for use as a surrogate criterion was the VF-14.37 However, the VF-14 paper was not published when development of the VDA began in May 1993. Although developed without the same rigour for item selection and reduction, the VF-14 contains many items similar to the VDA and a similar scoring system. The VF-14 also has good reliability and validity.37 70
The construct validity of the VDA could also be explored by comparing VDA scores with practical tests of functional ability to perform the tasks listed. This was not attempted because the practical difficulties associated with physical measurement of a person’s ability to perform such tasks as crossing a road or watching TV, including scoring and scaling problems, would cause significant variation in the relation between test scores and VDA scores. This approach would add nothing to the construct validity of the VDA.
Alternatively, the VDA could be compared with tests of vision if tests of vision were true and robust indicators of functional ability. However, there have been numerous reports that tests of vision do not capture visual disability very well.17 20 21 71Furthermore, it was planned to use the VDA to explore the relation between measures of vision and visual disability, the use of measures of vision as part of construct validity created a circular logic which would defeat the aims. Again, clinical history involves patients’ subjective appraisal of their own abilities so this remains the most appropriate standard for validity testing rather than practical tests or vision tests.
The final aspect of construct validity is factorial validity. This provides empirical support for the instrument’s scales by demonstrating how well the items in the VDA measure common themes.53 57 A large proportion of the variance is explained by the first factor which shows excellent communality with almost all items. This suggests that the instrument is valid for the measurement of one content area,58 namely, visual disability. However, rotation of the factor analysis reveals that three factors can be identified. These can be classified as a mobility factor, a distance/lighting/reading factor, and a near and related tasks factor. The most strongly communal items were grouped into the subscales which have the common content areas listed above.58 For the mobility index and the distance/lighting/reading subscale communality was greater than 0.70 for each item (Table 4). For the near and related tasks, communality was greater than 0.40 for each item (Table 4). The use of five to eight items in each subscale assists discrimination along a continuum of visual disability. In addition to factorial validity, each subscale has excellent internal consistency (Cronbach’s α >0.80) which facilitates research using these subscales, such as their relation to measures of visual function. Interestingly, the VF-14 does not yield clinically coherent subscales when applied to a large number of cataract patients and examined with factor analysis.37
The VDA is designed to quantify visual disability in the cataract patient. The 18 items included were carefully selected by robust methodology to ensure content validity. Patients are asked to assess the extent to which their vision interferes with their ability to carry out these activities. Answers are limited to four possible levels of visual interference: 1 not at all, 2 a little, 3 quite a bit, 4 a lot. The scaling system is designed to detect large changes in status that may occur with surgery rather than subtle changes that may occur with a slight increase in cataract severity. Scores for all 18 VDA items are combined to create an overall index of visual disability. Scores for subsets of items can also be combined to create subindices of visual disability. The VDA overall index and the three subscales have excellent reliability and validity. The questionnaire is quick to administer, easy to score, and has meaningful interpretation. The VDA is a suitable instrument for cataract surgery outcome studies where a measure of visual disability is required.
We wish to thank Professor John Keeves of the School of Education, Flinders University of South Australia, for helpful advice on questionnaire development, Bronwyn Krieg for interobserver data collection, and Wendy Laffer of the Department of Ophthalmology, Flinders Medical Centre, for editorial assistance.
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.