On-road and simulated driving: Concurrent and discriminant validation

https://doi.org/10.1016/j.jsr.2011.06.004Get rights and content

Abstract

Introduction

A converging pair of studies investigated the validity of a simulator for measuring driving performance/skill.

Study 1

A concurrent validity study compared novice driver performance during an on-road driving test with their performance on a comparable simulated driving test.

Results

Results showed a reasonable degree of concordance in terms of the distribution of driving errors on-road and errors on the simulator. Moreover, there was a significant relationship between the two when driver performance was rank ordered according to errors, further establishing the relative validity of the simulator. However, specific driving errors on the two tasks were not closely related suggesting that absolute validity could not be established and that overall performance is needed to establish the level of skill.

Study 2

A discriminant validity study compared driving performance on the simulator across three groups of drivers who differ in their level of experience - a group of true beginners who had no driving experience, a group of novice drivers who had completed driver education and had a learner's permit, and a group of fully licensed, experienced drivers.

Results

The findings showed significant differences among the groups in the expected direction -- the various measures of driving errors showed that beginners performed worse than novice drivers and that experienced drivers had the fewest errors. Collectively, the results of the concurrent and discriminant validity studies support the use of the simulator as a valid measure of driving performance for research purposes.

Impact on industry

These findings support the use of a driving simulator as a valid measure of driving performance for research purposes. Future research should continue to examine validity between on-road driving performance and performance on a driving simulator and the use of simulated driving tests in the evaluation of driver education/training programs.

Highlights

► A converging pair of studies supports the use a simulator as a valid measure of driving performance for research purposes. ► A concurrent validity study compared driving performance on-the-road with performance on a simulator. ► This study found a significant relationship between the two tests when performance was rank ordered. ► A discriminant validity study compared performance on the simulator among drivers with different levels of experience. ► This study found beginners performed worse than novice drivers and that experienced drivers had the fewest errors.

Introduction

Young, novice drivers are greatly overrepresented in crashes (Evans, 1987, Gebers et al., 1993, Mayhew and Simpson, 1990, Mayhew and Simpson, 1995, Mayhew and Simpson, 1999, Mayhew et al., 2004, National Highway Traffic Safety Administration [NHTSA], 1994, Williams, 1996, Williams, 2003). Research has consistently shown that the elevated crash risk among young drivers arises from two primary sources: inexperience in driving or skill deficiencies, and age-related factors associated with lifestyle (Jessor, 1987, Mayhew and Simpson, 1995, Simpson, 1996, Williams and Ferguson, 2002). Driver education has, for decades, been a popular and widespread response to this problem. It is endorsed by insurance companies as well as parents and teens, who believe it is effective in making safer drivers (Fuller and Bonney, 2004a, Fuller and Bonney, 2004b, Mayhew, 2007, Plato and Rasp, 1983, Williams and Ferguson, 2004). Program evaluations have, however, failed to produce compelling evidence that young people who complete driver education programs have fewer crashes than those who receive less formal driver instruction (Christie, 2001, Engrstrom et al., 2003, Lonero and Mayhew, 2010, Mayhew and Simpson, 2002, Roberts et al., 2002, Senserrick and Haworth, 2005, Vernick et al., 1999).

On the other hand, such evaluations have often not assessed other outcomes such as changes in knowledge, attitudes, skills, beliefs, and motivations. Moreover, some evaluations have used groups that were not comparable, or used sample sizes too small to reliably detect an effect (Clinton and Lonero, 2006, Engrstrom et al., 2003, Haworth et al., 2000, Lonero and Mayhew, 2010). As a consequence, these evaluations have offered little insight into why driver education courses have failed to reduce collisions, or of equal importance, how these courses can be improved to produce significant safety benefits.

To help address the shortcomings of previous evaluations, a large scale evaluation of driver education is currently being conducted. One segment of the evaluation involves an examination of changes in student outcomes, including knowledge, attitudes, opinions, driving practices, and skills. Most of these outcomes are being measured using a questionnaire, the New Driver Survey, which has been rigorously and extensively developed and tested (Mayhew, Simpson, Marcoux, & Lonero, 2009). Among these outcomes are driving skills, which are critical to the overall evaluation since one of the major purposes of driver education is to teach students how to drive and prepare them to pass the road test.

Accordingly, one segment of the large scale evaluation will seek to determine if exposure to driver education enhances performance skills, particularly because skills appear to be closely related to the risk of collision. To illustrate, a recent study that compared collision-involved teens with those who were collision-free found that the former were more likely to have important skill deficiencies (Mayhew, Simpson, Singhal, & Desmond, 2006). They also rated themselves as significantly less skilled, overall, particularly in anticipating hazards, preventing a skid, controlling the vehicle in a skid, predicting traffic situations ahead, reacting quickly, and driving in the dark. Collision-involved teens were also more likely to commit unintentional driving errors associated with an increased risk of collision.

Therefore, measuring driving skills is of central importance to an evaluation of driver education. Although self-reports provide valuable insights into driving skills, a more objective measure such as that obtained from an on-road test or from a simulator, would greatly enhance the validity of inferences. Driving simulators provide a more controlled and safer environment for measuring performance than an on-road test. However, as noted by Shechtman, Classen, Awadzi, and Mann (2009, p. 380), “driving simulators must be validated before they are used for driving assessment and/or training.” Fortunately there is evidence of validity from studies comparing on-road driving performance with specific driving tasks such as braking (Hoffman, Brown, Lee, & McGehee, 2002) and lateral control (Blaauw, 1982; see Shechtman et al., 2009 for a review). Moreover, a few studies (e.g., Harms, 1996, Törnros, 1998) have successfully compared overall driving performance on-the-road with performance on the simulator, a methodology that is considered to be the “gold standard” for validation (e.g., Bédard et al., 2010, Shechtman et al., 2009).

Establishing the validity of a driving simulator to measure driving skills was the purpose of the two studies reported here. A concurrent validity study was designed to compare driving performance on-the-road with driving performance on a simulator (i.e., how well does a drive test on road correlate with a drive test on a simulator). A discriminant validity study compared driving performance on the simulator among drivers with different levels of experience -- a group of unlicensed beginners compared to a group of novice drivers compared to a group of experienced drivers. This study design determined the degree to which a simulated drive test discriminates between “unskilled” and “skilled” drivers defined in terms of their driving experience.

These two validation studies were conducted in Winnipeg, Manitoba, Canada. Manitoba has a Graduated Driver Licensing Program that requires someone to be at least 16 years of age, or a minimum of 15 years, 6 months and enrolled in a high school driver education course, to obtain a learner's permit. The learner's permit is held for a minimum of nine months and the novice is required to pass a road test as a condition of obtaining their intermediate license.

Section snippets

Simulated driving performance

An initial review was conducted of several PC-based programs that assess driving skills. Results showed that one of these -- STISIM Drive – offered the most viable approach to assessing skill. STISIM Drive has a considerable research track record having been used in sleep, drug, aging, and fatigue studies and in validation studies (Allen et al., 1994, Bédard et al., 2010, Marcotte et al., 2005, Partinen et al., 2003, Rosen, 2004, Ware et al., 2006). In addition, there is evidence that STISIM

Concurrent validity study

The purpose of the concurrent validation study was to compare performance on STISIM Drive with performance on an on-road driving test.

Discriminant validity study

The purpose of the discriminant validity study was to determine if the simulator is sensitive enough to differentiate among drivers with different levels of skill. Accordingly, the discriminant validation study compared the driving performance of three groups of subjects -- a group of true beginners who had no driving experience, a group of novice drivers who had completed driver education and had a learner's permit (data for these subjects were derived from the prior concurrent study), and a

Discussion

Taken together, the concurrent and discriminant studies establish that the simulator can be used as a valid measure for assessing driving performance for research purposes. These findings are consistent with other research (Bella, 2008, Godley et al., 2002, Lee et al., 2004, Törnros, 1998, Yan et al., 2008) that has found simulation can provide a valid index of driving performance. Most prior studies have examined only a few driving performance measures such as speed and turning; however, it

Acknowledgements

This study was made possible by financial support from the AAA Foundation for Traffic Safety, Manitoba Public Insurance, The Centers for Disease Control and Prevention, and the National Highway Traffic Safety Administration. Their support is gratefully acknowledged.

Manitoba Public Insurance also provided assistance in numerous other ways and their support and assistance throughout the course of this investigation is appreciated.

Don Fisher and Matt Romoser from the University of Massachusetts

Daniel R. Mayhew, MA, is Senior Vice President of the Traffic Injury Research Foundation (TIRF). Since joining TIRF in 1980, he has been principal investigator and managed numerous research projects for the Foundation. His research interests include: driver licensing improvement systems, motorcycle safety, young driver accident prevention, senior driver safety, motor vehicle accident data systems, driver competency and experience, commercial vehicle driver licensing, driver education and

References (45)

  • R.W. Allen et al.

    An experimental study of driver alertness monitoring

    (1994)
  • M. Bédard et al.

    Brief report- assessment of driving performance using a simulator protocol: Validity and reproducibility

    American Journal of Occupational Therapy

    (2010)
  • F. Bella

    Driving simulator for speed research on two-lane rural roads

    Accident Analysis and Prevention

    (2008)
  • G.J. Blaauw

    Driving Experience and task demands in simulator and instrumented car: A validation study

    Human Factors

    (1982)
  • R. Christie

    The effectiveness of driver training as a road safety measure: A review of the literature

    (2001)
  • K.M. Clinton et al.

    Guidelines for evaluating driver education programs

    (2006)
  • I. Engrstrom et al.

    Young novice drivers, driver education & training: Literature review (VTI rapport 491A)

    (2003)
  • L. Evans

    Young driver involvement in severe car crashes

    Alcohol, Drugs, & Driving

    (1987)
  • R. Fuller et al.

    Driver Education and Training in Post-Primary Schools in Ireland. Monograph

    (2004)
  • R. Fuller et al.

    Driver education & training in post-primary schools. Monograph

    (2004)
  • L. Garay-Vega et al.

    Hazard anticipation of novice and experienced drivers: Empirical evaluation on a driving simulator in daytime and nighttime conditions

    Transportation Research Record

    (2007)
  • M.A. Gebers et al.

    Teen and senior drivers (CAL-DMV-RSS-93-141)

    (1993)
  • S.T. Godley et al.

    Driving simulator validation for speed research

    Accident Analysis and Prevention

    (2002)
  • L. Harms

    Driving performance on a real road and in a driving simulator: Results of a validation study

  • N. Haworth et al.

    Evaluation of pre-driver education program

    (2000)
  • J.D. Hoffman et al.

    Comparisons of braking in a high fidelity simulator to braking on a test track

    Transportation Research Record

    (2002)
  • R. Jessor

    Risky driving and adolescent problem behavior: An extension of problem-behavior theory

    Alcohol, Drugs, & Driving

    (1987)
  • H.C. Lee et al.

    Assessing the driving performance of older adult drivers: On-road versus simulated driving

    Accident Analysis and Prevention

    (2004)
  • L. Lonero et al.

    Large-scale evaluation of driver education review of the literature on driver education evaluation

    (2010)
  • T.D. Marcotte et al.

    The impact of cognitive deficits and spasticity on driving simulator performance in multiple sclerosis

  • D.R. Mayhew

    Driver education and graduated licensing in North America: Past, present, and future

    Journal of Safety Research

    (2007)
  • D. Mayhew et al.

    New to the Road. Young drivers & novice drivers: similar problems and solutions?

    (1990)
  • Cited by (0)

    Daniel R. Mayhew, MA, is Senior Vice President of the Traffic Injury Research Foundation (TIRF). Since joining TIRF in 1980, he has been principal investigator and managed numerous research projects for the Foundation. His research interests include: driver licensing improvement systems, motorcycle safety, young driver accident prevention, senior driver safety, motor vehicle accident data systems, driver competency and experience, commercial vehicle driver licensing, driver education and training, driving simulation, enforcement, fuel efficient driving, and alcohol, drugs and traffic safety.

    Herb M. Simpson, Ph.D., is a Research and Policy Consultant to the Traffic Injury Research Foundation (TIRF). He served as the Foundation's President and CEO for 31 years. His research expertise covers a wide range of traffic safety issues. However, his contributions to the areas of young drivers, and alcohol, drugs and traffic safety have both received international attention and acclaim. He is a consultant and advisor to governments and industry around the world.

    Katherine M. Wood, MPH, is a Research Associate with the Traffic Injury Research Foundation (TIRF). Since joining TIRF in 2009, she has worked on various aspects of a comprehensive evaluation of driver education. Her research interests include: driver education and training, young driver safety, driving simulation, and alcohol, drugs and traffic safety. She received her Master of Public Health from Indiana University in 2009 and has experience in public health research and research promoting behavioural changes.

    Lawrence Lonero, MS, principal of Northport Associates has led numerous driver R&D and testing projects and advised projects developing computer-based driving instruction. He led the AAA Foundation project to develop guidelines for evaluation of driver education and is currently co-leading implementation of the guidelines across multiple jurisdictions. He also analyses driver factors analysis for civil litigation in road crashes. He is a member the Transportation Research Board Committee on Operator Education and Regulation, the American Driver and Traffic Safety Association, and the American Evaluation Association. Prior to consulting, he held senior government positions in research, program development, and strategy in transportation safety.

    Kathryn M. Clinton, MPH, principal of Northport Associates has extensive experience in research, program development, and evaluation in the fields of education, road safety, and health promotion. She has led or co-led R&D projects across a range of safety and injury prevention initiatives, including road safety policy, behaviour change, driver education and improvement, senior drivers, and program evaluation. She specializes in qualitative research and evaluation. Prior to entering consulting, Ms. Clinton held senior government policy development and management positions in the transportation and energy sectors. Ms. Clinton has a PhD (ABT) in Population Health at the University of Ottawa, Canada.

    Amanda G. Johnson is an Administrative Assistant with the Traffic Injury Research Foundation (TIRF). Amanda provides technological, clerical and organizational support. She is very knowledgeable and provides technical assistance with regard to the use of driving simulators.

    View full text