Academia.eduAcademia.edu
1650 SPECIAL COMMUNICATION Recommendations for the Use of Common Outcome Measures in Traumatic Brain Injury Research Elisabeth A. Wilde, PhD, Gale G. Whiteneck, PhD, Jennifer Bogner, PhD, Tamara Bushnik, PhD, David X. Cifu, MD, Sureyya Dikmen, PhD, Louis French, PsyD, Joseph T. Giacino, PhD, Tessa Hart, PhD, James F. Malec, PhD, Scott R. Millis, PhD, Thomas A. Novack, PhD, Mark Sherer, PhD, David S. Tulsky, PhD, Rodney D. Vanderploeg, PhD, Nicole von Steinbuechel, PhD ABSTRACT. Wilde EA, Whiteneck GG, Bogner J, Bushnik T, Cifu DX, Dikmen S, French L, Giacino JT, Hart T, Malec JF, Millis SR, Novack TA, Sherer M, Tulsky DS, Vanderploeg RD, von Steinbuechel N. Recommendations for the use of common outcome measures in traumatic brain injury research. Arch Phys Med Rehabil 2010;91:1650-60. This article summarizes the selection of outcome measures by the interagency Traumatic Brain Injury (TBI) Outcomes Workgroup to address primary clinical research objectives, including documentation of the natural course of recovery from TBI, prediction of later outcome, measurement of treatment effects, and comparison of outcomes across studies. Consistent with other Common Data Elements Workgroups, the TBI Out- From the Departments of Physical Medicine and Rehabilitation (Wilde, Sherer), Neurology (Wilde), and Radiology (Wilde), Baylor College of Medicine, Houston and TIRR Memorial Hermann, Houston (Sherer); Michael E. DeBakey Veterans’ Administration Medical Center, Houston (Wilde); University of Texas Medical School at Houston, Houston, TX (Sherer); Craig Hospital, Englewood, CO (Whiteneck); Department of Physical Medicine and Rehabilitation, Ohio State University, Columbus, OH (Bogner); Department of Rehabilitation Medicine, Rusk Institute for Rehabilitation, New York, NY (Bushnik); Department of Physical Medicine and Rehabilitation, Virginia Commonwealth University, PM&R Service, Hunter Holmes McGuire Veterans Administration Medical Center, Richmond, VA (Cifu); Department of Rehabilitation Medicine, University of Washington, Seattle, WA (Dikmen); Department of Orthopaedics and Rehabilitation, Walter Reed Army Medical Center, Washington DC (French); JFK Johnson Rehabilitation Institute, Edison, NJ (Giacino); Spaulding Rehabilitation Hospital/Harvard Medical School, Boston, MA (Giacino); Moss Rehabilitation Research Institute, Elkins Park, PA (Hart); Department of Physical Medicine and Rehabilitation, Indiana University School of Medicine, Indianapolis (Malec); Rehabilitation Hospital of Indiana, Indianapolis, IN (Malec); Departments of Physical Medicine and Rehabilitation and Emergency Medicine, Wayne State University School of Medicine, Detroit, MI (Millis); Department of Physical Medicine and Rehabilitation, University of Alabama at Birmingham, AL (Novack); Department of Physical Medicine and Rehabilitation, University of Michigan, Ann Arbor, MI (Tulsky); Psychology Service, James A. Haley Veterans’ Hospital, Tampa (Vanderploeg); Departments of Psychology and Psychiatry, University of South Florida, Tampa, FL (Vanderploeg); and Department of Medical Psychology and Medical Sociology, University Medical Center Göttingen Georg-AugustUniversity, Göttingen, Germany (von Steinbuechel). Supported by the National Institutes of Health (NIH; National Institute of Neurological Disorders and Stroke), U.S. Department of Veterans Affairs (VA), U.S. Department of Defense (DoD), and U.S. Department of Education/National Institute on Disability and Rehabilitation Research. No commercial party having a direct financial interest in the results of the research supporting this article has or will confer a benefit on the authors or on any organization with which the authors are associated. Gale G. Whiteneck, PhD, is Chair of the Traumatic Brain Injury Outcomes Workgroup. Views expressed are those of the authors and do not necessarily reflect those of the agencies or institutions with which they are affiliated, including the U.S. Department of VA, the U.S. DoD, the U.S. Department of Health and Human Services, the NIH, the National Institute of Mental Health, the U.S. Department of Education, or the Uniformed Services University of the Health Sciences. This work is not an official document, guidance, or policy of the U.S. Government, nor should any official endorsement be inferred. Correspondence to Elisabeth A. Wilde, PhD, Baylor College of Medicine, 1709 Dryden Rd, Ste 1200, Houston, TX 77030, e-mail: ewilde@bcm.edu. Reprints are not available from the author. 0003-9993/10/9111-00369$36.00/0 doi:10.1016/j.apmr.2010.06.033 Arch Phys Med Rehabil Vol 91, November 2010 comes Workgroup adopted the standard 3-tier system in its selection of measures. In the first tier, core measures included valid, robust, and widely applicable outcome measures with proven utility in TBI from each identified domain, including global level of function, neuropsychological impairment, psychological status, TBI-related symptoms, executive functions, cognitive and physical activity limitations, social role participation, and perceived health-related quality of life. In the second tier, supplemental measures were recommended for consideration in TBI research focusing on specific topics or populations. In the third tier, emerging measures included important instruments currently under development, in the process of validation, or nearing the point of published findings that have significant potential to be superior to some older (“legacy”) measures in the core and supplemental lists and may eventually replace them as evidence for their utility emerges. Key Words: Outcome assessment; health care; Brain injuries; Neurobehavioral manifestations; Research; Rehabilitation. © 2010 by the American Congress of Rehabilitation Medicine HE PURPOSE OF THE common data elements traumatic T brain injury Outcomes Workgroup was to address the need for a common set of outcome measures for TBI research across agencies and populations, as outlined in Thurmond et al1 (see p. 1633-6, this issue). The work group was composed of physicians, psychologists, neuropsychologists, and others with expertise in TBI outcomes research. Many work group members also had previous collaborative experience in large multicenter TBI research projects, such as the NIH Clinical Trials Network and the National Institute on Disability and Rehabilitation Research TBI Model Systems program, as well as specific TBI-related clinical trials and multicenter studies. SELECTION OF TBI OUTCOME DOMAINS AND MEASURES The work group considered several factors in selecting outcome domains that should be assessed after TBI. First, we wanted to cover outcomes at multiple levels of the International Classification of Functioning, Disability, and Health; in other words, function, activity, and participation.2 Second, we targeted outcome domains previously shown to be affected by TBI and of importance to consumers, scientists, and practitioners. Third, we sought a set of measures that collectively would cover the continua from acute to long-term outcomes and from mild to severe TBI. Thus, the work group examined measures of global outcome; recovery of consciousness; neuropsychological impairment; psychological status; TBI-related symptoms; performance of activities loading on behavioral, cognitive, and physical demands; social role participation; and perceived health-related quality of life, as well as health economic measures. Additionally, a multidimensional domain of patient-reported outcomes was identified as a promising CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde List of Abbreviations ADL ASSIST AUDIT BSI-18 BVMT-R CDE CHART CHART-SF Cog-FIM COWAT CRS-R CWIT DRS FAD FrSBe GOS GOS-E GPT MCS MPAI-4 MMPI-2 MMPI-2-RF NIH NINDS NOS-TBI NSI PART PCL-C PCL-M PCL-S PI PROMIS PTSD QOLIBRI RAVLT RPQ SF-12 SF-36 SWLS TBI TBI-QOL TMT VA WAIS-III WAIS-IV WHO WRAT-4 activity of daily living Alcohol, Smoking, and Substance Use Involvement Screening Test Alcohol Use Disorders Identification Test Brief Symptom Inventory 18 Brief Visuospatial Memory Test–Revised common data element Craig Handicap Assessment and Reporting Technique Craig Handicap Assessment and Reporting Technique Short Form FIM Cognition Subscale Controlled Oral Word Association Test JFK Coma Recovery Scale–Revised Color-Word Interference Test Disability Rating Scale Family Assessment Device Frontal Systems Behavior Scale Glasgow Outcome Scale Glasgow Outcome Scale (Extended) Grooved Pegboard Test minimally conscious state Mayo-Portland Adaptability Inventory Minnesota Multiphasic Personality Inventory 2 Minnesota Multiphasic Personality Inventory 2, Restructured Form National Institutes of Health National Institute on Neurological Disorders and Stroke Neurological Outcome Scale for Traumatic Brain Injury Neurobehavioral Symptom Inventory Participation Assessment With Recombined Tools Posttraumatic Stress Disorder Check List– Civilian Version Posttraumatic Stress Disorder Check List– Military Version Posttraumatic Stress Disorder Check List– Stressor Specific Version Participation Index Patient-Reported Outcomes Measurement Information System posttraumatic stress disorder Quality of Life After Brain Injury Rey Auditory Verbal Learning Test Rivermead Post Concussion Symptom Questionnaire Medical Outcomes Study 12-Item Short Form Health Survey Medical Outcomes Study 36-Item Short Form Health Survey Satisfaction With Life Scale traumatic brain injury Traumatic Brain Injury–Quality of Life Trail Making Test Veterans Affairs Wechsler Adult Intelligence Scale, Third Edition Wechsler Adult Intelligence Scale, Fourth Edition World Health Organization Wide Range Achievement Test, Fourth Edition 1651 area represented by outcome measures currently in development. These domains are described further in table 1. Factors of Importance in Selecting Outcome Measures Within the Domains Within each domain, measures were selected to maximize the ability of clinical researchers to (1) document the natural course of recovery after TBI, (2) enhance the prediction of later outcome, (3) measure the effects of treatment, and (4) facilitate comparisons across studies. The work group divided into smaller subgroups based on interests and expertise to develop lists of names and detailed characteristics of potential measures for each domain. Measures were identified using the following criteria: (1) sufficient representation in the scientific literature and/or widespread use in the TBI clinical and research community in diagnosis, outcome measurement and prediction, or treatment effectiveness; (2) evidence of sound psychometric properties, including (when applicable) construct validity, internal consistency, sensitivity to change, test-retest reliability, intra-/interrater agreement (including subject/proxy and telephone/in-person administration); (3) well-established normative data; (4) applicability across a range of injury severity and functional levels; (5) availability in the public domain; (6) ease of administration; and (7) brevity. The panel also considered factors that would render the measures appropriate for international use, such as availability in different languages and validation in different ethnic groups. For measures of health-related quality of life, activity/participation, and psychological function, consideration also was given to flexibility of formats; for example, telephone interview versus in-person administration or self versus proxy respondent. Finally, for objective neuropsychological measures, the availability of alternate forms to moderate the potential impact of practice effects was considered. The work group considered measures that could be applied in both adult and pediatric populations, but recommended that a separate work group be convened to address pediatric outcome recommendations. Distinguishing Core, Supplemental, and Emerging Outcome Measure Recommendations In accordance with other CDE Workgroups, 3 tiers of CDE were recommended: core, supplemental, and emerging (see Thurmond et al,1 p. 1633-6, this issue). First, well-established core measures covering outcome domains relevant to most TBI studies were included. A listing of 9 core measures was selected, with the idea that most could be applied across large TBI studies either as a comprehensive battery or in addition to other outcome measures selected by the investigator. Use of these measures should be tempered by the objectives, study design, and target population. In the second tier, additional supplemental measures were recommended for consideration in TBI research focusing on more specific topics or populations. For example, a study in which neuropsychological outcome is of particular interest may draw on measures from the supplemental list that target cognitive functions not tapped by the core. In the third tier, emerging measures include important instruments currently under development, in the process of validation, or nearing the point of published findings that have significant potential to be superior to some older (“legacy”) measures currently in the core and supplemental lists. General Process for Selecting CDEs Each member of the panel selected 1 or 2 outcome domains based on his/her interests and expertise or was assigned a Arch Phys Med Rehabil Vol 91, November 2010 1652 CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde Table 1: Outcome Domains and Descriptions Domain Name Global outcome Recovery of consciousness Neuropsychological impairment Psychological status TBI-related symptoms Behavioral function Cognitive activity limitations Physical function Social role participation Perceived generic and disease-specific healthrelated quality of life Health economic measures Patient-reported outcomes (future multidimensional tools) Domain Description and Relevance in TBI Global outcome measures summarize the overall impact of TBI, incorporating functional status, independence, and role participation Duration of coma, level of consciousness, and rate of recovery contribute significantly to functional outcome and have a key role in treatment and disposition planning Objective measures of neuropsychological functions, such as attention, memory, and executive function, are very sensitive to effects of TBI and often affect everyday activities and social role participation Psychological issues associated with TBI that affect outcomes include adjustment problems, personality changes (eg, impulsivity), or mood disturbances. In addition, substance use disorders are prevalent in persons with TBI and can have a substantial impact on long-term outcomes TBI-related symptoms include somatic (eg, headaches, visual disturbances), cognitive (eg, attention and memory difficulties), and emotional (eg, irritability) symptoms. They commonly are reported after TBI or concussion and may persist in some cases at all levels of TBI severity Behavioral dysfunction commonly is reported after TBI and may contribute to difficulties in return to work/ school, personal relationships, and social functioning. Common examples are aggression and childlike behavior Cognitive activity measures describe the impact of neuropsychological impairments on cognitively loaded real-world tasks, such as instrumental ADLs, functional communication, and health and safety-related behaviors People with TBI (particularly severe TBI) may manifest difficulties in physical or neurologic functioning, including cranial or peripheral nerve damage; impairment in motor functioning, strength, and/or coordination; or impairment in sensation. These impairments may contribute to difficulties performing day-to-day activities safely and independently Participation is defined by the WHO as “involvement in life situations”3 and commonly includes engagement in endeavors within one’s community. TBI affects many areas of participation, including work/productive activity, recreation and leisure pursuits, and social/family role function TBI may create significant limitations in multiple areas of functioning and well-being, often reducing perceived quality of life with regard to multiple generic and disease-specific dimensions Health economic measures assess the magnitude of benefit in relation to costs spent; eg, they identify the most cost-effective therapeutic procedure in terms of cost per QALY No single measure to date can adequately capture the multiplicity of difficulties that people with TBI may face. This domain includes emerging large-scale measurement tools for patient-reported outcomes across several domains for generic medical populations, neurologic compromise, and TBI-related symptoms Abbreviation: QALY, quality-adjusted life-year. domain. Subgroups of panel members developed initial lists of potential measures within each domain and provided information about the criteria detailed. Potential measures were discussed among the entire panel through a series of conference calls, and a more limited set of measures for each outcome domain was selected for further discussion in the panel at a face-to-face meeting in March 2009. In preparation for the meeting, all panel members assisted in composing a series of detailed tables with relevant information about general administration characteristics, psychometric properties, and advantages and limitations of each potential measure. The primary objective of the meeting was to further examine, refine, and limit the list of potential outcome measures by using the information collected and reviewed. In accordance with other CDE work groups, a final set of measures was selected and organized into the 3 tiers described after further discussion of the relative advantages and limitations of each measure. Selection of the final measures for each CDE level was done by work group consensus. Description and Selection of Core, Supplemental, and Emerging CDEs The rationale behind the core measures was to create a primary set of well-established measures that cover outcome domains important to many studies. Primary emphasis was Arch Phys Med Rehabil Vol 91, November 2010 given to selecting a single measure (or limited set of measures) that best covered each domain. For the core measures, established use in subjects with TBI and favorable psychometric characteristics were considered primary criteria. Brevity and ease of administration also influenced the selection of core measures because the intent was to recommend measures that would be feasible to administer in a reasonable time (ie, ⬍90min). Availability of different validated formats, such as self and other proxy response formats, also was considered given that self-report is impossible or unreliable for some people with TBI. Finally, applicability of each measure across a range of postinjury functional levels also was considered highly important because one of the primary objectives of the CDEs was to foster comparability of outcome measurement across different studies. The rationale behind creating a set of supplemental measures was to recommend additional measures in each domain that could be considered for more in-depth outcome assessment within a certain domain or for patients at a specific functional level. For example, in studies in which neuropsychological outcome is of particular interest, investigators may draw on additional outcome measures from the supplemental list that target additional aspects of cognitive functioning not covered by the core measures (eg, visual memory, verbal fluency, fine motor control). Similarly, in studies focusing on patients with 1653 CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde Table 2: Core, Supplemental, and Emerging Measures for Each Domain Domain Name Global outcome Recovery of consciousness Neuropsychological impairment Core Measure(s) GOS-E RAVLT TMT Processing Speed Index from WAIS-III/WAIS-IV Psychological status BSI-18 TBI-related symptoms Behavioral function Cognitive activity limitations Physical function RPQ Social role participation Perceived generic and diseasespecific health-related quality of life Health economic measures Patient-reported outcomes (future multidimensional tools) CHART-SF SWLS Supplemental Measures MPAI-4 DRS SF-36, version 2 CRS-R BVMT-R Letter-Number Sequencing subtest of the WAIS-III/WAIS-IV COWAT CWIT Digit Span subtest of the WAIS-III/ WAIS-IV Word Reading subtest of the WRAT-4 GPT MMPI-2-RF AUDIT Substance use questions from the TBI Model Systems data set ASSIST PCL-C/M/S FAD NSI FrSBe Cog-FIM FIM motor subscale Emerging Measure(s) NIH Toolbox cognitive battery NIH Toolbox emotional battery NIH Toolbox motor and sensory batteries NOS-TBI PART QOLIBRI EuroQOL disorders of consciousness, the CRS-R was recommended because it was designed specially for assessment of this target population. Finally, additional measures of psychological and/or family functioning or substance abuse may be of importance, depending on the study design, functional level, or target population. The third tier consists of emerging measures, meaning those currently under development, in validation, or nearing publication and that have significant potential to be superior to some older (legacy) measures in the core or supplemental sets. These emerging measures were selected because they fill existing gaps in the measurement of TBI-related sequelae or use more sophisticated validation techniques than older measures. Additionally, some of these measures may better facilitate comparison across patient groups (eg, different disease populations, broader age range, more comprehensive sampling of domains of function). Because established use in subjects with TBI and psychometric characteristics were considered primary criteria for core and supplemental measures, the emerging measures will require further consideration as CDEs as evidence accumulates about their psychometric characteristics, normative data, and utility in TBI research. In this vein, the work group acknowledges that the selection of recommended outcome measures is a flexible and dynamic process that will undergo further evolution as additional evidence emerges and as testing of these measures as CDEs is PROMIS Neuro-QOL TBI-QOL undertaken. For example, some subtests from the WAIS-III were selected as core measures, although version IV has been released and the WAIS-III may not be available for purchase from the publisher in the future. Although version III currently is recommended because of its better fit with the selection criteria, particularly its use in TBI research, we acknowledge that the WAIS-IV versions of the subtests are likely to replace the WAIS-III versions in the core set pending further research, and that either version is acceptable because both measures will yield reliable and valid results. With any effort such as this attempt to create a set of CDEs, there is a dynamic tension between the desire to maintain consistency among a stable set of measures and the desire to adopt new improved measures as they become available. All core and supplemental measures listed here have been selected as recommended measures at the time of this publication; nevertheless, the work group advises the reader to consult the CDE web site for any updates to this listing. It is particularly important to track the progress of emerging measures because these are believed to have the potential to replace items in the existing core and/or supplemental set. RECOMMENDATIONS FOR TBI OUTCOME MEASURES Recommended CDEs (all 3 tiers) are listed in table 2 and described briefly next. The reader also is referred to www. Arch Phys Med Rehabil Vol 91, November 2010 1654 CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde CommonDataElements.ninds.nih.gov for detailed supplemental information about each measure. Core Data Elements Glasgow Outcome Scale (Extended). The GOS4 is a singleitem scale that summarizes patient status in 1 of 5 categories: dead, vegetative state, severe disability, moderate disability, and good recovery. The GOS-E5 is a revision of the GOS that provides 8 categories of outcome: dead, vegetative state, lower severe disability, upper severe disability, lower moderate disability, upper moderate disability, lower good recovery, and upper good recovery. GOS-E ratings are based on a structured interview and are easily recoded to GOS ratings. Together, these scales are the most commonly used TBI global outcome measure, and their use permits comparison to much of the world literature on TBI outcome. Rey Auditory Verbal Learning Test. This measure of word list learning is brief, available in the public domain, covers a wide age range, and has alternate forms. The RAVLT is one of the most widely studied measures of cognition, has extensive normative data,6-8 and has been used in different languages, cultures, and ethnic groups. It has good psychometric properties and is sensitive to neurologic conditions. The RAVLT will be used for validating the episodic memory measure of the NIH Toolbox or will be included in the Toolbox itself. Trail Making Test. The TMT9 is a measure of attention, speed, and mental flexibility. It is brief, widely used by neuropsychologists,10 sensitive to TBI-associated cognitive impairment, and reliable.11 Demographically adjusted normative data are available for a wide age range,9 and there are adult and child versions. Arabic, Chinese, and Hebrew versions are available. Practice effects are found over short retest intervals, but disappear after several administrations; at longer intervals, scores show only modest change in healthy adults. WAIS-III/WAIS-IV Processing Speed Index. This index is based on the Digit Symbol Coding and Symbol Search subtests of the WAIS-III (or WAIS-IV),12,13 which has extensive normative data and excellent psychometric properties. As a measure of information processing rate, it is highly sensitive to the effects of TBI and its severity. It has been used in different languages, cultures, and ethnic groups and is usable across literacy levels. This measure is being used as a legacy measure to validate NIH Toolbox processing speed measures.12,14,15 Brief Symptom Inventory 18. The BSI-1816 is a short form of the Symptom Checklist-90-Revised.17 It is a brief self-report measure of psychological distress with 3 subscales (Depression, Anxiety, and Somatization) and a Global Severity Index. The BSI-18 was selected as a core measure because of its brevity, global assessment of common psychological issues in people with TBI, and sound psychometric characteristics. It can be used to monitor change in response to treatment and can be completed using paper-and-pencil or computerized administration formats. Rivermead Post Concussion Symptom Questionnaire. The RPQ is a measure of postconcussion symptom presence and severity after TBI. It contains 16 items, which the participant rates in relation to premorbid functioning by means of written self-report or in-person or telephone interview. The RPQ has been used most often in assessing postconcussion symptoms in persons with mild to moderate TBI, but also has been used in patients with severe TBI,18 and the measurement of symptoms contained in this measure may be applicable at all levels of severity. The RPQ was selected as a core measure based on its sound psychometric characteristics and capacity to detect clinical changes in patients with mild TBI. The scale has been used Arch Phys Med Rehabil Vol 91, November 2010 to investigate the relationship between behavioral and neurophysiologic markers of injury19-21 and outcome prediction.22 FIM Cognition Subscale. The FIM was selected as a core measure of both physical and cognitive activity limitations because of its widespread clinical use in TBI populations, multiple validated response formats (observational ratings, self- or proxy-report in person or by telephone),23,24 and extensive use in studies of diagnostic accuracy, outcome prediction, and treatment effectiveness. Item response analysis of the FIM has confirmed a motor domain consisting of 13 items and a cognitive domain consisting of 5 items.25 There is low correlation between the Cog-FIM and mental and physical health measures, suggesting discriminant validity.24 Ceiling effects may limit the FIM’s utility for longitudinal studies of TBI, although ceiling effects are less extreme for the Cog-FIM versus the Motor FIM in those with moderate/severe TBI.26,27 Craig Handicap Assessment and Reporting Technique Short Form. The CHART-SF was designed to provide a simple objective measure of the degree to which impairments and disabilities result in handicaps (participation restriction) in the years after initial rehabilitation.28 It contains the 6 Raschvalidated subscales of Physical Independence, Cognitive Independence, Mobility, Occupation, Social Integration, and Economic Self-Sufficiency. It shows good interrater, test-retest, and subject-proxy reliability. It has been shown to discriminate between people with TBI and stroke who report lower scores than those with other disabilities.29,30 CHART Cognitive Independence scores correlate more highly with Cog-FIM scores than FIM motor subscale scores.29 Satisfaction With Life Scale. The SWLS is a global measure of life satisfaction.25 The SWLS consists of 5 items that are completed by the subject. It has shown consistent differences between populations that would be expected to have different quality of life (eg, psychiatric patients or male prison inmates). The SWLS also has been found to change in the expected directions in response to major life events31 and in patients receiving psychotherapy.32 Supplemental Data Elements JFK Coma Recovery Scale–Revised. The CRS-R is a standardized behavioral assessment instrument designed to measure neurobehavioral function in patients with disorders of consciousness.33 It is composed of 6 subscales designed to assess auditory, visual, motor, oromotor/verbal, communication, and arousal functions. The CRS-R is the only standardized assessment measure that directly incorporates diagnostic criteria for coma, vegetative state, MCS, and emergence from MCS and thus is strongly recommended for all studies of disorders of consciousness. The CRS-R shows adequate sensitivity and specificity,34-36 correlates well with functional outcome,37 is useful for monitoring treatment effectiveness,3,38 and is available in 11 languages. Mayo-Portland Adaptability Inventory. The MPAI-4 (and the PI, a component of the MPAI-4)39-41 was designed for outcome measurement after acquired brain injury in the postacute stage of recovery. The MPAI-4 is the product of 15 years of development using item response and classic psychometric theory and has established concurrent, construct, and predictive validity. There is a total score and subscale scores for Ability, Adjustment, and Participation. Strengths include ease of administration, flexibility (by telephone or in person), and normative values based on ratings by people with brain injury, significant others, and clinical staff. The PI (independent of the entire MPAI-4) has not been used extensively in research applications, although studies support the use of the entire MPAI-4 with adult and pediatric samples.42-44 CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde Disability Rating Scale. The DRS is intended to measure general functioning during the course of recovery.45 The DRS provides a single score based on level of arousal; cognitive ability to perform basic ADLs, including eating, grooming, and toileting; independence in the home; and employability. The DRS was selected because it is applicable across a wide range of injury severity and recovery intervals. The DRS may be useful in studies of subjects with moderate to severe TBI with serial measurement,46 particularly when initial measurement occurs in the acute postinjury interval. Medical Outcomes Study 36-Item Short Form Health Survey. The SF-36 is the most widely used subjective health status measure, developed by using classic psychometric test theory methods.47 It contains 11 items and generates subscale scores in physical functioning, physical role function, emotional role function, bodily pain, vitality, mental well-being, social functioning, and general health perception. An additional item assesses changes in health status during the last year. Two summary scores can be computed, a physical component score and a mental component score. In TBI research, more than 30 studies using the SF-36 showed high internal consistencies of all scales,48 as well as sound values for construct, discriminant, and content validity49,50 and sensitivity to treatment-related changes. The work group is mindful that the SF-12, has been selected as a core element by the PTSD Workgroup.51 However, the SF-12 items are contained within and may be scored from the SF-36.52 Thus, use of the latter scale, which we recommend for TBI studies for comprehensive evaluation, will readily permit comparison with studies using the SF-12. Brief Visuospatial Memory Test–Revised. The BVMT-R is a multitrial measure of visual-spatial memory/learning requiring reproduction of geometric forms. It was normed for a wide adult age range (age range, 18 to ⱖ79y) and has no significant sex- or education-related effects. It has good testretest reliability for total score and interrater reliability. The BVMT-R has multiple alternate forms, correlates well with other measures of memory, and shows sensitivity to neurologic conditions. It is being used as a legacy measure for validating the NIH Toolbox measure of episodic memory.53 WAIS-III/WAIS-IV Letter-Number Sequencing subtest. This is a measure of auditory working memory that appears in both the WAIS-III and Wechsler Memory Scale-Third Edition (or WAIS-IV).12,13 The subtest has extensive normative data and good psychometric properties, as well as clinical sensitivity. This measure is being used as a legacy measure to validate the NIH Toolbox working memory measure.14,54,55 Controlled Oral Word Association Test. The COWAT56 measures attentional control, working memory, and other components of executive function. There is a strong association between focal frontal lobe injuries after TBI and impaired performance on the COWAT.57 Several alternate forms and a Spanish version are available. It does not have a low ceiling in people without neurologic disorders.15 Demographically adjusted normative data are available for ages 20 to 85 years.9 Color-Word Interference Test. The CWIT,58 a variant of the Stroop procedure, measures cognitive flexibility, selective attention, and the capacity to inhibit an overlearned response. The CWIT version is a subtest from the Delis-Kaplan Executive Function System. Normative data are available for people aged 8 to 89 years. WAIS-III or WAIS-IV Digit Span subtest. This test12,13 provides a brief assessment of auditory attention. The digits backward component is particularly informative as a simple measure of working memory. The test is widely available, easy and quick to administer, and well normed. Digit Span has been used as a marker of cognitive deficit and recovery.59,60 Its 1655 potential use as a symptom validity measure is an added benefit.61,62 Word Reading subtest of the WRAT-4. The WRAT-463 provides a quick measure of academic achievement based on well-established norms. Studies have shown stability in people with TBI, allowing results to be used as an estimate of premorbid cognitive ability.64,65 Results can be affected adversely by visual difficulty, severe language disorder, and preexisting learning disability. Grooved Pegboard Test. The GPT has proved to be a sensitive indicator of brain functioning, with diminished performance noted after even milder injury. It is readily available, easy and quick to administer, and well normed. The GPT can be used to document existing deficits and predict outcome.66,67 One drawback is that performance can be influenced by peripheral injury, such as arm or hand fracture or problems with visual acuity. Minnesota Multiphasic Personality Inventory 2, Restructured Form. The MMPI-2-RF is a revised 338-item version of the MMPI-2. There are 50 scales: Restructured Clinical Scales, Validity Scales, Specific Problem Scales, Interest Scales, and Personality Psychopathology Five Scales.68 The MMPI-2 is the most extensively used and researched of the comprehensive personality assessment tools. The MMPI-2-RF provides a more time-efficient approach to using the MMPI-2. It is psychometrically up to date and is linked to current models of psychopathology and personality. Alcohol Use Disorders Identification Test. The AUDIT69 and the substance use questions from the TBI Model Systems data set indicate the extent of “problematic” substance use. The AUDIT was developed by the WHO and has been used extensively with a range of populations, including persons with TBI.70 An abbreviated 3-item version (AUDIT-C)71 screens for extent of alcohol consumption. The substance use questions from the TBI Model Systems data set query the use of alcohol and other drugs and are based on questions from populationbased surveys, thus allowing comparisons with statistics from the general population.72 A dichotomous variable indicating the presence of problem substance use (unhealthy use) can be derived from these questions. Alcohol, Smoking, and Substance Use Involvement Screening Test. For a more comprehensive assessment of substance use, the ASSIST73 also was developed by the WHO, has been validated in 9 countries, and is easily administered, reliable, and valid. Recently completed work indicates that the ASSIST is sensitive to change and specifically to the effects of a brief intervention.73 PTSD Check List–Civilian, –Military, and –Stressor Specific versions. The PTSD Checklist is a 17-item self-report measure composed of the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition symptoms of PTSD.74 Respondents rate on a 5-point scale how much they were bothered by each symptom “in the past month.” The PTSD ChecklistMilitary Version asks about problems in response to “stressful military experiences.” The PTSD Checklist-Civilian Version is not focused on one traumatic event. The PTSD ChecklistStressor Specific Version requires the respondent to provide responses in relation to a specific event. The PCL measures also can be used for evaluation of PTSD severity and to monitor change in response to treatment. These are public domain measures and are widely used. Family Assessment Device. The FAD75 is a 60-item selfreport instrument based on the McMaster Model of Family Functioning.76 Patients and/or family members read and respond to the items. The FAD assesses structural and organizational properties of the family group and the patterns of transArch Phys Med Rehabil Vol 91, November 2010 1656 CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde actions among family members that have distinguished between healthy and unhealthy families. It can be used to evaluate the family unit in a broadly defined manner to include any type of committed or enduring relationship structure. The FAD has 7 subscales: 1 General Functioning scale, which assesses overall health and pathology of the family, and 6 dimensional subscales: Problem Solving, Communication, Roles, Affective Responsiveness, Affective Involvement, and Behavior Control. Subscale reliabilities and test-retest reliability are adequate, and it has low correlations with social desirability and moderate correlations with other self-report measures of family functioning.77 Neurobehavioral Symptom Inventory. The NSI, also known as the Post Mild TBI Symptom Checklist,78 was designed to measure 22 common postconcussion symptoms after TBI without regard to the existence of preinjury symptoms. The severity of each symptom is measured using a 5-item scale (0 indicates none to 4 indicates very severe) that asks participants to indicate the extent to which each symptom has disturbed them in the previous 2 weeks. NSI total score is the sum of severity ratings of the 22 symptoms. Cluster scores (physical, cognitive, affective, and sensory domains) were derived.78 The NSI was selected as a supplemental measure because it currently is being used by both the U.S. Department of Defense and the VA as part of their Operation Iraqi Freedom/Operation Enduring Freedom TBI evaluation process of postconcussional symptoms. Frontal Systems Behavior Scale. The FrSBe79 assesses the severity of behavior problems associated with frontal lobe function as rated by the injured person and/or a caregiver. There is a total score and 3 subscales (apathy, disinhibition, and executive dysfunction) confirmed by using factor analysis.80 Norms are available based on sex, age, and education for persons with TBI and caregivers. The sample of non-neurologically impaired people used to derive the norms was small and constrained in terms of education and race, but the FrSBe has been used effectively in a few studies.81,82 EuroQoL. The EuroQoL is a generic self-rating instrument to assess health-related quality of life and health status. It generates an index of health for use in economic evaluation, has good psychometric properties, is available in many languages, and consists of self-rating of a set of health states and background information about the respondent’s health. The 5 dimensions of mobility, self-care, usual activities, pain/discomfort, and anxiety/depression result in a health state profile. Combined with clinical data (eg, survival) it gives qualityadjusted life-years. In TBI, the instrument has been used in some outcome studies with good success.83-87 Please see Supplementary Table 1 for information related to the psychometric properties of all core and supplemental measures. Emerging Data Elements NIH Toolbox. The NIH Toolbox (Cognitive, Emotional, Motor, and Sensory components) is part of the NIH Blueprint initiative. It seeks to assemble brief comprehensive assessment tools that will be useful in a variety of settings with particular emphasis on measuring outcomes in epidemiologic studies and clinical trials across the life span. The ultimate goal is to help improve communication within and between fields of biomedical research to advance knowledge by using CDEs. The battery will examine various cognitive (episodic memory, language, processing speed, working memory, executive functions, attention), emotional (negative affect, positive affect, stress and coping, social relationships), sensory (vestibular, audition, olfaction, taste, vision), and motor functions (dexterity, strength, locomotion, Arch Phys Med Rehabil Vol 91, November 2010 endurance, balance). The battery is designed to measure these domains in subjects aged 3 through 85 years, is normed for both English and Spanish speakers, and will be available at a nominal cost and will take no more than 2 hours to administer. The battery has gone through extensive work to identify and pretest the constructs to be measured. Validation of the NIH Toolbox batteries have been completed, with norming planned in about 4500 subjects (please see http://www.nihtoolbox.org for additional information). Neurological Outcome Scale for TBI. The NOS-TBI is a brief measure of neurologic functioning (including level of consciousness, cranial nerve functioning, limb strength, language, ataxia) modeled after the NIH Stroke Scale, but containing items specific to and validated in a TBI population. The NOS-TBI contains 15 items, some of which have subparts (eg, for lateralization). Administration and scoring guidelines are provided for patients who are comatose, obtunded, or aphasic, rendering this a measure that can be used across a wide range of injury severity and chronicity. Available preliminary psychometric results indicate excellent reliability and validity.88-90 Participation Assessment with Recombined Tools. The PART is a measure of community participation developed by the TBI Model Systems by combining the primary measures found in the TBI literature (Community Integration Questionnaire, original and revised91,92; Participation Objective; Participation Subjective93; and the CHART).28 The measure has been administered to persons with TBI and other sources of disability and to a population-based sample. Psychometric data have not yet been published, but available results indicate that the PART is reliable and valid, maintaining the strengths and overcoming some of the weaknesses of its component measures. Quality of Life After Brain Injury. The QOLIBRI is the first TBI disease-specific quality-of-life cross-culturally and consensually developed patient-reported outcome tool for clinical trials and individual use (www.qolibrinet.com).94-96 It has been validated in 2 large multinational TBI populations (N⬎1500, N⬎900) with different grades of disease, showing good psychometric properties. Based on classic and modern test theory, it yields 37 items in 6 Likert-formatted scales, 4 assessing satisfaction (Cognition, Self, Daily Life and Autonomy, Social Relationships) and 2 assessing the feeling of botheredness (Emotions and Physical Problems). A total score and a 6-item screener also are available. It is brief, will be in the public domain from February 2010 onward, and exists in more than 10 languages. Finally, there are 3 interrelated measurement systems (the PROMIS, Neuro-QOL, TBI-QOL) of patient-reported outcomes measures being developed to measure emotional functioning, social participation, and physical and medical functioning across a wide array of domain areas. Patient-Reported Outcomes Measurement Information System. The PROMIS97 is a new measurement system that is part of the NIH Roadmap to improve the clinical research enterprise. The PROMIS Network has developed and tested a large bank of items measuring patient-reported outcomes over several domains, including physical functioning, sleep disturbance, fatigue, anxiety, depression, anger, social roles, and social activities. Item banks have been calibrated, allowing the test to be administered as a computerized adaptive test or as short forms to ensure brevity. Researchers can select domains of functioning relevant to their specific research question. The PROMIS is designed as a generic measure that is to be used across all medical populations. Neuro-QOL. The Neuro-QOL also is a patient-reported outcome measurement system funded through a contract CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde method by the NINDS.98,99 The Neuro-QOL team has developed separate item banks covering the domains of Mobility/ Ambulation, ADLs/Upper Extremity, Depression, Anxiety, Positive Psychological Functioning, Stigma, Perceived and Applied Cognition (includes communication), Social Role Performance, Social Role Satisfaction, Fatigue, Personality and Behavioral Change, and Sleep Disturbances. Embedded in several of the Neuro-QOL domains are a significant number of PROMIS items. The Neuro-QOL is designed to be a common outcome variable across all clinical trials research sponsored by the NINDS. Traumatic Brain Injury–Quality of Life. The TBI-QOL is a new multifaceted patient-reported outcomes measure that is in development.100,101 It will embed Neuro-QOL and PROMIS items and will cover the domains of functioning of Perceived and Applied Cognition (includes communication), Personality and Behavioral Change (includes impulsivity), Depression, Anxiety, Positive Psychological Functioning, Stigma, Social Role Performance, Social Role Satisfaction, Fatigue, and Sexuality. Five TBI Model System centers and 4 VA Polytrauma/ Defense and Veterans Brain Injury Centers are collaborating to develop this instrument. Because the PROMIS, Neuro-QOL, and TBI-QOL contain common items and have been developed as calibrated item banks using item response theory, the researcher will not administer common item banks from these instruments (eg, both the TBI-QOL and PROMIS depression item banks), but instead select one or the other. Linking tables or cross-walks between the PROMIS, Neuro-QOL, and TBI-QOL will be developed, allowing researchers to compute a PROMIS and Neuro-QOL equivalency score from TBI-QOL item banks. Similarly, PROMIS equivalency scores will be derived for people who complete the relevant Neuro-QOL item banks. For additional information about all core, supplemental, and emerging CDEs, please consult www.CommonDataElements. ninds.nih.gov. This site contains a table with descriptions of the measures; a listing of variables and permissible values; information about administration length, training requirements, and appropriate populations for use; and references or contact information. FUTURE ISSUES AND RESEARCH NEEDS The work group identified several areas in which additional research would enhance outcome measurement in TBI. First, as indicated in the discussion of emerging measures, there is a need for further validation and testing of measures, such as the NIH Toolbox, to specifically evaluate their utility in TBI. Second, TBI causes characteristic cognitive and communication impairments that can compromise the validity of selfreport. Despite the great promise of the new patient-reported outcome measures under development, the work group also identified the need for more research about the applicability and validity of proxy report. In addition, as has been done for other populations, such as those with serious mental illness, we see the need for further development of standardized measures that directly test performance on cognitively demanding activities, such as using transportation, adhering to medication schedules, and exercising judgment in the home and community. Third, we recognize that measurement of vocational outcomes is minimally represented in the CDE recommended here, a circumstance that partly reflects the state of the science. More work is needed to develop standard measures of employment post-TBI, taking into account the diversity of important outcomes (return to work vs new employment, long-term job maintenance, pay, satisfaction, and so on). Finally, the work group acknowledged the need for additional measures of ex- 1657 ecutive functioning that keep pace with theoretical developments in clinical neuroscience. SUMMARY In accordance with other CDE work groups, the following 3 tiers of CDE were recommended: (1) 9 core measures covering outcome domains relevant to most TBI studies that could be applied as either a comprehensive battery or in addition to other outcome measures selected by the investigator, (2) supplemental measures for consideration in TBI research focusing on more specific topics or subpopulations, and (3) emerging measures, which include promising instruments currently under development, in the process of validation, or nearing the point of published findings that have significant potential to be superior to some measures currently in the core and supplemental lists. Selection of the CDE measures is intended to facilitate comparison of findings from large-scale research efforts designed to document the natural course of recovery from TBI, enhance the prediction of outcome, and/or measure the effects of treatment. The work group acknowledges that although these measures were chosen after substantial review of available evidence and discussion within the group, any selection of CDE is a dynamic process that must accommodate some shift and evolution in the measures within each category as new evidence emerges and selected measures continue to be tested. References 1. Thurmond VA, Hicks R, Gleason T, et al. Advancing integrated research in psychological health and traumatic brain injury: common data elements. Arch Phys Med Rehabil 2010;91: 1633-6. 2. World Health Organization. International Classification of Functioning, Disability and Health. Geneva: World Health Organization; 2001. 3. Schnakers C, Hustinx R, Vandewalle G, et al. Measuring the effect of amantadine in chronic anoxic minimally conscious state. J Neurol Neurosurg Psychiatry 2008;79:225-7. 4. Jennett B, Bond M. Assessment of outcome after severe brain damage. Lancet 1975;1:480-4. 5. Wilson JT, Pettigrew LE, Teasdale GM. Structured interviews for the Glasgow Outcome Scale and the extended Glasgow Outcome Scale: guidelines for their use. J Neurotrauma 1998; 15:573-85. 6. Schmidt M. Rey Auditory Verbal Learning Test. Los Angeles: Western Psychological Services; 1996. 7. Mitrushina M, Boone KB, Razani J, D’Elia LF. Handbook of normative data for neuropsychological assessment. 2nd ed. New York: Oxford University Pr; 2005. 8. Ivnik RJ, Malec JE, Tangalos EG, Peterson RC, Kokmen E, Kurland LT. Mayo’s older American’s normative studies: updated AVLT norms for ages 56 to 97. Clin Neuropsychol 1992; 6:83-104. 9. Heaton RK, Miller SW, Taylor MJ, Grant I. Revised comprehensive norms for an expanded Halstead-Reitan Battery: demographical adjusted neuropsychological norms for African American and Caucasian adults professional manual. Lutz: Psychological Assessment Resources Inc; 2004. 10. Rabin LA, Barr WB, Burton LA. Assessment practices of clinical neuropsychologists in the United States and Canada: a survey of INS, NAN, and APA Division 40 members. Arch Clin Neuropsychol 2005;20:33-65. 11. Strauss E, Sherman EMS, Spreen O. Rey-Osterrieth Auditory Verbal Learning Test. Compendium of neuropsychological tests. New York: Oxford University Pr; 2006. p 776 – 807. 12. Wechsler D. Wechsler Adult Intelligence Scale III. San Antonio: Harcourt Assessment Inc; 1997. Arch Phys Med Rehabil Vol 91, November 2010 1658 CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde 13. Wechsler D. Wechsler Adult Intelligence Scale IV. San Antonio: Harcourt Assessment Inc; 2008. 14. WAIS-III-WMS-III technical manual. San Antonio: Psychological Corp; 2008. 15. Strauss E, Sherman EMS, Spreen O. A compendium of neuropsychological tests: administration, norms, and commentary. 3rd ed. New York: Oxford University Pr; 2006. 16. Derogatis LR. Brief Symptom Inventory 18 (BSI 18): administration, scoring and procedures manual. Minneapolis: NCS Pearson Inc; 2001. 17. Derogatis LR. Symptom Checklist-90-R (SCL-90-R): administration, scoring and procedures manual. 3rd ed. Minneapolis: NCS Pearson, Inc; 1994. 18. King NS, Crawford S, Wenden FJ, Moss NE, Wade DT. The Rivermead Post Concussion Symptoms Questionnaire: a measure of symptoms commonly experienced after head injury and its reliability. J Neurol 1995;242:587-92. 19. Ingebrigtsen T, Romner B, Marup-Jensen S, et al. The clinical value of serum S-100 protein measurements in minor head injury: a Scandinavian multicentre study. Brain Inj 2000;14: 1047-55. 20. Wilde EA, McCauley SR, Hunter JV, et al. Diffusion tensor imaging of acute mild traumatic brain injury in adolescents. Neurology 2008;70:948-55. 21. Smits M, Dippel DW, Houston GC, et al. Postconcussion syndrome after minor head injury: brain activation of working memory and attention. Hum Brain Mapp 2009;30:2789-803. 22. Crawford S, Wenden FJ, Wade DT. The Rivermead Head Injury Follow up Questionnaire: a study of a new rating scale and other measures to evaluate outcome after head injury. J Neurol Neurosurg Psychiatry 1996;60:510-4. 23. Tepper S, Beatty P, DeJong G. Outcomes in traumatic brain injury: self-report versus report of significant others. Brain Inj 1996;10:575-81. 24. Hobart JC, Lamping DL, Freeman JA, et al. Evidence-based measurement: which disability scale for neurologic rehabilitation? Neurology 2001;57:639-44. 25. Diener E, Emmons RA, Larsen RJ, Griffin S. The Satisfaction With Life Scale. J Pers Assess 1985;49:71-5. 26. Hall KM, Bushnik T, Lakisic-Kazazic B, Wright J, Cantagallo A. Assessing traumatic brain injury outcome measures for longterm follow-up of community-based individuals. Arch Phys Med Rehabil 2001;82:367-74. 27. Hammond FM, Hart T, Bushnik T, Corrigan JD, Sasser H. Change and predictors of change in communication, cognition, and social function between 1 and 5 years after traumatic brain injury. J Head Trauma Rehabil 2004;19:314-28. 28. Whiteneck GG, Charlifue SW, Gerhart KA, Overholser JD, Richardson GN. Quantifying handicap: a new measure of longterm rehabilitation outcomes. Arch Phys Med Rehabil 1992;73: 519-26. 29. Mellick D, Walker N, Brooks CA, Whiteneck GG. Incorporating the cognitive independence domain into CHART. J Rehabil Outcomes Meas 1999;3:12-21. 30. Walker N, Mellick D, Brooks CA, Whiteneck GG. Measuring participation across impairment groups using the Craig Handicap Assessment Reporting Technique. Am J Phys Med Rehabil 2003;82:936-41. 31. Vitaliano PP, Russo J, Young HM, Teri L, Maiuro RD. Predictors of burden in spouse caregivers of individuals with Alzheimer’s disease. Psychol Aging 1991;6:392-402. 32. Pavot W, Diener E. Review of the Satisfaction With Life Scale. Psychol Assess 1993;5:164-72. 33. Giacino JT, Kalmar K, Whyte J. The JFK Coma Recovery Scale-Revised: measurement characteristics and diagnostic utility. Arch Phys Med Rehabil 2004;85:2020-9. Arch Phys Med Rehabil Vol 91, November 2010 34. Schnakers C, Giacino J, Kalmar K, et al. Does the FOUR score correctly diagnose the vegetative and minimally conscious states? Ann Neurol 2006;60:744-5; author reply 745. 35. Schnakers C, Majerus S, Giacino J, et al. A French validation study of the Coma Recovery Scale-Revised (CRS-R). Brain Inj 2008;22:786-92. 36. Schnakers C, Vanhaudenhuyse A, Giacino J, et al. Diagnostic accuracy of the vegetative and minimally conscious state: clinical consensus versus standardized neurobehavioral assessment. BMC Neurol 2009;9:35. 37. Giacino JT, Kalmar K. The vegetative and minimally conscious states: a comparison of clinical features and functional outcome. J Head Trauma Rehabil 1997;12:36-51. 38. Schiff ND, Giacino JT, Kalmar K, et al. Behavioural improvements with thalamic stimulation after severe traumatic brain injury. Nature 2007;448:600-3. 39. Malec JF. Comparability of Mayo-Portland Adaptability Inventory ratings by staff, significant others and people with acquired brain injury. Brain Inj 2004;18:563-75. 40. Malec JF. The Mayo-Portland Participation Index: a brief and psychometrically sound measure of brain injury outcome. Arch Phys Med Rehabil 2004;85:1989-96. 41. Malec JF, Kragness M, Evans RW, Finlay KL, Kent A, Lezak MD. Further psychometric evaluation and revision of the MayoPortland Adaptability Inventory in a national sample. J Head Trauma Rehabil 2003;18:479-92. 42. Winstanley J, Simpson G, Tate R, Myles B. Early indicators and contributors to psychological distress in relatives during rehabilitation following severe traumatic brain injury: findings from the Brain Injury Outcomes Study. J Head Trauma Rehabil 2006;21: 453-66. 43. Murrey GJ, Hale FM, Williams JD. Assessment of anosognosia in persons with frontal lobe damage: clinical utility of the MayoPortland Adaptability Inventory (MPAI). Brain Inj 2005;19:599603. 44. Oddson B, Rumney P, Johnson P, Thomas-Stonell N. Clinical use of the Mayo-Portland Adaptability Inventory in rehabilitation after paediatric acquired brain injury. Dev Med Child Neurol 2006;48:918-22. 45. Granger CV. The emerging science of functional assessment: our tool for outcomes analysis. Arch Phys Med Rehabil 1998;79: 235-40. 46. Hall K, Cope DN, Rappaport M. Glasgow Outcome Scale and Disability Rating Scale: comparative usefulness in following recovery in traumatic head injury. Arch Phys Med Rehabil 1985;66:35-7. 47. Ware JE Jr, Sherbourne CD. The MOS 36-Item Short-Form Health Survey (SF-36). I. Conceptual framework and item selection. Med Care 1992;30:473-83. 48. Findler M, Cantor J, Haddad L, Gordon W, Ashman T. The reliability and validity of the SF-36 Health Survey questionnaire for use with individuals with traumatic brain injury. Brain Inj 2001;15:715-23. 49. McNaughton HK, Weatherall M, McPherson KM. Functional measures across neurologic disease states: analysis of factors in common. Arch Phys Med Rehabil 2005;86:2184-8. 50. Emanuelson I, Andersson Holmkvist E, Bjorklund R, Stalhammar D. Quality of life and post-concussion symptoms in adults after mild traumatic brain injury: a population-based study in western Sweden. Acta Neurol Scand 2003;108:332-8. 51. Kaloupek D. Common data elements for posttraumatic stress disorder research. Arch Phys Med Rehabil 2010;91:1684-91. 52. Resnick B, Parker R. Simplified scoring and psychometrics of the revised 12-item Short-Form Health Survey. Outcomes Manag Nurs Pract 2001;5:161-6. CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde 53. Benedict RH. Brief Visuospatial Memory Test-Revised. Odessa: Psychological Assessment Resources Inc; 1997. 54. Donders J, Tulsky DS, Zhu J. Criterion validity of new WAIS-II subtest scores after traumatic brain injury. J Int Neuropsychol Soc 2001;7:892-8. 55. DeLuca J, Chelune GJ, Tulsky DS, Lengenfelder J, Chiaravalloti ND. Is speed of processing or working memory the primary information processing deficit in multiple sclerosis? J Clin Exp Neuropsychol 2004;26:550-62. 56. Spreen O, Benton AL. Neurosensory Center comprehensive examination for aphasia. Victoria: Univ of Victoria Neuropsychology Laboratory; 1977. 57. Henry JD, Crawford JR. A meta-analytic review of verbal fluency performance in patients with traumatic brain injury. Neuropsychology 2004;18:621-8. 58. Delis DC, Kaplan E, Kramer JH. Delis-Kaplan Executive Function System. San Antonio: The Psychological Corp; 2001. 59. Kersel DA, Marsh NV, Havill JH, Sleigh JW. Neuropsychological functioning during the year following severe traumatic brain injury. Brain Inj 2001;15:283-96. 60. Millis SR, Rosenthal M, Novack TA, et al. Long-term neuropsychological outcome after traumatic brain injury. J Head Trauma Rehabil 2001;16:343-55. 61. Greiffenstein MF, Baker WJ, Gola T. Validation of malingered amnesia meaures with a large clinical sample. Psychol Assess 1994;6:218-24. 62. Meyers JE, Volbrecht M. Validation of reliable digits for detection of malingering. Assessment 1998;5:303-7. 63. Wilkinson GS, Robertson GJ. Wide Range Achievement Test-4 professional manual. Lutz: Psychological Assessment Resources Inc; 2006. 64. Orme DR, Johnstone B, Hanks R, Novack TA. The WRAT-3 Reading Subtest is a measure of premorbid intelligence among persons with brain injury. Rehabil Psychol 2004;49:250-3. 65. Ball JD, Hart RP, Stutts ML, Turf E, Barth JT. Comparative utility of Barona formulae, Wtar demographic algorithms, and WRAT-3 reading for estimating premorbid ability in a diverse research sample. Clin Neuropsychol 2007;21:422-33. 66. Boake C, Millis SR, High WM Jr, et al. Using early neuropsychologic testing to predict long-term productivity outcome from traumatic brain injury. Arch Phys Med Rehabil 2001;82:761-8. 67. Sherer M, Sander AM, Nick TG, High WM Jr, Malec JF, Rosenthal M. Early cognitive status and productivity outcome after traumatic brain injury: findings from the TBI Model Systems. Arch Phys Med Rehabil 2002;83:183-92. 68. Tellegen A, Ben-Porath YS. MMPI-2-RF (Minnesota Multiphasic Personality Inventory-2): technical manual. Minneapolis: University of Minneapolis Pr; 2008. 69. Saunders JB, Aasland OG, Babor TF, de la Fuente JR, Grant M. Development of the Alcohol Use Disorders Identification Test (AUDIT): WHO Collaborative Project on Early Detection of Persons with Harmful Alcohol Consumption–II. Addiction 1993;88:791804. 70. Ponsford J, Whelan-Goodinson R, Bahar-Fuchs A. Alcohol and drug use following traumatic brain injury, a prospective study. Brain Inj 2007;21:1385-92. 71. Bush K, Kivlahan DR, McDonell MB, Fihn SD, Bradley KA. The AUDIT alcohol consumption questions (AUDIT-C): an effective brief screening test for problem drinking. Ambulatory Care Quality Improvement Project (ACQUIP). Alcohol Use Disorders Identification Test. Arch Intern Med 1998;158:1789-95. 72. Corrigan JD, Bogner J, Lamb-Hart G, Sivak-Sears N. Technical report on problematic substance use variables 2003. Available at: http://www.tbims.org/combi/subst. Accessed September 4, 2009. 73. Humeniuk R, Dennington V, Ali R. The effectiveness of a brief intervention for illicit drugs linked to the Alcohol, Smoking and Substance 74. 75. 76. 77. 78. 79. 80. 81. 82. 83. 84. 85. 86. 87. 88. 89. 90. 91. 92. 93. 1659 Involvement Screening Test (ASSIST) in primary health care settings: a technical report of phase III findings of the WHO ASSIST randomized controlled trial 2008. Available at: http://www. who.int/substance_abuse/activities/assist/en/index.html. Accessed August 17, 2009. Weathers F, Litz B, Herman D, Huska J, Keane T. The PTSD Checklist (PCL): reliability, validity, and diagnostic utility. Proceedings of the 9th Annual Convention of the International Society of Traumatic Stress Studies, October 25, San Antonio, Texas; 1993. Epstein NB, Baldwin LM, Bishop DS. The McMaster Family Assessment Device. J Marital Fam Ther 1983;9:171-80. Epstein NB, Bishop DS, Ryan C, Miller IW, Keitner GI. The McMaster model view of healthy family functioning. In: Walsh F, editor. Normal family processes. New York: Guilford Pr; 1993. p 138-60. Miller IW, Epstein NB, Bishop DS, Keitner GI. The McMaster Family Assessment Device: reliability and validity. J Marital Fam Ther 1985;11:345-56. Cicerone KD, Kalmar K. Persistent postconcussion syndrome: the structure of subjective complaints after mild traumatic brain injury. J Head Trauma Rehabil 1995;10:1-17. Grace J, Malloy PF. Frontal Systems Behavior Scale professional manual. Lutz: Psychological Assessment Resources Inc; 2001. Stout JC, Ready RE, Grace J, Malloy PF, Paulsen JS. Factor analysis of the Frontal Systems Behavior Scale (FrSBe). Assessment 2003;10:79-85. Norton LE, Malloy PF, Salloway S. The impact of behavioral symptoms on activities of daily living in patients with dementia. Am J Geriatr Psychiatry 2001;9:41-8. Reid-Arndt SA, Nehl C, Hinkebein J. The Frontal Systems Behaviour Scale (FrSBe) as a predictor of community integration following a traumatic brain injury. Brain Inj 2007;21:1361-9. EuroQol—a new facility for the measurement of health-related quality of life. The EuroQol Group. Health Policy 1990;16:199-208. Bell KR, Temkin NR, Esselman PC, et al. The effect of a scheduled telephone intervention on outcome after moderate to severe traumatic brain injury: a randomized trial. Arch Phys Med Rehabil 2005;86:851-6. Klose M, Watt T, Brennum J, Feldt-Rasmussen U. Posttraumatic hypopituitarism is associated with an unfavorable body composition and lipid profile, and decreased quality of life 12 months after injury. J Clin Endocrinol Metab 2007;92:3861-8. Jakola AS, Muller K, Larsen M, Waterloo K, Romner B, Ingebrigtsen T. Five-year outcome after mild head injury: a prospective controlled study. Acta Neurol Scand 2007;115:398-402. EQ-5D value sets: inventory, comparative review and user guide. Dordrecht: Springer; 2008. McCauley SR, Wilde EA, Kelly TM, et al. The Neurological Outcome Scale for Traumatic Brain Injury (NOS-TBI): II. Reliability and convergent validity. J Neurotrauma 2010;27:991-7. Wilde EA, McCauley SR, Kelly TM, et al. Feasibility of the Neurological Outcome Scale for Traumatic Brain Injury (NOSTBI) in adults. J Neurotrauma 2010;27:975-81. Wilde EA, McCauley SR, Kelly TM, et al. The Neurological Outcome Scale for Traumatic Brain Injury (NOS-TBI): I. Construct validity. J Neurotrauma 2010;27:983-9. Willer B, Ottenbacher KJ, Coad ML. The Community Integration Questionnaire. A comparative examination. Am J Phys Med Rehabil 1994;73:103-11. Johnston MV, Goverover Y, Dijkers M. Community activities and individuals’ satisfaction with them: quality of life in the first year after traumatic brain injury. Arch Phys Med Rehabil 2005;86:735-45. Brown M, Dijkers MP, Gordon WA, Ashman T, Charatz H, Cheng Z. Participation Objective, Participation Subjective: a Arch Phys Med Rehabil Vol 91, November 2010 1660 94. 95. 96. 97. 98. CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde measure of participation combining outsider and insider perspectives. J Head Trauma Rehabil 2004;19:459-81. von Steinbuechel N, Petersen C, Bullinger M. Assessment of health-related quality of life in persons after traumatic brain injury— development of the QOLIBRI, a specific measure. Acta Neurochir Suppl 2005;93:S43-9. von Steinbuechel N, Wilson L, Gibbons H, et al. Quality of Life After Brain Injury (QOLIBRI)—scale validity and correlates of quality of life. J Neurotrauma 2010;27:1167-85. von Steinbuechel N, Wilson L, Gibbons H, et al. Quality of Life After Brain Injury (QOLIBRI)—scale development and metric properties. J Neurotrauma 2010;27:1157-65. Ader D. Developing the Patient-Reported Outcomes Measurement Information System (PROMIS). Med Care 2007;45(Suppl):S1-2. Perez L, Huang J, Jansky L, et al. Using focus groups to inform the Neuro-QOL measurement tool: exploring patient-centered, Arch Phys Med Rehabil Vol 91, November 2010 health-related quality of life concepts across neurological conditions. J Neurosci Nurs 2007;39:342-53. 99. Miller D, Nowinski C, Victorson D, Peterman A, Perez L. The Neuro-QOL project: establishing research priorities through qualitative research and consensus development. Qual Life Res 2005;14:2031. 100. Tulsky DS, Carlozzi NE, Kisala P, Victorson D, Cella D. New initiatives to develop patient reported outcomes measures for research with neurological populations. Proceedings of the International Neuropsychological Society, February 11–14, 2009, Atlanta, GA; 2009. 101. Tulsky DS, Kisala P, Victorson D, Carlozzi NE, Cella D. Development of tailored outcomes measures for traumatic brain injury and spinal cord injury. Proceedings of the American Academy of Neurology, April 25–May 2, 2009, Seattle, Washington; 2009. Supplementary Table 1: Psychometric Properties of Recommended TBI Outcomes CDEs in Core and Supplemental Tiers Measure Global outcome GOS/GOS-E IRR of the GOS is very good, with 92%–95% agreement.1,2 Agreement for standard in-person administration for the GOS-E is less at ⬃78%.2 TRT weighted ␬ is .92 for in-person vs telephone GOS and GOS-E, which is excellent. Absolute agreement for in-person vs telephone was 71%–77% for GOS-E and 86%–90% for GOS. Interrater weighted ␬ was .85 and .84 for GOS and GOS-E, respectively.3 Weighted ␬ was .94 and .98 for TRT reliability using postal questionnaires for GOS and GOS-E, respectively. Overall agreement is ⬃85% for both. Agreement between the telephone and mail-administered questionnaire is not as strong, at 68.6% for GOS-E and 86% for GOS.4 Despite excellent reliability data in many reports, others have reported misclassification rates of 17%– 40% for GOS outcomes in clinical trials, with resulting decreases in power.5 Good ICC10 by Rasch (person reliability⫽.88; item reliability⫽.99) and classic metrics (Cronbach ␣⫽.89); good interrater agreement on individual items among staff, patients, and significant others, with 58%–88% agreement within ⫾1.11 The GOS is sensitive to recovery from 3– 6mo postinjury, but less sensitive for recovery from 6mo–1y postinjury.1 GOS scores at hospital discharge are not valid predictors of return to work at 6mo and predicted only 6-mo GOS scores for those who did not reach good recovery.6 The 3-mo GOS score predicted 12mo and 56% showed improvement from 3–12mo, partially countering concerns about sensitivity to change.7 However, in another study, 3-mo GOS score predicted 15-mo GOS score for patients with good early outcome, but not those with poorer early outcome.8 Patients in this study had milder injuries. GOS-E scores are associated with neuropsychological test findings and disability measures. indicating validity as an index of TBI outcome.9 Additional Psychometric Concurrent/construct validity established by Sensitive to change in studies of correlation with DRS (␳⫽.81).12 rehabilitation Bohac et al13 reported that MPAI factors correlated interventions14,16,18 and to with associated neuropsychological measures. frontal lobe damage.19 Predictive validity shown through correlations of admission MPAI ratings with outpatient rehabilitation outcomes, ie, Goal Attainment Scaling (␳⫽⫺.47), Independent Living Scale (␳⫽⫺.26), Vocational Independence Scale (␳⫽⫺.32).14 Using logistic regression, Malec et al15 showed that staff MPAI (␹2⫽8.30; P⬍.01) and time since injury (␹2⫽9.70; P⬍.01) were the best predictors (69% correct classification) of job placement after participation in vocational rehabilitation. Malec16 found that staff MPAI was the best predictor of long-term vocational (correct classification⫽67%; ␹2⫽5.33; P⬍.05) and independent living outcome (correct classification⫽70%; ␹2⫽6.85; P⬍.01) 1y after completion of comprehensive day rehabilitation in a logistic model that included age, education, severity of injury, Other 1660.e1 Arch Phys Med Rehabil Vol 91, November 2010 Validity CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde MPAI-4 Reliability DRS Reliability Additional Psychometric traumatic vs nontraumatic injury, time since injury, and Rasch-converted staff MPAI score. Malec and Degiorgio17 reported that logistic regression of the MPAI and time since injury could be used to estimate the probability of community-based employment as a result of outpatient rehabiliitation. Concurrent validity was established in the initial Rasch analysis was completed on publication for the DRS,20 in which the 8 DRS item scores at abnormality ratings of auditory, visual, and rehabilitation admission for 266 somatosensory brain evoked potentials cases. Composite scores of 1–29 significantly correlated with DRS ratings were obtained (0⫽normal, (r⫽.35–.78). 30⫽dead: clients rated ”normal” Additional validation of the scale is documented were omitted). Findings were as in a published article by Hall et al.23 follows: relative level of difficulty Correlation of DRS with simultaneously obtained between admission and GOS scores at 2 times was shown in a sample discharge ratings of DRS items of 70 TBI inpatients (r⫽.50 at admission, r⫽.67 for 256 cases was consistent; at discharge).24 range of difficulty reflected in Gouvier21 found Spearman ␳ correlation was .92 the scale is excellent, from items between the rehabilitation admission DRS and measuring very simple Stover Zeiger Scale (1976). Rehabilitation functioning to those measuring discharge DRS correlated, with .81 for the complex functions.21 Each of the following domains discharge Stover Zeiger Scale, .80 for the are scored: Eye Opening, GOS,1,251 and .85 for the GOS-E.26 Communication Ability, Motor Response, Feeding, Toileting, Grooming, Level of Functioning, and Employability. Difficulty levels of the 3 items Cognitive Ability for Feeding, Toileting, and Grooming were very similar. There is a gap between Cognitive Ability for Feeding, Toileting and Grooming and Level of Functioning (ie, ability to live independently) and between the latter and Employability. The functional difficulty of each item is substantially different, with no intervening items to reflect intermediate abilities, consistent with the observation of less sensitivity to change in the DRS in people at high functional levels. Other A limitation of the DRS is its relative insensitivity at the low end of the scale (mild TBI) and inability to reflect more subtle but sometimes significant changes in a person within a specific limited window of recovery. Average DRS scores at rehabilitation admission, discharge, 1y, and 2y postinjury for all cases with data in the TBIMS database were analyzed for ceiling and floor effects. Ceiling is defined as mean score of 0, 1, or 2 on the DRS (top 10% of scale). These ceiling scores define independent or modified independent status. The DRS has virtually no ceiling effect at discharge, 1y, and 2y postinjury on a consistent sample over time. Results including all cases with data available at any time were similar, with sample sizes ranging from 598–206. The DRS was developed with the continuum of recovery in mind, consistently shows good scale properties, and predicts employment well. At 1y postinjury, 28% of FIM and FIM⫹FAM scale reflects independence/modified independence (scores⫽6 and 7 on 7-point scale) and only 10% of DRS summed score represents this level of independence (scores⫽0, 1 and 2 on 30-point scale). This difference gives the DRS an advantage in regard to ceiling effect. CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde IRR of DRS was established among 3 raters on a sample of 88 TBI rehabilitation inpatients.20 Pearson correlations were .97–.98. In a separate study by Gouvier et al,21 Spearman ␳ correlation coefficient was .98 in 3 raters on a sample of 37– 45 subjects. Novack et al22 reported IRR in a study of 27 severely brain-injured persons. A comparison of DRS ratings by family members vs rehabilitation professionals yielded significant correlations for both rehabilitation admission (r⫽.95) and discharge (r⫽.93) ratings. TRT reliability was shown by Gouvier,21 reporting Spearman ␳ correlation of .95. Validity 1660.e2 Arch Phys Med Rehabil Vol 91, November 2010 Supplementary Table 1 (Cont’d): Psychometric Properties of Recommended TBI Outcomes CDEs in Core and Supplemental Tiers Measure Supplementary Table 1 (Cont’d): Psychometric Properties of Recommended TBI Outcomes CDEs in Core and Supplemental Tiers Measure Reliability Validity Additional Psychometric SF-36 1660.e3 Arch Phys Med Rehabil Vol 91, November 2010 There are ⬎200 studies of ICC and ⬎30 with Validity has been established in numerous studies.29,30 data for TRT reliability. In TBI specifically: Reliability estimates of single scales Findler et al28 reported convergent validity in generally are ⬎.80. 326 patients; correlations of physical SF-36 Sum score reliabilities (physical, mental) scales with Physical Symptoms scale of SCL generally are ⬎.90. were ⫺.50 to ⫺.63, and with the HPL were In TBI specifically: ⫺.60 to ⫺.75. There were robust correlations MacKenzie et al27 reported in patients with multiple injuries ((N⫽1197; 45% between BDI-II scores and SF-36 scales, with with head injuries): ␣ coefficient⫽.77 the largest value for Mental Health (⫺.77). (GH) to .93 (Physical Functioning). McNaughton et al31 examined construct validity of the mental and physical CS Findler et al28 reported scale ␣ scores shown in joint factor analysis with coefficients of .79 –.92 in several functional measures in 89 patients. moderate/severe TBI patients (N⫽228), In a study examining discriminant validity by .83–.91 in mild TBI (N⫽98), and .68 –.87 Paniak et al,32 significant differences in 271 healthy controls. between 120 MTBI patients and 120 healthy controls in all SF-36 scales (except GH), mental CS, and physical CS were found. In another study of discriminant validity by Emanuelson et al,33 reduced values were found on all SF-36 subscale, mental CS, and physical CS scores in a study of 173 MTBI patients and age-/sex-matched healthy controls. CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde In summary, Rasch analysis provided transformed scores for use in interval scale data analyses and validated observations about the DRS: a scale that measures a wide range of disability with less sensitivity at the high end (mild TBI). Items discriminate varying levels of disability well and relative difficulty of items remains constant between admission and discharge. In analysis of TBIMS national database data, average DRS untransformed score at rehabilitation admission was 12 (rounded); at discharge, 5; and at 1y postinjury follow-up, 3, in a sample of 70 cases with complete data for all times. Other Recovery of consciousness CRS-R TMT Validity Additional Psychometric Other ICC, TRT reliability, and IRR were shown to be good to excellent by original investigators34 (IRR, r⫽.84; TRT, r⫽.94). Schnakers et al35 reported IRR to be good (␬⫽.80) in a French CRS-R validation study.36 A recently completed Norwegian study found IRR (r⫽.65–.82) and TRT (r⫽.77–.83) to be acceptable to very good and noted that both IRR and TRT reliability were influenced by level of experience with the CRS-R. Reliability data also are available for specific CRS-R subscales, with most values in the moderate to good range.34,36,37 Reliability data for the original version of the CRS were published by Giacino et al38 and O’Dell et al.39 Criterion validity has been shown in comparative analyses with the GCS,34,40 DRS,34,37 WHIM,36 and FOUR.36 Total CRS-R scores significantly correlated with WHIM, FOUR, and GCS scores in both acute and long-term patient samples.36 However, CRS-R performance is related most closely to scores on the WHIM (r⫽.76), a scale designed primarily for use in rehabilitation settings. Lower correlations have been reported with the FOUR (r⫽.63) and GCS (r⫽.59), which are intended for use in intensive care and trauma settings, respectively. The CRS-R has been used in a Diagnostic validity has been range of studies exploring the established in 4 separate relationship between behavioral studies investigating the and neurophysiologic markers sensitivity and specificity of the of consciousness.41-43 Evidence CRS-R for detection of 34,36,40 of cognitive processing after MCS. Giacino et al34 found that the exposure to linguistic stimuli CRS-R detected behavioral has been reported in 3 fMRI signs of consciousness in 10 of studies involving patients who 80 patients misdiagnosed with failed to show behavioral signs VS on the DRS. of conscious awareness.41-43 40 The scale also has been used to Similarly, Schnakers et al reported that the CRS-R characterize the course of identified 7 cases of MCS (n⫽25) recovery from VS, MCS,44,45 and locked-in syndrome46 and has in which VS was misdiagnosed sufficient sensitivity to capture by using the FOUR. salient functional changes A more recent study by associated with pharmacologic Schnakers et al36 found that MCS was diagnosed in 45 of 77 interventions35 and deep brain stimulation.47 patients with disorders of consciousness after examination with the CRS-R compared with 36, 32, and 24 for the WHIM, FOUR and GCS. TRT reliability is good for total recall over 5 trials, .60–.70 over 1y.48 Internal reliability of total score is high (␣ coefficients⬎.90).49 Extensive literature regarding good validity, including construct, criterion, and predictive. For specific information, refer to Strauss et al.50 TRT reliability varies with age range and population studied, but is adequate, especially for Part B. Dikmen et al51 tested 384 healthy adults who were retested 11mo after the initial session. Coefficients were adequate for Part A (.79) and high for Part B (.89). Similar findings were reported by Levine et al.52 Parts A and B are moderately intercorrelated (r⫽.31–.36), suggesting they measure similar, but somewhat different, functions. TMT is sensitive to a wide range of neurologic disorders, including TBI. TMT completion time shows a dose-response relationship with TBI severity: time increases with increasing TBI severity.55 Sensitive to a variety of diseases of the brain and their severity. Has been used in TBI. Sensitive to change. Has good floor and ceiling. Has extensive norms. Refer to Strauss.50 Practice effects are found over short retest intervals, but disappear after several administrations. After longer intervals, TMT scores show only modest change in healthy adults. Performance on TMT is affected by age, with performance declining as age increases. Familiar and widely-used and accepted measure of memory and learning. For many years, it was part of the TBIMS data set. It is a legacy measure of episodic memory of the NIH Toolbox. CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde Neuropsychological impairment RAVLT Reliability 1660.e4 Arch Phys Med Rehabil Vol 91, November 2010 Supplementary Table 1 (Cont’d): Psychometric Properties of Recommended TBI Outcomes CDEs in Core and Supplemental Tiers Measure Supplementary Table 1 (Cont’d): Psychometric Properties of Recommended TBI Outcomes CDEs in Core and Supplemental Tiers Measure Validity Reliabilities in clinical groups are not as high. Goldstein & Watson53 found similar reliability coefficients (.69 –.94 for Part A; .66 –.86) for various neurologic groups. IRR has been reported as .94 for Part A and .90 for Part B.54 Longitudinal studies have reported marked heterogeneity of TMT outcome after moderate to severe TBI; 5y after injury, a substantial proportion of persons with moderate to severe TBI continued to show deficits on TMT. In Millis et al,56 43% showed marked impairment on Part A and 33% had impaired performance on Part B. TMT may be less useful in mild TBI. Poor discrimination was reported by Iverson57 in differentiating mild TBI from substance abuse. Several studies have shown that psychosocial outcome after TBI can be predicted by TMT. Good construct and criterion validity. Very good sensitivity to acquired brain damage. For more specific information, refer to WAISIII/ WMS III manual58 and Strauss.50 Highly correlated with HVLT, VR WMS, and Rey Figure (r⫽.65–.80); probably measures verbal and nonverbal memory; seems to show reasonable convergent and divergent validities.50 WAIS-III Processing Speed Index Internal consistency is high at .80–.89, as well as TRT reliability at 80–.89 (WAIS/ WMS technical manual58)50 BVMT-R TRT is .80 for total score, which is very good for a memory measure. IRR is high (.90.)50,59 WAIS-III NumberLetter Sequencing subtest COWAT ICC is .80–.89: TRT reliability is .70–.79.50 Other IQ has a moderate relationship with TMT. Sex has little impact on performance. Cultural and linguistic variables may affect test scores. Extensive normative data through This is a widely known index from the Wechsler norming and WAIS III. It is a legacy measure additional studies.50 of Processing Speed for the NIH Toolbox. A variety of studies support its One of the measures chosen by sensitivity to neurologic Matrics for studies in condition of the brain.50 schizophrenia based on extensive review of the literature. It also is 1 of the 2 legacy measures for the Memory Measure of the NIH Toolbox. Extensive normative data through It is a legacy measure for the the Wechsler plus additional Working Memory measure for studies the NIH Toolbox. COWAT has been used in treatment studies (eg, Sarno et al65). Higher education level is associated with better performance on COWAT. There is little evidence of sex differences on COWAT. 1660.e5 Arch Phys Med Rehabil Vol 91, November 2010 ICC is high: coefficient ␣ is .83 using total number of words generated for each letter as individual items.60 In healthy adults, TRT reliability typically is ⬎.70.51 IRR is high (.9961). Good criterion, construct, discriminant validities. Good clinical sensitivity. Refer to WAIS/WMS III manual58 and Strauss.50 Correlations among phonemic fluency tasks (eg, FAS, CFL) are high, ranging from .85–.94. Phonemic fluency shows a stronger relationship to Verbal IQ (r⫽.42–.48) than Performance IQ (r⫽.29 –.36). COWAT is sensitive to severity of TBI.62 A metaanalysis63 found that as with patients with focal frontal (but not temporal) lobe injuries, TBI patients were impaired similarly on tests of phonemic and semantic fluency. Phonemic fluency also was significantly more sensitive to the presence of TBI than the Wisconsin Card Sorting Test. In a mixed neurologic sample, Burgess et al64 found that poor performance on COWAT was moderately associated with caregiver ratings of patients’ problems in everyday life and patients’ lack of insight into their problems (r⫽.29 –.35). Additional Psychometric CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde Reliability Measure Reliability ICC is .70 –.79. TRT reliability is .70 –.79. WAIS-III Digit Span subtest TRT stability coefficient is .83. Average reliability coefficient across age is .90.58 ICC reliability coefficient73 is .90 –.96 (by age). Alternate form reliability (by age) is .85–.95. WRAT-4 Word Reading subtest GPT TRT reliability50 is .67–.86 (at 4–24mo). Validity Additional Psychometric Stroop-like tests frequently have been used in a Although women tend to have wide variety of patient groups thought to have superior color-naming skills, executive function deficits. sex differences on the colorTBI patients typically are slower is responding to word interference condition are all conditions, although they do not not consistently present.69 Education is modestly associated consistently show disproportionate with interference score impairment on the interference condition (eg, (r⬍.3069). Batchelor et al66). Stroop-like tests may have limited diagnostic sensitivity in mild TBI.67 Baseline interference scores on the Goldenversion Stroop were predictive of functional status at 1-y follow-up in patients with vascular dementia.68 The Digit Span test, particularly the Digits Backward component, has been identified as a marker of cerebral disorder after TBI.56,70 External validity73: correlations with the following measures: WIAT-II Word Reading, .71; Woodcock JohnsonIII Basic Reading .66; KTEA-II Comprehensive Letter/Word Rec, .76; WAIS-III FSIQ, .79. Studies have focused on earlier versions of the Word Reading subtest, but administration has not changed. With learning disability screened out, WRAT-3 oral reading was considered a reasonable predictor of premorbid ability.74 Similar results were obtained, with the WRATR reading subtest predicting verbal intellectual ability.75 Stability of WRAT-3 Word Reading across 1y in people with TBI was shown by Orme et al,76 although slight nonsignificant increases were evident in the most severely injured. External validity50: correlations with the following measures: Tapping Speed, ⫺.35; Near Visual Acuity, ⫺.62; Reaction Time, .31; TMT-B, .46; Digit Symbol, ⫺.60; Block Design, ⫺.34; Object Assembly, ⫺.45. More than 70% of those experiencing moderatesevere TBI show impairment on the GPT using established cutoff values.77,78 The GPT is among tests in this population that predict outcome in terms of productivity.78-80 Other Can be used as a measure of symptom validity.71,72 CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde CWIT 1660.e6 Arch Phys Med Rehabil Vol 91, November 2010 Supplementary Table 1 (Cont’d): Psychometric Properties of Recommended TBI Outcomes CDEs in Core and Supplemental Tiers Supplementary Table 1 (Cont’d): Psychometric Properties of Recommended TBI Outcomes CDEs in Core and Supplemental Tiers Measure Psychological status BSI-18 Additional Psychometric Validity analyses using TBI outpatients (n⫽176) After controlling for demographic found BSI-18 GSI correlated significantly with and TBI injury characteristics, psychosocial and functional outcomes: NFI the BSI-18 accounted for 4% of depression (r⫽.68), NFI aggression (r⫽.55), total variance in FIM scores, PANAS negative affectivity (r⫽.49).81 3% of total variance in DRS In community populations, BSI-18 scales have scores, 3% of total variance in shown excellent correlation with the SCL-90-R CIM scores, and 8% of total (Pearson correlation coefficient for global variance in SWLS scores.81 severity⫽.93, somatization⫽.91, depressive⫽.93, anxiety⫽.96), which in turn has shown acceptable convergent validity with other measures of somatization, depression, and anxiety.82 The MMPI-2-RF has several validity scales that provide information regarding threats to the validity of a test protocol that must be considered before scores on the clinical scales can be interpreted: inconsistent responding, overreporting, and underreporting indexes. Extensive correlate data are listed in Appendix A of the technical manual.83 External validity data were gathered from a wide range of settings that document the convergent and discriminant validity and corroborate the construct validity of the substantive scales. Empirical correlates are reported for clinical, forensic, medical, and nonclinical samples. Correlates include a broad range of criteria, including therapist ratings, clinical diagnoses, intake staff ratings, admissions records, biographical information, and other self-report measures. The 338 items of the MMPI-2-RF are embedded within the MMPI-2 item pool. Hence, MMPI-2-RF profiles can be generated from original MMPI2 profiles. Factor analyses indicate a 2 factor structure (consumption and adverse consequences), supporting the use of the abbreviated AUDITC as a measure of consumption. As with other measures, AUDIT does not perform well with the elderly. AUDIT and AUDIT-C have been used successfully Other 1660.e7 Arch Phys Med Rehabil Vol 91, November 2010 AUDIT TRT reliability in TBI patients (median retest interval, 1y): GSI of .66; somatization of .67; depression of .63; anxiety of .57.81 In TBI outpatients (n⫽176), ICC estimates (Cronbach ␣) were GSI of .91, somatization of .75, depression of .84, anxiety of .83. For TBI inpatients (n⫽81), ICC estimates were lower: GSI of .84, somatization of .61, depression of .64, anxiety of .74.81 In community populations, the BSI-18 has good ICC (coefficient ␣ for global severity⫽.89, somatization⫽.74, depressive⫽.84; anxiety⫽.89).82 Extensive psychometric information for the 50 scales of the MMPI-2-RF is presented in chapt 3 of the technical manual.83 Estimates in the form of Cronbach ␣ coefficient are reported for men and women of the normative sample, an outpatient community mental health sample, a psychiatric inpatient sample tested at a general community hospital, and male psychiatric inpatients tested at a VA hospital. TRT reliability estimates are reported for a combined-sex subset of the MMPI-2 normative sample. Members of the sample completed the MMPI-2 twice, with 1wk between test administrations. Compared with the original MMPI-2, the MMPI-2-RF had similar or improved reliability. TRT correlations and ICC values of the Higher-Order, Restructured Clinical, and PSY-5 scales of the MMPI-2-RF mostly were ⬎.80. The ␣ values derived from the normative sample are somewhat lower because of truncated distributions. SEMs generally are ⱕ8 T score points, and most scales have SEMs ⱕ6 points. Mean ICC was .83 across 18 studies. TRT reliability ␬ range was .70 –.89 using a cutoff of 8; intraclass correlations range was .87–.95.84 Validity CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde MMPI-2-RF Reliability Reliability Additional Psychometric AUDIT-C TRT was .65–.85 for 3-mo interval,85 .98 for 1-mo interval.86 Sensitivity and specificity have been studied extensively, with satisfactory identification of both hazardous drinking and harmful use. Lower cutoff scores are recommended for women and for the identification of hazardous drinking vs harmful/dependence.84 TBIMS questions (based on BRFSS and NSDUH) BRFSS TRT ␬ for any alcohol use was .82, and for binge consumption was .64; correlation for number of drinks/mo was .72, with lower values for blacks and Hispanics.90 ASSIST Average TRT ␬ for question stems ranged from .58–.90; for substance class, from .61 for sedatives to .78 for opioids.93 Cronbach ␣ was ⬎.80 for most domains.94 PCL-C/M/S TRT stability coefficient over 2–3d was .96 for Vietnam veterans.96 ICC ␣ coefficients in Vietnam and Persian Gulf veterans,96 victims of MVCs, and sexual assault survivors were .97 and .94, respectively, with internal consistencies of .92 to .93 for each subscale).96 Population estimates derived from BRFSS questions correlate with similarly worded questions from the NSDUH (r⫽.82). Higher estimates are obtained from the latter version, which has been attributed to computerassisted administration.91 The questions have been included in both national surveys and the TBIMS national data set for ⬎10y and have been used in multiple studies for monitoring and analyzing national trends. Concurrent validity: significantly associated with the MINI Plus (r⫽.93 for lifetime use; r⫽.76 with MINI-derived score for severity of abuse and dependence), AUDIT (r⫽.82), and ASI frequency of use (r⫽.84). Construct validity: significant and positive correlations between ASSIST scores reflecting abuse and dependence and MINI-derived scores of severity of abuse or dependence (r⫽.76 and r⫽.75, respectively). Discriminant validity: discriminates between groups classified based on use, abuse, or dependence; better with discriminating between use and abuse (ROC⫽.84 –.97) than between abuse and dependence (ROC⫽.62– .84, except for sedatives, ROC⫽.45). Predictive validity: there are no significant differences between ASSIST scores obtained at baseline and 3-mo follow-up.95 Combat veterans with PTSD score significantly higher (63.58⫾14.14 [SD]) than those without PTSD (34.40⫾14.09).96 The same pattern is true with MVC-related and sexual assault PTSD.97 In Vietnam veterans, PCL-M significantly correlated with other measures of PTSD (Spearman ␳ range, .77–.93).96 with adolescents, psychiatric populations, and across various countries and cultures, with studies indicating adequate reliability and validity.87,88 AUDIT has been used with persons with TBI.89 Comparisons have been made between the general population and a population of persons hospitalized 1y earlier due to TBI.92 Other CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde Validity In a cross-cultural RCT, ASSIST was sensitive to change associated with an ASSISTlinked brief intervention (WHO ASSIST Phase III Study Group).95 Factor analysis for data derived from Persian Gulf war veterans suggested that items are best accounted for by a single factor.96 Diagnostic sensitivity and specificity of the PCL: cutoff of 50 on the PCL-M resulted in 1660.e8 Arch Phys Med Rehabil Vol 91, November 2010 Supplementary Table 1 (Cont’d): Psychometric Properties of Recommended TBI Outcomes CDEs in Core and Supplemental Tiers Measure There are several versions, including the PCL-C, PCL-S, and PCL-M. The PCL-C is available in Spanish. The PCL was developed by the National Center for PTSD and is in the public domain. It maps directly onto DSM criteria. Supplementary Table 1 (Cont’d): Psychometric Properties of Recommended TBI Outcomes CDEs in Core and Supplemental Tiers Measure Reliability Validity In Persian Gulf veterans, PCL-M score was associated significantly with another measure of PTSD (.85).96 The PCL-M highly correlated with the Mississippi Scale for Combat-Related PTSD (.93), the PK scale of the MMPI (.77), and the Impact of Event Scale (.90).98 FAD Significant correlations were reported between a head injury follow-up questionnaire regarding common problems after TBI (eg, problems conversing, problems with facets of community reentry, fatigue at work, getting along with others) and the patient’s RPQ total score (Spearman ␳⫽.67 at 3mo postinjury and ␳⫽.56 at 6mo postinjury).108 No differences were found between patient-completed and interview-format responses. Modest predictive validity (r⫽.37; P⬍.05) was reported109 between 1-wk and 6-mo RPQ scores. At 3mo after mild TBI, the RPQ distinguished between patients with and without PCS, and those who were vs were not “on sick leave” from work. Ingebrigtsen et al.110 reported a trend between RPQ total score and serum S-100B protein level in patients with mild TBI 24h postinjury. However, Savola & Hillbom111 found that S100B on hospital admission was a significant predictor of RPQ total score at 1mo postinjury. sensitivity of .82 and specificity of .84 in 1 study96 and sensitivity of .78 and specificity of .86 in another.97 Published cutoff values should be used with caution because they were derived from samples with high prevalence rates of current PTSD. Reviews of the PCL can be found in Orsillo99 and Norris & Hamblen.100 One confirmatory factor analytic Reviews of the FAD can be found study supported the factor in Epstein et al.101 The FAD has shown good structure of the FAD, finding reliability and validity across similar factor structure in cultural groups, including in nonclinical, psychiatric, and China, The Netherlands, Great medical samples,103 whereas a second found low goodness-ofBritain, Italy, and Canada and in fit indexes, but good residual the United States with different error fit indexes.102 racial groups. RPQ total score does not appear to be associated with age, sex, cause of injury, severity of injury (GCS score), or duration of PTA in patients with mild TBI.114 Chan115 found no sex effect on RPQ total score. Eyres et al116 reported that all RPQ items functioned well across age and sex. No differences in RPQ total score were noted between patients with chronic pain and mild TBI.117 A significant difference was found on RPQ total scores between adolescents with mild TBI and an uninjured control group assessed an average of 3d postinjury.113 Rasch analysis suggested the RPQ was not unidimensional and the authors suggested splitting 3 items (headache, dizziness, nausea) into a separate scale. The resulting RPQ-13 and RPQ3 performed well in terms of external construct validity with a head injury follow-up questionnaire (RPQ-13, .83; RPQ-3, .62) and 2-wk TRT reliability (RPQ-13, .89; RPQ-3, .72). 1660.e9 Arch Phys Med Rehabil Vol 91, November 2010 The measure’s developers107 presented scatterplots (no reliability coefficients reported) that suggest good TRT reliability during a 24-h period at a mean 8d postinjury for 41 adult patients with mild to moderate TBI. A second scatterplot was presented that included 46 adults with mild to severe TBI at ⬃6mo postinjury that suggested good TRT reliability during a mean 10-d TRT interval. Other CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde TBI-related Symptoms RPQ TRT stability coefficients across scales Validity: correlations between FAD scales and during 1wk ranged from .66 –.76.101 clinician ratings of family functioning based ICC: In a large white sample (n⫽1302), on a semistructured clinical interview Cronbach ␣ ranged from .73 for the Roles (McMaster Structured Interview For Families) scale to .87 for the General Function ranged from .38 –.62.104 Moderate correlation also was found between FAD scales and the scale; they were slightly lower in a similar FES and scales on the SCL-90-R.105 Hispanic sample (n⫽323), ranging from Concurrent validity was shown between .59 for the Roles scale to .82 for the children’s FAD ratings and mother’s ratings of General Function scale.102 These were slightly higher than in another study, in family functioning.106 which ␣ ranged from .57–.86; only the Roles scale had ␣⬍.70. In the latter study, ␣ values were marginally higher in clinical than nonclinical samples.103 Additional Psychometric NSI Validity Additional Psychometric Other Higher RPQ total scores related to greater activations in fMRI tasks of working memory and selective attention in patients with mild TBI.112 FA of the corpus callosum measured by using DTI significantly related (␳ ⫽.76) to RPQ total score in a sample of adolescents with mild TBI assessed an average of 3d postinjury.113 Schwab et al118 reported that Afghanistan Schwab118 reported a moderate association and Iraq war veterans reporting a between number of TBI-related problems probable TBI had a higher prevalence of reported on a TBI screening interview and having ⱖ3 problematic PCS (using the number of moderate/severe PCS symptoms NSI) symptoms than that of veterans who reported on the NSI (r⫽.477; P⬍.001). did not report TBI (64% vs 41%; P⬍.001). Schwab118 also showed that in participants reporting a TBI on a brief TBI screening interview, the prevalence of ⱖ3 moderate/severe PCS symptoms was higher in those with than in those without self-reported TBI-related problems (74% vs 40%; P⫽.003). Intrascale reliability (normative sample)119: Family (total), .92; Apathy, .78; Disinhibition, .80; Ex Dysfunction, .87. Self-rating (total), .88; Apathy, .72; Disinhibition, .75; Ex Dysfunction, .79. Intrascale reliability (neurologic sample): Family (total), .94; Apathy, .87; Disinhibition, .84 Ex Dysfunction: .91. Self-rating (total), .92; Apathy, .83; Disinhibition, .78; Ex Dysfunction, .84. Cognitive activity limitation Cog-FIM Interrater reproducibility was .95.125 Both Cog-FIM and Motor FIM have excellent Please see next ICC126 (␣⫽.86 –.97; ␣⫽.89 for Cog-FIM).125 section for general information about the FIM instrument. This section provides information specific to Cog-FIM items. Principal factor analysis yielded 3 Correlation of FrSBe to Neuropsychiatric factors corresponding to a Inventory120: total, r⫽.64; P⬍.001; Apathy, r⫽.37; P⫽.04; Disinhibition, r⫽.62; P⬍.001. priori domains of apathy, Construct validity: the FrSBe has shown disinhibition, and executive significant differences in before and after dysfunction that accounted for ratings for people with frontal system ⬎46% of variance.124 lesions121 and also has differentiated frontal lesion populations from controls.122 The FrSBe has shown stronger correlation with a measure of community reentry than tests of executive functioning.123 Correlates (.51) with WAIS VIQ.125 Low correlations (discriminant validity) with physical and mental health status measures.125 Predicts amount of supervision (vs physical assistance) received in the home setting.127 Predicts falls more robustly than Motor FIM in rehabilitation inpatients.128 Ceiling issues: in the TBIMS, maximum score (all items 7) was attained at 1y postinjury by 16% using the Cog-FIM total; 20% for Memory, 56% for Social Interaction, 45% for Comprehension/Expression.129 Between 1 & 5y postinjury, 26% improved on Cog-FIM, 61% stayed the same, 14% worsened.129 Routinely collected at most rehabilitation facilities, so it may be cost-effective to obtain in that setting. Extensive use in studies of TBI and a wide range of other patient populations. Allows comparison across patient populations. Sensitive to cognitive rehabilitation vs functional skills training approach in RCT of subacute TBI.130 CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde Behavioral function FrSBe Reliability 1660.e10 Arch Phys Med Rehabil Vol 91, November 2010 Supplementary Table 1 (Cont’d): Psychometric Properties of Recommended TBI Outcomes CDEs in Core and Supplemental Tiers Measure Supplementary Table 1 (Cont’d): Psychometric Properties of Recommended TBI Outcomes CDEs in Core and Supplemental Tiers Measure Physical function FIM Validity Extensive demonstration of construct and predictive validity in TBI and a wide range of other patient populations. The FIM has clinically appropriate validity and interrater agreement.131 CHART-SF subscales closely approximate scores of subscales gathered by the original CHART. Reliability data are based on original CHART without Cognitive Domain.140 IRR (1-wk interval) was .80 –.95 (n⫽135 with SCI). Subject-proxy was .69 (economic selfsufficiency), .28 (social integration), 0.80 – .83 for remaining scales (n⫽135 persons with SCI and proxies). CHART-SF subscales closely approximate scores of subscales gathered by the original CHART. Validity data are based on original CHART without Cognitive Domain.140 Rehabilitation professionals rated 135 persons with SCI as either high or low levels of handicap. CHART scores and subscales (except for economic self-sufficiency) were significantly different in the expected direction. Other Sensitive to improvements in Possible ceiling effect after 1y. function for up to 1y post-TBI. Because FIM ratings affect Evaluation of the metric facility reimbursement and properties of the FIM has been many facilities use FIM change reported extensively.132-136 as a quality indicator, there may Precision (ability of the be pressure for the lowest instrument to detect possible admission FIM and meaningful change in level of highest possible discharge FIM function during rehabilitation) scores to be obtained if rated has been high.137 by clinicians. In a Rasch analysis of the FIM, 2 separate domains of items were defined: the motor domain consisting of 13 items and the cognitive domain consisting of 5 items.133,136 Previous analyses of FIM data from the SCI Model Systems suggested that the cognitive domain may be inappropriate for people with SCI.138 Ceiling effects of the FIM at rehabilitation discharge and particularly at 1y postinjury were observed in the moderate and severely injured TBI population.139 Of the sample, 49% and 84% had attained independence (average score, 7 or 6) by discharge and 1y postinjury, respectively; ie, the FIM is not sensitive to more subtle changes expected after acute inpatient rehabilitation discharge. 1660.e11 Total FIM has outstanding reliability, with TRT reliability, IRR, and ICC much ⬎0.90.131 Additional Psychometric CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde Arch Phys Med Rehabil Vol 91, November 2010 Social role participation CHART-SF Reliability Health economic measures EuroQOL148 Validity Additional Psychometric TRT (2-wk interval) for Cognitive Independence was .87.141 Subject-proxy for Cognitive Independence was .81. Rasch analysis indicated satisfactory separation of items along the handicap dimension. Items within each subscale fit well. A sample of 2259 participants weighted to represent the population of Colorado in 1999 showed that those who reported no activity limitations scored significantly higher than those who reported activity limitations on the BRFSS. A sample of 1110 participants was administered the CHART (including Cognitive Independence), persons with TBI or stroke had lower scores than those with multiple sclerosis, SCI, amputation, or burn.141,142 Correlation coefficients were higher between CHART Cognitive Independence and Cog-FIM subscale compared with CHART Cognitive Independence and FIM motor subscale.141 ICC was ⬎.80.144 TRT (2-mo interval) was .82 in 76 students.143 With 2-wk interval, it was .89.144 Factor and Rasch analysis support a single factor; however, item 5 (If I could live my life over, I would change almost nothing) is the least well associated. Content validity: initially 48 items were included; factor analysis showed that 10 items loaded highly (⬎.60) on a factor reflecting cognitivejudgmental evaluative processes; 5 items were redundant, resulting in the current 5-item scale. Criterion-validity: original validation studies compared SWLS scores with 10 measures of subjective well-being; all correlated at rⱖ.50. Construct validity: TRT stability decreases as interval increases. Consistent differences between populations in the expected directions have been found. Scores change in expected directions when major life events occur. Normative data are available for people with TBI (TBIMS national database).145-147 Norms are available for other populations.144 The MVH Group149 reported that a large valuation study of a general British population of 221 respondents had very reliable mean ICCs of .78 for questions and .73 for the VAS. Van Agt et al150 assessed TRT reliability in a Dutch population (N⫽208) after TBI by using several specific method approaches that indicated good TRT reliability. Brazier et al151 found evidence for construct validity of the EuroQol comparing it with the SF-36 in a large British sample (N⫽1582). Sintonen152 reported correlations of the 15-D HRQOL with EuroQol, among others, giving evidence for construct validity. In an RA study, Hurst et al153 showed clinically relevant correlations with other condition-specific instruments, indicating EuroQol construct validity in RA. Sensitive to change-intervention effects. Other CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde Perceived generic and disease-specific HRQOL SWLS143 Reliability 1660.e12 Arch Phys Med Rehabil Vol 91, November 2010 Supplementary Table 1 (Cont’d): Psychometric Properties of Recommended TBI Outcomes CDEs in Core and Supplemental Tiers Measure NOTE. Supplemental measures are as follows: Global outcome: MPAI-4, DRS, and SF-36; Recovery of consciousness: CRS-R; Neuropsychological impairment: BVMT-R, WAIS-III Number-Letter Sequencing subtest, COWAT, CWIT, WAIS-III Digit Span subtest, WRAT-4 Word Reading subtest, and GPT; Psychological status: MMPI-2-RF, AUDIT, TBIMS questions (based on BRFSS and NSDUH), ASSIST, PCL-C/M/S, and FAD; Postconcussive/TBI symptoms: NSI; and Behavioral function: FrSBe. shaded in gray. Abbreviations: ASI, Addiction Severity Index; AUDIT-C, Alcohol Use Disorders Identification Test Alcohol Consumption Questions; BDI-II, Beck Depression Inventory-II; BRFSS, Behavioral Risk Factor Surveillance Survey; CFL, CFL verbal fluency test; CIM, Community Integration Measure; CS, component summary; DSM, Diagnostic and Statistical Manual of Mental Disorders; DTI, diffusion tensor imaging; FA, fractional anisotropy; FAM, Functional Assessment Measure; FAS, FAS verbal fluency test; FES, Family Environment Scale; fMRI, functional magnetic resonance imaging; FOUR, Full Outline of Unresponsiveness; FSIQ, full scale intelligence quotient; GCS, Glasgow Coma Scale; GH, General Health; GSI, ; HPL, Health Problems List; HRQOL, health-related quality of life; HVLT, Hopkins Verbal Learning Test; ICC, internal consistency; IRR, interrater reliability; KTEA-II, Kaufman Test of Educational Achievement, Second Edition; MINI Plus, Mini International Neuropsychiatric Interview Plus; MTBI, mild TBI; MVC, motor vehicle collision; MVH, measurement of valuation of health; NFI, Neurobehavioral Functioning Inventory; NSDUH, National Survey on Drug Use and Health; PANAS, Positive and Negative Affect Schedule; PCS, Postconcussion Syndrome; PK, Keane PTSD scales of the MMPI; PTA, posttraumatic amnesia; PSY-5, Personality Psychopathology Five; RA, rheumatoid arthritis; RCT, randomized controlled trial; ROC, receiver operating characteristic; SCI, spinal cord injury; SCL, Symptoms Check List; SCL-90-R, Symptom Checklist-90-Revised; TBIMS, TBI Model Systems; TRT, test-retest; VAS, visual analog scale; VIQ, verbal intelligence quotient; VS, vegetative state; VR WMS, Visual Reproduction subscale of the Wechsler Memory Scale; WHIM, Wessex Head Injury Matrix; WIAT-II, Wechsler Individual Achievement Test, Second Edition; WMS III, Wechsler Memory Scale, Third edition. In TBI, Klose et al found decreased scores on the EuroQoL VAS in patients with posttraumatic hypopituitarism 12mo after injury. Bell et al155 reported, among other measures, significantly increased EuroQoL scores as an effect of a scheduled telephone intervention in patients with moderate to severe TBI. Additional Psychometric Validity 154 Reliability Measure Supplementary Table 1 (Cont’d): Psychometric Properties of Recommended TBI Outcomes CDEs in Core and Supplemental Tiers Other CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde 1660.e13 References 1. Jennett B, Snoek J, Bond MR, Brooks N. Disability after severe head injury: observations on the use of the Glasgow Outcome Scale. J Neurol Neurosurg Psychiatry 1981;44:285-93. 2. Wilson JT, Pettigrew LE, Teasdale GM. Structured interviews for the Glasgow Outcome Scale and the extended Glasgow Outcome Scale: guidelines for their use. J Neurotrauma 1998; 15:573-85. 3. Pettigrew LE, Wilson JT, Teasdale GM. Reliability of ratings on the Glasgow Outcome Scales from in-person and telephone structured interviews. J Head Trauma Rehabil 2003;18:252-8. 4. Wilson JT, Edwards P, Fiddes H, Stewart E, Teasdale GM. Reliability of postal questionnaires for the Glasgow Outcome Scale. J Neurotrauma 2002;19:999-1005. 5. Lu J, Murray GD, Steyerberg EW, et al. Effects of Glasgow Outcome Scale misclassification on traumatic brain injury clinical trials. J Neurotrauma 2008;25:641-51. 6. Gabbe BJ, Simpson PM, Sutherland AM, et al. Functional measures at discharge: are they useful predictors of longer term outcomes for trauma registries? Ann Surg 2008;247:854-9. 7. King JT Jr, Carlier PM, Marion DW. Early Glasgow Outcome Scale scores predict long-term functional outcome in patients with severe traumatic brain injury. J Neurotrauma 2005;22: 947-54. 8. Miller KJ, Schwab KA, Warden DL. Predictive value of an early Glasgow Outcome Scale score: 15-month score changes. J Neurosurg 2005;103:239-45. 9. Levin HS, Boake C, Song J, et al. Validity and sensitivity to change of the extended Glasgow Outcome Scale in mild to moderate traumatic brain injury. J Neurotrauma 2001;18:575-84. 10. Malec JF, Kragness M, Evans RW, Finlay KL, Kent A, Lezak MD. Further psychometric evaluation and revision of the MayoPortland Adaptability Inventory in a national sample. J Head Trauma Rehabil 2003;18:479-92. 11. Malec JF. Comparability of Mayo-Portland Adaptability Inventory ratings by staff, significant others and people with acquired brain injury. Brain Inj 2004;18:563-75. 12. Malec JF, Thompson JM. Relationship of the Mayo-Portland Adaptability Inventory to functional outcome and cognitive performance measures. J Head Trauma Rehabil 1994;9:1-15. 13. Bohac DL, Malec JF, Moessner AM. Factor analysis of the Mayo-Portland Adaptability Inventory: structure and validity. Brain Inj 1997;11:469-82. 14. Malec JF, Moessner AM, Kragness M, Lezak MD. Refining a measure of brain injury sequelae to predict postacute rehabilitation outcome: rating scale analysis of the Mayo-Portland Adaptability Inventory. J Head Trauma Rehabil 2000;15:670-82. 15. Malec JF, Buffington AL, Moessner AM, Degiorgio L. A medical/vocational case coordination system for persons with brain injury: an evaluation of employment outcomes. Arch Phys Med Rehabil 2000;81:1007-15. 16. Malec JF. Impact of comprehensive day treatment on societal participation for persons with acquired brain injury. Arch Phys Med Rehabil 2001;82:885-95. 17. Malec JF, Degiorgio L. Characteristics of successful and unsuccessful completers of 3 postacute brain injury rehabilitation pathways. Arch Phys Med Rehabil 2002;83:1759-64. 18. Constantinidou F, Thomas RD, Scharp VL, Laske KM, Hammerly MD, Guitonde S. Effects of categorization training in patients with TBI during postacute rehabilitation: preliminary findings. J Head Trauma Rehabil 2005;20:143-57. 19. Murrey GJ, Hale FM, Williams JD. Assessment of anosognosia in persons with frontal lobe damage: clinical utility of the MayoPortland Adaptability Inventory (MPAI). Brain Inj 2005;19:599603. Arch Phys Med Rehabil Vol 91, November 2010 1660.e14 CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde 20. Rappaport M, Hall KM, Hopkins K, Belleza T, Cope DN. Disability rating scale for severe head trauma: coma to community. Arch Phys Med Rehabil 1982;63:118-23. 21. Gouvier WD, Blanton PD, LaPorte KK, Nepomuceno C. Reliability and validity of the Disability Rating Scale and the Levels of Cognitive Functioning Scale in monitoring recovery from severe head injury. Arch Phys Med Rehabil 1987;68:94-7. 22. Novack TA, Bergquist TF, Bennett G, Gouvier WD. Primary caregiver distress following severe head injury. J Head Trauma Rehabil 1991;6:69-77. 23. Hall KM, Hamilton BB, Gordon WA, Zasler ND. Characteristics and comparisons of functional assessment indices: Disability Rating Scale, Functional Independence Measure and Functional Assessment Measure. J Head Trauma Rehabil 1993;8:60-74. 24. Hall K, Cope DN, Rappaport M. Glasgow Outcome Scale and Disability Rating Scale: comparative usefulness in following recovery in traumatic head injury. Arch Phys Med Rehabil 1985;66:35-7. 25. Jennett B, Bond M. Assessment of outcome after severe brain damage. Lancet 1975;1:480-4. 26. Smith RM, Fields FRJ, Lenox JL, Morris HO, Nolan JJ. A functional scale of recovery from severe head trauma. Clin Neuropsychol 1979;1:48-50. 27. MacKenzie EJ, McCarthy ML, Ditunno JF, et al. Using the SF-36 for characterizing outcome after multiple trauma involving head injury. J Trauma 2002;52:527-34. 28. Findler M, Cantor J, Haddad L, Gordon W, Ashman T. The reliability and validity of the SF-36 health survey questionnaire for use with individuals with traumatic brain injury. Brain Inj 2001;15:715-23. 29. Ware JE Jr, Sherbourne CD. The MOS 36-Item Short-Form Health Survey (SF-36). I. Conceptual framework and item selection. Med Care 1992;30:473-83. 30. Ware JE, Gandek B, Aaronson NK, et al. The SF-36 Health Survey: development and use in mental health research and the IQOLA project. Int J Mental Health 1994;23:49-74. 31. McNaughton HK, Weatherall M, McPherson KM. Functional measures across neurologic disease states: analysis of factors in common. Arch Phys Med Rehabil 2005;86:2184-8. 32. Paniak C, Phillips K, Toller-Lobe G, Durand A, Nagy J. Sensitivity of three recent questionnaires to mild traumatic brain injury-related effects. J Head Trauma Rehabil 1999;14:211-9. 33. Emanuelson I, Andersson Holmkvist E, Bjorklund R, Stalhammar D. Quality of life and post-concussion symptoms in adults after mild traumatic brain injury: a population-based study in western Sweden. Acta Neurol Scand 2003;108:332-8. 34. Giacino JT, Kalmar K, Whyte J. The JFK Coma Recovery Scale-Revised: measurement characteristics and diagnostic utility. Arch Phys Med Rehabil 2004;85:2020-9. 35. Schnakers C, Hustinx R, Vandewalle G, et al. Measuring the effect of amantadine in chronic anoxic minimally conscious state. J Neurol Neurosurg Psychiatry 2008;79:225-7. 36. Schnakers C, Majerus S, Giacino J, et al. A French validation study of the Coma Recovery Scale-Revised (CRS-R). Brain Inj 2008;22:786-92. 37. Lovstad M, Froslie K, Giacino JT, Anke A, Schanke A. Reliability and diagnostic characteristics of the JFK Coma Recovery Scale-Revised: exploring the influence of rater’s level of experience. J Head Trauma Rehabil 2010;25:349-56. 38. Giacino JT, Kezmarsky MA, DeLuca J, Cicerone KD. Monitoring rate of recovery to predict outcome in minimally responsive patients. Arch Phys Med Rehabil 1991;72:897-901. 39. O’Dell MW, Jasin P, Stivers M, Lyons N, Schmidt S, Moore DE. Interrater reliability of the Coma Recovery Scale. J Head Trauma Rehabil 1996;11:61-6. Arch Phys Med Rehabil Vol 91, November 2010 40. Schnakers C, Giacino J, Kalmar K, et al. Does the FOUR score correctly diagnose the vegetative and minimally conscious states? Ann Neurol 2006;60:744-5; author reply 745. 41. Coleman MR, Rodd JM, Davis MH, et al. Do vegetative patients retain aspects of language comprehension? Evidence from fMRI. Brain 2007;130:2494-507. 42. Coleman MR, Davis MH, Rodd JM, et al. Towards the routine use of brain imaging to aid the clinical diagnosis of disorders of consciousness. Brain 2009;132:2541-52. 43. Giacino JT, Schnakers C, Rodriguez-Moreno D, Kalmar K, Schiff N, Hirsch J. Behavioral assessment of patients with disorders of consciousness: gold standard or fool’s gold? Prog Brain Res 2009;177:33-48. 44. Giacino JT, Kalmar K. The vegetative and minimally conscious states: a comparison of clinical features and functional outcome. J Head Trauma Rehabil 1997;12:36-51. 45. Giacino JT, Trott CT. Rehabilitative management of patients with disorders of consciousness: grand rounds. J Head Trauma Rehabil 2004;19:254-65. 46. Smart CM, Giacino JT, Cullen T, et al. A case of locked-in syndrome complicated by central deafness. Nat Clin Pract Neurol 2008;4:448-53. 47. Schiff ND, Giacino JT, Kalmar K, et al. Behavioural improvements with thalamic stimulation after severe traumatic brain injury. Nature 2007;448:600-3. 48. Mitrushina M, Satz P. Effect of repeated administration of a neuropsychological battery in the elderly. J Clin Psychol 1991; 47:790-801. 49. van den Burg W, Kingma A. Performance of 225 Dutch school children on Rey’s Auditory Verbal Learning Test (AVLT): parallel test-retest reliabilities with an interval of 3 months and normative data. Arch Clin Neuropsychol 1999;14:545-59. 50. Strauss E, Sherman EMS, Spreen O. A compendium of neuropsychological tests: administration, norms, and commentary. New York: Oxford University Pr; 2006. 51. Dikmen SS, Heaton RK, Grant I, Temkin NR. Test-retest reliability and practice effects of expanded Halstead-Reitan Neuropsychological Test Battery. J Int Neuropsychol Soc 1999;5:346-56. 52. Levine AJ, Miller EN, Becker JT, Selnes OA, Cohen BA. Normative data for determining significance of test-retest differences on eight common neuropsychological instruments. Clin Neuropsychol 2004;18:373-84. 53. Goldstein G, Watson J. Test-retest reliability of the HalsteadReitan battery and the WAIS in a neuropsychiatric population. Clin Neuropsychol 1989;3:265-73. 54. Fals-Stewart W. An interrater reliability study of the Trail Making Test (Parts A and B). Percept Mot Skills 1992;74:39-42. 55. Dikmen S, Machamer JE, Winn HR, Temkin NR. Neuropsychological outcome at 1 year post head injury. Neuropsychology 1995;9:80-90. 56. Millis SR, Rosenthal M, Novack TA, et al. Long-term neuropsychological outcome after traumatic brain injury. J Head Trauma Rehabil 2001;16:343-55. 57. Iverson GL. Outcome from mild traumatic brain injury. Curr Opin Psychiatry 2005;18:301-17. 58. Wechsler D. Wechsler Adult Intelligence Scale III. San Antonio: Harcourt Assessment Inc; 1997. 59. Benedict RH. Brief Visuospatial Memory Test-Revised. Odessa: Psychological Assessment Resources Inc; 1997. 60. Tombaugh TN, Kozak J, Rees L. Normative data stratified by age and education for two measures of verbal fluency: FAS and animal naming. Arch Clin Neuropsychol 1999;14:167-77. 61. Ross TP. The reliability of cluster and switch scores for the Controlled Oral Word Association Test. Arch Clin Neuropsychol 2003;18:153-64. CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde 62. Iverson GL, Franzen MD, Lovell MR. Normative comparisons for the controlled oral word association test following acute traumatic brain injury. Clin Neuropsychol 1999;13:437-41. 63. Henry JD, Crawford JR. A meta-analytic review of verbal fluency performance in patients with traumatic brain injury. Neuropsychology 2004;18:621-8. 64. Burgess PW, Alderman N, Evans J, Emslie H, Wilson BA. The ecological validity of tests of executive function. J Int Neuropsychol Soc 1998;4:547-58. 65. Sarno MT, Postman WA, Cho YS, Norman RG. Evolution of phonemic word fluency performance in post-stroke aphasia. J Commun Disord 2005;38:83-107. 66. Batchelor J, Harvey AG, Bryant RA. Stroop colour word test as a measure of attentional deficit following mild head injury. Clin Neuropsychol 1995;9:180-6. 67. Cicerone KD, Azulay J. Diagnostic utility of attention measures in postconcussion syndrome. Clin Neuropsychol 2002;16:280-9. 68. Boyle PA, Paul RH, Moser DJ, Cohen RA. Executive impairments predict functional declines in vascular dementia. Clin Neuropsychol 2004;18:75-82. 69. Ivnik RJ, Malec J, Smith GE, Tangalos EG. Neuropscyhological test norms above age 55: COWAT, BNT, MAE token, WRAT-R reading, AMNART, Stroop, TMT, and JLO. Clin Neuropsychol 1996;10:262-78. 70. Kersel DA, Marsh NV, Havill JH, Sleigh JW. Neuropsychological functioning during the year following severe traumatic brain injury. Brain Inj 2001;15:283-96. 71. Greiffenstein MF, Baker WJ, Gola T. Validation of malingered amnesia meaures with a large clinical sample. Psychol Assess 1994;6:218-24. 72. Myers JE, Volbrecht M. Validation of reliable digits for detection of malingering. Assessment 1999;5:303-7. 73. Wilkinson GS, Robertson GJ. Wide range achievement test-4 professional manual. Lutz: Psychological Assessment Resources Inc; 2006. 74. Ball JD, Hart RP, Stutts ML, Turf E, Barth JT. Comparative utility of Barona Formulae, Wtar demographic algorithms, and WRAT-3 reading for estimating premorbid ability in a diverse research sample. Clin Neuropsychol 2007;21:422-33. 75. Johnstone B, Callahan CD, Kapila CJ, Bouman DE. The comparability of the WRAT-R reading test and NAART as estimates of premorbid intelligence in neurologically impaired patients. Arch Clin Neuropsychol 1996;11:513-9. 76. Orme DR, Johnstone B, Hanks R, Novack TA. The WRAT-3 Reading Subtest is a measure of premorbid intelligence among persons with brain injury. Rehabil Psychol 2004;49:250-3. 77. Kreutzer JS, Gordon WA, Rosenthal M, Marwitz J. Neuropsychological characteristics of patients with brain injury: preliminary findings from a multicenter investigation. J Head Trauma Rehabil 1993;8:47-59. 78. Boake C, Millis SR, High WM Jr, et al. Using early neuropsychologic testing to predict long-term productivity outcome from traumatic brain injury. Arch Phys Med Rehabil 2001;82:761-8. 79. Ip RY, Dornan J, Schentag C. Traumatic brain injury: factors predicting return to work or school. Brain Inj 1995;9:517-32. 80. Sherer M, Sander AM, Nick TG, High WM Jr, Malec JF, Rosenthal M. Early cognitive status and productivity outcome after traumatic brain injury: findings from the TBI model systems. Arch Phys Med Rehabil 2002;83:183-92. 81. Meachen SJ, Hanks RA, Millis SR, Rapport LJ. The reliability and validity of the Brief Symptom Inventory-18 in persons with traumatic brain injury. Arch Phys Med Rehabil 2008;89:958-65. 82. Derogatis LR. Brief Symptom Inventory 18 (BSI 18): administration, scoring and procedures manual. Minneapolis: NCS Pearson Inc; 2001. 1660.e15 83. Tellegen A, Ben-Porath YS. MMPI-2-RF (Minnesota Multiphasic Personality Inventory-2): technical manual. Minneapolis: Univ of Minneapolis Pr; 2008. 84. Reinert DF, Allen JP. The Alcohol Use Disorders Identification Test: an update of research findings. Alcohol Clin Exp Res 2007;31:185-99. 85. Bradley KA, Bush KR, McDonell MB, Malone T, Fihn SD. Screening for problem drinking: comparison of CAGE and AUDIT. J Gen Intern Med 1998;13:379-88. 86. Bergman H, Kallmen H. Alcohol use among Swedes and a psychometric evaluation of the Alcohol Use Disorders Identification Test. Alcohol Alcohol 2002;37:245-51. 87. Moussas G, Dadouti G, Douzenis A, et al. The Alcohol Use Disorders Identification Test (AUDIT): reliability and validity of the Greek version. Ann Gen Psychiatry 2009;8:11. 88. Hides L, Cotton SM, Berger G, et al. The reliability and validity of the Alcohol, Smoking and Substance Involvement Screening Test (ASSIST) in first-episode psychosis. Addict Behav 2009; 34:821-5. 89. Ponsford J, Whelan-Goodinson R, Bahar-Fuchs A. Alcohol and drug use following traumatic brain injury, a prospective study. Brain Inj 2007;21:1385-92. 90. Stein AD, Lederman RI, Shea S. The Behavioral Risk Factor Surveillance System questionnaire: its reliability in a statewide sample. Am J Public Health 1993;83:1768-72. 91. Miller JW, Gfroerer JC, Brewer RD, Naimi TS, Mokdad A, Giles WH. Prevalence of adult binge drinking: a comparison of two national surveys. Am J Prev Med 2004;27:197-204. 92. Horner MD, Ferguson PL, Selassie AW, Labbate LA, Kniele K, Corrigan JD. Patterns of alcohol use 1 year after traumatic brain injury: a population-based, epidemiological study. J Int Neuropsychol Soc 2005;11:322-30. 93. WHO ASSIST Working Group. The Alcohol, Smoking and Substance Involvement Screening Test (ASSIST): development, reliability and feasibility. Addiction 2002;97:1183–94. 94. Humeniuk R, Ali R, Babor TF, et al. Validation of the Alcohol, Smoking, and Substance Involvement Screening Test (ASSIST). Addiction 2008;103:1039-47. 95. Humeniuk R, Dennington V, Ali R. The effectiveness of a brief intervention for illicit drugs linked to the Alcohol, Smoking and Substance Involvement Screening Test (ASSIST) in primary health care settings: a technical report of phase III findings of the WHO ASSIST randomized controlled trial. Geneva: World Health Organization; 2008. 96. Weathers F, Litz B, Herman D, Huska J, Keane T. The PTSD Checklist (PCL): reliability, validity, and diagnostic utility. 9th Annual Convention of the International Society for Traumatic Stress Studies, October 1993, San Antonio, TX. Available at: http://www.pdhealth.mil/library/downloads/PCL_sychometrics. doc. Accessed October 15, 2010. 97. Blanchard EB, Jones-Alexander J, Buckley TC, Forneris CA. Psychometric properties of the PTSD Checklist (PCL). Behav Res Ther 1996;34:669-73. 98. Keane TM, Caddell JM, Taylor KL. Mississippi Scale for Combat-Related Posttraumatic Stress Disorder: three studies in reliability and validity. J Consult Clin Psychol 1988;56:85-90. 99. Orsillo SM. Measures for acute stress disorder and posttraumatic stress disorder. In: Antony MM, Orsillo SM, editors. Practitioner’s guide to empirically based measures of anxiety. New York: Kluwer Academic/Plenum; 2001. p 255-307. 100. Norris FH, Hamblen JS. Standardized self-report measures of civilian trauma and PTSD. In: Wilson JP, Keane TM, Martin T, editors. Assessing psychological trauma and PTSD. New York: Guilford Pr; 2004. p 63-102. Arch Phys Med Rehabil Vol 91, November 2010 1660.e16 CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde 101. Epstein NB, Baldwin LM, Bishop DS. Family Assessment Device (FAD). Handbook of psychiatric measures. Washington (DC): American Psychiatric Association; 2000. 102. Aarons GA, McDonald EJ, Connelly CD, Newton RR. Assessment of family functioning in Caucasian and Hispanic Americans: reliability, validity, and factor structure of the Family Assessment Device. Fam Process 2007;46:557-69. 103. Kabacoff RI, Miller IW, Bishop DS, Epstein NB, Keitner GI. A psychometric study of the McMaster Family Assessment Device in psychiatric, medical, and nonclinical samples. J Fam Psychol 1990;3:431-9. 104. Barney MC, Max JE. The McMaster Family Assessment Device and clinical rating scale: questionnaire vs interview in childhood traumatic brain injury. Brain Inj 2005;19:801-9. 105. Wenniger WFDB, Hageman WJJM, Arrindell WA. Cross-national validity of dimensions of family functioning: first experiences with the Dutch version of the McMaster Family Assessment Device (FAD). Pers Individual Differences 1993;14:769-81. 106. Bihun JT, Wamboldt MZ, Gavin LA, Wamboldt FS. Can the Family Assessment Device (FAD) be used with school aged children? Fam Process 2002;41:723-31. 107. King NS, Crawford S, Wenden FJ, Moss NE, Wade DT. The Rivermead Post Concussion Symptoms Questionnaire: a measure of symptoms commonly experienced after head injury and its reliability. J Neurol 1995;242:587-92. 108. Crawford S, Wenden FJ, Wade DT. The Rivermead head injury follow up questionnaire: a study of a new rating scale and other measures to evaluate outcome after head injury. J Neurol Neurosurg Psychiatry 1996;60:510-4. 109. King NS, Crawford S, Wenden FJ, Caldwell FE, Wade DT. Early prediction of persisting post-concussion symptoms following mild and moderate head injuries. Br J Clin Psychol 1999; 38(Pt 1):15-25. 110. Ingebrigtsen T, Romner B, Marup-Jensen S, et al. The clinical value of serum S-100 protein measurements in minor head injury: a Scandinavian multicentre study. Brain Inj 2000;14:1047-55. 111. Savola O, Hillbom M. Early predictors of post-concussion symptoms in patients with mild head injury. Eur J Neurol 2003;10: 175-81. 112. Smits M, Dippel DW, Houston GC, et al. Postconcussion syndrome after minor head injury: brain activation of working memory and attention. Hum Brain Mapp 2009;30:2789-803. 113. Wilde EA, McCauley SR, Hunter JV, et al. Diffusion tensor imaging of acute mild traumatic brain injury in adolescents. Neurology 2008;70:948-55. 114. Ingebrigtsen T, Waterloo K, Marup-Jensen S, Attner E, Romner B. Quantification of post-concussion symptoms 3 months after minor head injury in 100 consecutive patients. J Neurol 1998; 245:609-12. 115. Chan RC. Base rate of post-concussion symptoms among normal people and its neuropsychological correlates. Clin Rehabil 2001; 15:266-73. 116. Eyres S, Carey A, Gilworth G, Neumann V, Tennant A. Construct validity and reliability of the Rivermead Post-Concussion Symptoms Questionnaire. Clin Rehabil 2005;19:878-87. 117. Smith-Seemiller L, Fow NR, Kant R, Franzen MD. Presence of post-concussion syndrome symptoms in patients with chronic pain vs mild traumatic brain injury. Brain Inj 2003;17:199-206. 118. Schwab KA, Ivins B, Cramer G, et al. Screening for traumatic brain injury in troops returning from deployment in Afghanistan and Iraq: initial investigation of the usefulness of a short screening tool for traumatic brain injury. J Head Trauma Rehabil 2007;22:377-89. 119. Grace J, Malloy PF. Frontal Systems Behavior Scale professional manual. Lutz: Psychological Assessment Resources Inc; 2001. Arch Phys Med Rehabil Vol 91, November 2010 120. Caracuel A, Verdejo-Garcia A, Vilar-Lopez R, et al. Frontal behavioral and emotional symptoms in Spanish individuals with acquired brain injury and substance use disorders. Arch Clin Neuropsychol 2008;23:447-54. 121. Grace J, Stout JC, Malloy PF. Assessing frontal lobe behavioral syndromes with the frontal lobe personality scale. Assessment 1999;6:269-84. 122. Norton LE, Malloy PF, Salloway S. The impact of behavioral symptoms on activities of daily living in patients with dementia. Am J Geriatr Psychiatry 2001;9:41-8. 123. Reid-Arndt SA, Nehl C, Hinkebein J. The Frontal Systems Behaviour Scale (FrSBe) as a predictor of community integration following a traumatic brain injury. Brain Inj 2007;21:1361-9. 124. Stout JC, Ready RE, Grace J, Malloy PF, Paulsen JS. Factor analysis of the Frontal Systems Behavior Scale (FrSBe). Assessment 2003;10:79-85. 125. Hobart JC, Lamping DL, Freeman JA, et al. Evidence-based measurement: which disability scale for neurologic rehabilitation? Neurology 2001;57:639-44. 126. Stineman MG, Shea JA, Jette A, et al. The Functional Independence Measure: tests of scaling assumptions, structure, and reliability across 20 diverse impairment categories. Arch Phys Med Rehabil 1996;77:1101-8. 127. Corrigan JD, Smith-Knapp K, Granger CV. Validity of the Functional Independence Measure for persons with traumatic brain injury. Arch Phys Med Rehabil 1997;78:828-34. 128. Lee JE, Stokic DS. Risk factors for falls during inpatient rehabilitation. Am J Phys Med Rehabil 2008;87:341-50; quiz, 51, 422. 129. Hammond FM, Hart T, Bushnik T, Corrigan JD, Sasser H. Change and predictors of change in communication, cognition, and social function between 1 and 5 years after traumatic brain injury. J Head Trauma Rehabil 2004;19:314-28. 130. Vanderploeg RD, Schwab K, Walker WC, et al. Rehabilitation of traumatic brain injury in active duty military personnel and veterans: Defense and Veterans Brain Injury Center randomized controlled trial of two rehabilitation approaches. Arch Phys Med Rehabil 2008;89:2227-38. 131. Hamilton BB, Laughlin JA, Granger CV, Kayton RM. Interrater agreement of the seven-level Functional Independence Measure (FIM). Arch Phys Med Rehabil 1991;72:790. 132. Granger CV, Hamilton BB, Linacre JM, Heinemann AW, Wright BD. Performance profiles of the functional independence measure. Am J Phys Med Rehabil 1993;72:84-9. 133. Linacre JM, Heinemann AW, Wright BD, Granger CV, Hamilton BB. The structure and stability of the Functional Independence Measure. Arch Phys Med Rehabil 1994;75:127-32. 134. Dodds TA, Martin DP, Stolov WC, Deyo RA. A validation of the functional independence measurement and its performance among rehabilitation inpatients. Arch Phys Med Rehabil 1993; 74:531-6. 135. Heinemann AW, Kirk P, Hastie BA, et al. Relationships between disability measures and nursing effort during medical rehabilitation for patients with traumatic brain and spinal cord injury. Arch Phys Med Rehabil 1997;78:143-9. 136. Heinemann AW, Linacre JM, Wright BD, Hamilton BB, Granger C. Relationships between impairment and physical disability as measured by the functional independence measure. Arch Phys Med Rehabil 1993;74:566-73. 137. Granger CV, Cotter AC, Hamilton BB, Fiedler RC, Hens MM. Functional assessment scales: a study of persons with multiple sclerosis. Arch Phys Med Rehabil 1990;71:870-5. 138. Ditunno JF, Cohen ME, Formal CS, Whiteneck GG. Functional outcomes in spinal cord injury. In: Stove SL, Whiteneck GG, DeLisa J, editors. Spinal cord injury: clinical outcomes from the model systems. Gaithersburg: Aspen Publications; 1995. CDEs: TRAUMATIC BRAIN INJURY OUTCOME MEASURES, Wilde 139. Hall K, Mann N, High W, Wright J, Kreutzer J. Functional measures after traumatic brain injury: ceiling effects of FIM, FIM⫹FAM, DRS, and CIQ. J Head Trauma Rehabil 1996;11: 27-39. 140. Whiteneck GG, Charlifue SW, Gerhart KA, Overholser JD, Richardson GN. Quantifying handicap: a new measure of longterm rehabilitation outcomes. Arch Phys Med Rehabil 1992;73: 519-26. 141. Mellick D, Walker N, Brooks CA, Whiteneck GG. Incorporating the cognitive independence domain into CHART. J Rehabil Outcomes Meas 1999;3:12-21. 142. Walker N, Mellick D, Brooks CA, Whiteneck GG. Measuring participation across impairment groups using the Craig Handicap Assessment Reporting Technique. Am J Phys Med Rehabil 2003;82:936-41. 143. Diener E, Emmons RA, Larsen RJ, Griffin S. The Satisfaction With Life Scale. J Pers Assess 1985;49:71-5. 144. Pavot W, Diener E. Review of the Satisfaction With Life Scale. Psychol Assess 1993;5:164-72. 145. Corrigan JD, Smith-Knapp K, Granger CV. Outcomes in the first 5 years after traumatic brain injury. Arch Phys Med Rehabil 1998;79:298-305. 146. Corrigan JD, Bogner JA, Mysiw WJ, Clinchot D, Fugate L. Life satisfaction following traumatic brain injury. J Head Trauma Rehabil 2001;16:543-55. 147. Sokol K, Heinemann A, Bode R, Shin J, Van de Venter L. Community participation after TBI: factors predicting return to work and life satisfaction. 127th Annual Meeting of the American Public Health Association, November 7-11, 1999, Chicago, IL. 1660.e17 148. EuroQol—a new facility for the measurement of health-related quality of life. The EuroQol Group. Health Policy 1990;16:199–208. 149. Dolan P, Gudex C, Kind P, Williams A. The MVH Group. The measurement and valuation of health. First report on the main survey. Centre for Health Economics. pp 1-48. York: Univ of York; 1994. 150. van Agt HME, Essink-Bot ML, Krabbe PFM, Bonsel GJ. Testretest reliability of health state valuations collected with the EuroQol questionnaire. Soc Sci Med 1994;39:1537-44. 151. Brazier J, Jones N, Kind P. Testing the validity of the EuroQOL and comparing it with the SF-36 health survey questionnaire. Qual Life Res 1993;2:169-80. 152. Sintonen H. The 15-D measure of health-related quality of life: reliability, validity and sensitivity of its health state descriptive system. Melbourne: National Centre for Health Program Evaluation; 1995. 153. Hurst NP, Jobanputra P, Hunter M, Lambert M, Lochhead A, Brown H. Validity of EuroQOL—a generic health status instrument—in patients with rheumatoid arthritis. Economic and Health Outcomes Research Group. Br J Rheumatol 1994;33:655-62. 154. Klose M, Watt T, Brennum J, Feldt-Rasmussen U. Posttraumatic hypopituitarism is associated with an unfavorable body composition and lipid profile, and decreased quality of life 12 months after injury. J Clin Endocrinol Metab 2007;92:3861-8. 155. Bell KR, Temkin NR, Esselman PC, et al. The effect of a scheduled telephone intervention on outcome after moderate to severe traumatic brain injury: a randomized trial. Arch Phys Med Rehabil 2005;86:851-6. Arch Phys Med Rehabil Vol 91, November 2010