The Munich Wrist Questionnaire (MWQ) – development and validation of a new patient-reported outcome measurement tool for wrist disorders

Beirer, Marc; Serly, Julian; Vester, Helen; Pförringer, Dominik; Crönlein, Moritz; Deiler, Stephan; Biberthaler, Peter; Kirchhoff, Chlodwig

doi:10.1186/s12891-016-1029-4

Research article
Open access
Published: 14 April 2016

The Munich Wrist Questionnaire (MWQ) – development and validation of a new patient-reported outcome measurement tool for wrist disorders

Marc Beirer¹,
Julian Serly¹,
Helen Vester¹,
Dominik Pförringer¹,
Moritz Crönlein¹,
Stephan Deiler¹,
Peter Biberthaler¹ &
…
Chlodwig Kirchhoff¹

BMC Musculoskeletal Disorders volume 17, Article number: 167 (2016) Cite this article

Abstract

Background

Although self-assessment questionnaires for the wrist joint are numerous, most validation studies focus on a specific pathology and patient collectives. In addition the available questionnaires focus on subjective parameters such as pain, usual and specific activities but the range of motion (ROM) as an essential objective parameter in wrist disorders is rarely considered. Therefore the purpose of the presented study was to develop and validate a new universally applicable self-assessment score, the Munich Wrist Questionnaire (MWQ), which allows for the assessment of subjective as well as objective parameters of the wrist joint.

Methods

The MWQ consists of 16 items addressing three domains: pain, work and activities of daily living and wrist function including range of motion and grip strength. In a prospective clinical study validity, reliability and responsiveness of the MWQ of physical active patients were evaluated.

Results

Validation study included 100 patients (mean age 41 years, SD 16.3 years; range, 18–77 years). Test-retest reliability was substantial, with intraclass correlation coefficients ranging from 0.75 to 0.83 for the three domains. Construct validity and responsiveness were confirmed by correlation coefficients of at least 0.86 for construct validity and for responsiveness ranging from 0.61 to 0.65.

Conclusions

The MWQ presents a valid and reliable instrument for a qualitative self-assessment of subjective and objective parameters (e.g. range of motion) of the wrist joint. Quantitative measurement of wrist function may not longer be limited to specific wrist disorders or patient groups. The MWQ seems to allow for a broad application in clinical research and may facilitate the comparison of treatment results in wrist disorders.

Peer Review reports

Background

Clinical scoring systems became more and more popular in evaluating the efficacy of treatment procedures in wrist disorders [1, 2]. Numerous physician-based as well as patient-reported clinical measurement tools have been developed. However the physician-based clinical examination does not necessarily correlate with the patient’s satisfaction [3] and does not inevitably take into account further aspects related to an analysis of outcome such as the patient’s ability to perform activities of daily living and the ability to return to previous occupations [1]. Therefore the additional use of self-assessment questionnaires to clinical assessed parameters may result in a higher transparency of the patient’s wrist function and restrictions. A systematic review of the literature was performed to identify valid and commonly used scoring systems regarding follow-up examination in the field of wrist disorders. PubMed.gov was searched for wrist-specific terms (wrist, surgery, joint, upper extremity) combined with psychometric (validity, reliability, responsiveness, follow-up) and instrument specific terms (self-evaluation, patient-based, measurement tool, outcome measure, questionnaire). The Disabilities of the Arm, Shoulder and Hand (DASH) [4], the Patient-Rated Wrist Evaluation Score (PRWE) [5], the Cooney and Bussey Score (CBS) [6] and the Mayo Wrist Score (MWS) [7] were identified as frequently used and valid assessment measurement tools in wrist disorders. However the validation studies most commonly focus specific patient groups or diagnosis (e.g. fractures of the distal radius in the validation of the Patient-Rated Wrist Evaluation Score (PRWE)) and we are still far from a single outcome evaluation system which is reliable, valid and sensitive to clinically relevant change [4–8]. In addition the available self-assessment questionnaires focus on subjective parameters such as pain, usual and specific activities but the range of motion (ROM) as an essential objective parameter in wrist disorders is rarely considered. The Patient-Rated Wrist Evaluation Score (PRWE), for example, presents a wrist specific outcome instrument but it does not depict photographs to allow for a patient-based evaluation of the range of motion. The Disabilities of the Arm, Shoulder and Hand (DASH) score presents a frequently used and established self-assessment score for the general upper limb function but it does not constitute a wrist-specific rating instrument.

Therefore the purpose of this prospective study was to develop and validate an all-purpose Munich Wrist Questionnaire (MWQ) without limitations in the applicability regarding diagnosis or specific disorders for a patient-based follow-up examination considering subjective (pain, work and activities of daily living) as well as objective parameters (range of motion) in a heterogeneous patient collective.

The study protocol was approved by the local ethics committee (Ethics Committee of the medical faculty, Klinikum rechts der Isar, Technical University of Munich, Germany; study number 5316/12).

Methods

Development of the scoring system

To capture all aspects of the wrist function each scale of the DASH, the PRWE, the CBS and the MWS was analyzed for items either addressing general topics or specific items. Subsequently a matching of the general topics was performed and the dedicated items underwent a fusion to the final MWQ’s item (Additional file 1). Typical functional abilities were depicted as photographs (see Figs. 1 and 2) to assess the range of motion. Finally the MWQ contains 16 items addressing three domains: pain (five items), work and activities of daily living (work/ADL) (seven items) and wrist function including range of motion and grip strength (four items). The maximum value for all subjective parameters (subscales pain and work/ADL) is 120 out of 250 points (objective parameters (function) 130 out of 250 points) which means a subjective-objective ratio of almost 1:1. The overall score is than converted to a scale of 100 % whereas a value of 100 % indicates an excellent result and a value of zero percent a poor result. The MWQ can be downloaded from our official homepage.

Patient collective

A cohort of 100 consecutive patients who had suffered from traumatic soft tissue and/or osseous injures as well as degenerative disorders of the wrist joint were asked to complete all questionnaires at the outpatient clinic. All persons gave their informed consent prior to their inclusion in the study. People with limited legal capacity, under legal supervision or suffering from psychiatric diseases, dementia or other cognitive diseases were excluded.

Testing and evaluation of measurement qualities

Floor and ceiling effects

According to McHorney et al. [9] floor and ceiling effects exist, if more than 15 % of the patients achieve the lowest or highest possible score. Similarly we defined the presence of floor or ceiling effects, if more than 15 % of our patient collective would achieve the lowest (0 points) or highest (100 points) possible score of the MWQ.

Internal consistency

Internal consistency is defined by the degree of interrelation among the tested items [10]. The subscales are based on a reflective model in which all items are defined by a manifestation of the same underlying construct. According to previous published studies, Cronbach’s alpha was calculated per subscale and a score above 0.70 was considered as sufficient homogeneity of the subscales’ items [11–13].

Test-retest reliability

Test-retest reliability is defined as the extent to which scores of the same patients under the same conditions coincide in repeated measurements [10]. The time period between the repeated measurements should be long enough to prevent from recall of the tested items, and moreover should be short enough to ensure that no change of the clinical symptoms has occurred [11]. In this study a time period of 14 to 21 days after the initial examination was chosen to assess test-retest reliability. Intraclass correlation coefficients (ICC) were calculated and positive reliability was assumed when the ICC was at least 0.70 for all tested subscales [11].

Construct validity

Construct validity is defined as the degree to which the scores of a self-assessment instrument are consistent with a priori hypothesis, based on the assumption that the instrument validly measures the construct to be measured [10]. Construct validity was assessed by correlating the subscales “pain” and “work/ADL” of the MWQ with the subscales “pain” and “activities” of the PRWE. The subscale “function” of the MWQ was correlated with the subscale “range of motion/grip strength” of the MWS. The Pearson correlation coefficient (PCC) was calculated. Similar to previous studies, a positive construct validity was assumed when the PCC was at least 0.70 for all measured subscales [13, 14].

Responsiveness

Responsiveness is defined as the ability of an instrument to detect changes over time of the construct to be measured [10]. Responsiveness was evaluated four to six months after the initial presentation of the patient. To assess responsiveness patients completed the MWQ and a Global Perceived Effect (GPE) score consisting of only one question per subscale on the patients’ subjective opinion regarding improvement or worsening during the last months. A list of potential answers contained seven categories (much better (+3), better (+2), somewhat better (+1), no change (0), somewhat worse (–1), worse (–2), much worse (–3)) for each subscale of the MWQ. The time period of four to six months was chosen to be long enough to allow for a clinical change, and short enough to ensure that the patients are able to recall their health state during their initial presentation. The Spearman’s correlation coefficient (SCC) was calculated. SCC between the change of the MWQ and the GPE score of at least 0.40 was assumed to indicate positive responsiveness [12, 13].

Correlation of the MWQ with established wrist scores

We supposed that at least a moderate correlation would be obtained between the new MWQ and already established wrist rating systems (DASH, PRWE, CBS, MWS). The PCC was calculated followed by a linear regression analysis. A positive correlation was assumed when the PCC was at least 0.70.

Statistical analysis

The results were compared by calculating the SCC and PCC with a linear regression analysis. A p-value <0.05 determined significance.

Results

Patients and study design

Validity, reliability and responsiveness of the MWQ were determined in a prospective, clinical study. Between August 2012 and November 2013 100 consecutive patients (mean age 41 years, SD 16.3 years; range, 18–77 years) were asked to complete the MWQ, the DASH, the PRWE, the CBS and the MWS at initial presentation for evaluating validity. Completion of the MWQ lasted about eight minutes (mean time 7.7 min, SD 2.2 min, Min. 4.0 min, Max. 15.7 min). Table 1 summarizes patient’s diagnosis representing a wide spectrum of traumatic and degenerative wrist disorders. Figure 3 shows the clinical study profile. Figure 4 shows the results of the correlation between the MWQ and frequently used wrist rating systems. The PCC between the MWQ and the DASH was 0.90, 0.84 for the PRWE, 0.94 for the CBS and 0.93 for the MWS (p < 0.05).

Table 1 Study population, n = 100

Full size table