Skip to main content

Psychometric properties of patient-reported outcome measures (PROMs) in wrist osteoarthritis: test–retest reliability and construct validity

Abstract

Background

Patient-reported outcome measures (PROMs) are frequently used to assess the effects of treatments in patients with wrist osteoarthritis (OA), but their psychometric properties have not been evaluated in this group of patients. Our aim was to evaluate the psychometric properties of the Numeric Rating Scale (NRS pain at rest, pain on motion without load, and pain on load), the Disabilities of the Arm, Shoulder and Hand (DASH) and the Patient Rated Wrist Evaluation (PRWE) questionnaires in patients with wrist OA regarding test–retest reliability and construct validity.

Methods

The NRS, DASH and PRWE were self-administered by 50 patients (40 men and 10 women, mean age 66 years) in a postal survey on two occasions, two weeks apart. Test–retest reliability was evaluated by Kappa statistics and the Spearman rank correlation coefficients (rho) were calculated to evaluate construct validity.

Results

The Kappa coefficients for DASH, PRWE and NRS pain on motion without load and NRS pain on load were > 0.90, 95% CI ranging from 0.84 to 0.98, while NRS pain at rest was 0.83, 95% CI 0.73–0.92. The construct validity of the PROMs was confirmed by three formulated hypotheses: a higher correlation between PRWE and NRS (rho 0.80–0.91, p < 0.001) was found, compared to DASH and NRS (rho 0.68–0.80, p < 0.001); the NRS pain on motion without load and NRS pain on load correlated more strongly to PRWE and DASH (rho 0.71–0.91, p < 0.001) compared to NRS pain at rest (rho 0.68–0.80) and a high correlation between PRWE and DASH was found (rho 0.86, p < 0.001).

Conclusions

The NRS, DASH and PRWE demonstrate excellent test–retest reliability and moderate to high construct validity in patients with wrist OA. These PROMs are highly related, but they also differ. Therefore, they complement each other in ensuring a comprehensive evaluation of perceived disability in wrist OA. As PRWE showed the highest test–retest reliability and the highest relation to the other PROMs, the sole use of the PRWE can be recommended in clinical practice.

Peer Review reports

Background

Patients with wrist osteoarthritis (OA) commonly suffers from pain, decreased range of motion (ROM) and reduced grip strength [1, 2]. These impairments can have a significant impact on patient’s ability to perform daily activities and to participate in society [3]. Reducing pain and improving the patient’s ability to use their hand and wrist in daily activities are a prime focus in non-surgical and surgical treatment strategies for patients with wrist OA [1, 2, 4].

Indications for interventions in patients with wrist OA are often the patient’s self- perceived symptoms and disabilities [3]. Therefore, patient-reported outcome measures (PROMs) are important to include when assessing the effects of different treatments for wrist OA because they provide a more complete picture of the disability from the patient’s perspective [5, 6].

In wrist and hand OA, commonly used PROMs are pain rating scales, such as the Visual analogue Scale (VAS) and the Numerical Rating Scale (NRS) [7,8,9,10] for measuring pain intensity, the Disabilities of the Arm, Shoulder and Hand (DASH) questionnaire [8, 11,12,13], and the Patient-Rated Wrist Evaluation (PRWE) questionnaire [14, 15]. The DASH measures self-reported upper extremity physical function and symptoms taking the whole upper extremity into account, irrespective of which hand or if both hands are used and is the most commonly used PROM in clinical trials for patients with wrist OA [8]. The PRWE is a wrist specific PROM originally developed for the assessment of perceived disability after a distal radius fracture. Content validity of PRWE has previously been evaluated in the context of hand and wrist arthritis (27% with OA, 67% with rheumatoid arthritis and 6% with psoriatic arthritis) [16], but apart from this, none of these PROMs have been evaluated regarding their psychometric properties in the context of wrist OA.

As the psychometric properties of a measure are closely linked to the population it is intended for, there is a need to investigate the psychometric properties for these PROMs in patients with wrist OA [17, 18]. The aim of our study was to assess and compare the psychometric properties of the NRS, DASH, and PRWE in a group of patients with wrist OA regarding test–retest reliability and construct validity.

Material and methods

Participants

Data were collected from January 2020 to December 2021. Inclusion criteria were 1) radiographically confirmed wrist OA and 2) age ≥ 18. Exclusion criteria were 1) the presence of other diseases or disorders that could affect arm and hand function, 2) previous surgery to the wrist, and 3) inability to understand and follow test instructions due to communicative, mental, or cognitive impairments.

Via the hospital’s administrative patient system, 66 patients (54 men and 12 women) seeking medical care for wrist OA at the Department of Hand Surgery, Skåne University Hospital Malmö, Sweden between the years 2016 to 2021 were identified by reviewing the patients’ medical records. Thirteen patients did not respond and three declined to participate, leaving 50 patients that were included in the study.

Prior to inclusion, information about the purpose of the study was provided, and all participants gave their written consent to participate. The principles of the Declaration of Helsinki were followed. The study was approved by the Swedish Ethical Review Authority, Dnr 2019–02,437. The COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist, guidelines for methodological quality in psychometric studies, was followed [19].

Outcome measures

The NRS [20,21,22] is a numeric 11-point pain rating box scale with numerical descriptors on the box, ranging from 0 representing one pain extreme (no pain) to 10 representing the other pain extreme (worst pain imaginable). Patients select a value that is most in line with the intensity of pain they have perceived in the affected wrist over the last week. Three measures of pain are rated in this study: 1) pain at rest, 2) pain on motion without load, and 3) pain on load [23]. The NRS have been found to be valid and reliable in different acute and chronic pain conditions and in healthy populations [20, 22].

The main part of the DASH is a 30-item disability/symptom scale concerning the patient's health status during the preceding week [11, 12]. The items ask about the degree of difficulty in performing different physical activities because of arm, shoulder, or hand problems (21 items), the severity of each of the symptoms of pain, activity-related pain, tingling, weakness and stiffness (5 items), as well as the problem's impact on social activities, work, sleep, and self-image (4 items). Each item has five response options. The scores for all items are then used to calculate a scale score ranging from 0 (no disability) to 100 (most severe disability). The score for the disability/symptom scale is called the DASH score. The DASH is one of the most important PROMs for measuring upper extremity disability, and its psychometric properties have been evaluated in a range of conditions involving the upper extremity [24, 25]. In this study, the validated Swedish version of DASH was used [13].

The PRWE [14] includes 15 questions, divided into two subscales assessing pain (5 items) and function (10 items, 6 concerning specific tasks and 4 the ability to perform daily activities) over the past week. The questions are scored on a 10-point ordered categorical scale, ranging from no pain or no difficulty (0 points), to worst pain or unable to do (10 points). The total score of the subscales pain (sum of 5 items) and function (sum of 10 items divided by 2) ranges from 0 to 50. The maximum total score of PRWE is 100 and represents the worst disability, whereas 0 represents no disability. The PRWE has mainly been found to be valid and reliable in patients with distal radius fractures, but also in a variety of other hand and wrist-related injuries and disorders [7]. In the current study, the validated Swedish version of PRWE by Wilcke et al. was used [26].

The quality of a PROM depends on its psychometric properties, such as reliability and validity [18]. Reliability implies that the measure produces similar results under consistent conditions. We evaluated this in a test–retest situation, i.e. the outcome measures should ideally produce similar results from one test occasion to the next, thus indicating that the measurements are stable over time [17]. Validity refers to the degree of which a test measures what it is intended to measure, and can be evaluated as construct validity, where outcomes of the same construct should be related to each other [18]. As no ´gold standard´ exists for PROMs, the commonly used way to investigate construct validity is to test hypotheses about expected relationships with other outcome measures of good quality [27]. To assess construct validity, we have chosen three reliable and valid PROMs frequently used to evaluate the effect of treatment on patients with wrist OA. We formulated the following hypotheses: 1) PRWE should correlate more strongly to NRS compared to DASH and NRS because PRWE includes the subscale pain and both NRS and PRWE are wrist specific; 2) NRS pain on motion without load and NRS pain on load should correlate more strongly to PRWE and DASH compared to NRS pain at rest as both PRWE and DASH mainly contain activity based questions; and 3) PRWE and DASH should have a moderate to high correlation since they measure the same construct, i.e. upper extremity disability. However, since these questionnaires differ slightly, with DASH being more generic than PRWE, we did not expect a very high correlation (> 0.90).

Procedures

Information about the study with an informed consent form was sent to the participants by surface mail together with the PROMs (NRS, DASH and PRWE) for test occasion 1 (T1). The participants noted the date when the questionnaires were responded and returned them in a prepaid envelope. When the researcher had received the responses of the T1 questionnaires, the same questionnaires for test occasion 2 (T2) were sent to the participants. If a participant failed to send T2 within two weeks, he/she was reminded by telephone or a second surface mail. The time interval of the responses between T1 and T2 became approximately two weeks, with some outliers taking longer time, which is standard in test–retest studies [18, 28].

Background data – such as gender, age, affected side and handedness – were obtained from the medical records. All patients included in the study had wrist radiographs and/or computer tomography taken prior to referral to our clinic. The radiographs were evaluated by an experienced hand surgeon (EB), and the type of wrist OA (scapholunate advanced collapse; SLAC, scaphoid non-union advanced collapse; SNAC, idiopathic wrist OA or Mb Kienböck) was recorded (see Table 1). If the participants had bilateral wrist OA, the wrist that the participant reported as the most affected was included in the wrist specific assessments.

Table 1 Characteristics of 50 participants with wrist osteoarthritis (OA)

Statistical analysis

Descriptive statistics – such as means, standard deviations (SD), frequencies, median (range) – were calculated as appropriate.

The test–retest reliability of NRS (pain at rest, pain on motion without load, and pain on load), DASH and PRWE (total score and subscales pain and function) were evaluated by Kappa statistics (the proportion of agreement observed beyond the agreement expected by chance) using quadratic weights [28,29,30]. The strength of the Kappa coefficient can be interpreted as follows: < 0.40 poor, 0.40 to 0.75 fair to good, and > 0.75 excellent [31].

The Spearman rank correlation coefficients (rho) were calculated to evaluate construct validity. The PRWE and DASH were correlated to NRS, and the correlation between PRWE and DASH was also calculated. Data from the first test occasion were used in the construct validity analyses. The strength of the correlations was interpreted as follows: rho < 0.5 low, 0.5 to < 0.7 moderate, > 0.7 high [32].

Data were analyzed with the IBM SPSS Statistics version 27 (IBM Corporation, Armonk, New York, United States). P-values < 0.05 were considered statistically significant.

Results

The clinical characteristics of the 50 participants are presented in Table 1. Their mean age was 66 years (SD 9) and the dominant hand was the most commonly affected. A majority of the participants were diagnosed with a SLAC wrist. The mean interval between the responses was 16.7 days (SD 17).

The Kappa coefficients for DASH, PRWE (total score and the subscales pain and function) and NRS pain on motion without load and NRS pain on load were > 0.90, 95% CI ranging from 0.84 to 0.98, while NRS pain at rest was 0.83, 95% CI 0.73–0.92 (Table 2).

Table 2 Test–retest reliability of the NRS, DASH and PRWE in 50 patients with wrist osteoarthritis (OA)

The highest correlations were found between PRWE (total score and subscales) and NRS pain on motion without load (rho 0.89–0.91, p < 0.001), and between PRWE total score and DASH (rho 0.86, p < 0.001). The PRWE subscale pain correlated higher (rho 0.84, p < 0.001) than the other measures to NRS pain at rest. Somewhat lower correlations were seen between DASH and NRS (rho 0.68–0.80, p < 0.001) (Table 3).

Table 3 Spearman rank correlations (rho) between NRS, DASH and PRWE in 50 patients with wrist osteoarthritis (OA)

Discussion

This study has evaluated test–retest reliability and construct validity for the NRS, DASH, and PRWE on patients with wrist OA. Our results showed excellent test–retest reliability for these PROMs when applied to this specific group of patients. High correlations were seen between PRWE, NRS and DASH, whilst moderate to high correlations were seen between DASH and NRS.

The NRS, DASH and PRWE are rated in ordinal scales, therefore, retest reliability was analysed with Kappa statistics as recommended [27]. In a systematic review by Shafiee et al. [7], the intraclass correlation coefficient (ICC) was included when evaluating test–retest reliability, which should actually be used when evaluating agreement for parametric data [33]. Thus, the retest reliability results of the current study are not fully comparable even though a quadratic kappa gives about the same result as an ICC [34].

Excellent agreement according to the kappa-coefficient was found for the NRS (0.83–0.92). Earlier studies have mainly used a visual analogue scale (VAS) score evaluating general pain intensity for the wrist, but also with excellent test–retest agreement (ICC 0.84) [7, 35]. Although high test–retest agreement of NRS has been reported [36], it is not as thoroughly investigated as VAS [22]. A high correlation between VAS and NRS has been established, and NRS is recommended to use when assessing pain intensity on the basis of higher compliance rates, better responsiveness, ease of use, and good applicability relative to VAS [20]. Our study showed that NRS pain at rest had somewhat lower Kappa coefficient (0.83) compared to NRS pain on motion without load and NRS pain on load (0.91–0.92). Osteoarthritic pain is complex and variable with “good days and bad days” mainly occurring on use, movement, weight-bearing and later in the day [3, 37]. All these factors probably affect pain at rest, thereby resulting in more pain variations.

An excellent kappa coefficient was found for DASH in our study (0.91). This is in line with previous studies [13, 38,39,40,41,42] that have shown excellent test–retest agreement for DASH with ICC > 0.90 in patients with various upper extremity disorders. The PRWE also showed excellent agreement for the total score and the subscales (Kappa coefficients 0.93–0.94). This is in accordance with a previous systematic review evaluating test–retest reliability for PRWE in 24 studies of various types of wrist and hand injuries [7] that reported ICCs of 0.91–0.93 [7]. Taken together, the NRS, DASH and PRWE can be considered reliable PROMs in wrist OA; thus, they produce similar results under consistent conditions.

When construct validity was evaluated, the formulated hypotheses were confirmed. The PRWE correlated higher with the NRS pain scale compared to DASH. Since PRWE contains a very detailed section with five specific pain questions compared to DASH, which only contains two general pain questions, this was a rather expected result. Previous studies, including patients with various shoulder-, hand/wrist disorders as well as RA patients with upper extremity disabilities, have mainly found moderate to high correlations between DASH and VAS (rho 0.60–0.72) [38, 39, 41, 42]. However, one study, including various arm, shoulder and hand problems, [40] demonstrated low to moderate correlations between DASH and VAS pain at rest (rho 0.44) and VAS pain during activity (rho 0.56). Possible explanations for this low correlation could be that the recruited subjects had heterogeneous diagnoses with variable degrees of pain intensity. In accordance with the present study, Beaton et al. [38] found a high correlation (> 0.70) between DASH and VAS in patients with various shoulder- and hand/wrist disorders. Moderate to high correlations have also been found between PRWE and VAS (rho 0.50–0.74) [43,44,45,46]. When PRWE and DASH were correlated to NRS, the highest correlations were found for NRS pain on motion without load. This can be explained by the fact that most questions in both PRWE and DASH concern the disability on activities of daily living.

High correlations were found between DASH and PRWE (rho 0.86), which is comparable to previous studies that have found moderate to high correlations between DASH and PRWE (rho 0.61–0.94) [7]. Our results are expected since DASH and PRWE to a large extent resemble each other in rating the same construct, i.e., upper extremity disability, predominantly in activities. However, even if they are highly correlated, they also differ slightly probably as DASH is a generic upper limb instrument, whereas PRWE is wrist specific.

The application of self-reported outcome measures allows healthcare professionals to assess the course of treatment; furthermore, it facilitates comparison between groups in clinical trials [8]. To include different types of PROMs – such as symptom specific, wrist specific and more generic questionnaires – provides the opportunity to evaluate separate symptoms and overall self-reported upper extremity disability. This strengthens the concept that NRS, DASH and PRWE complement each other when evaluating patients with wrist OA, especially in a research setting. However, in clinical practice, barriers exists in incorporating PROMs to measure outcomes into the clinical routine because such measurements are time-consuming; thus, they can decrease productivity [47]. As we found a high correlation between PRWE and both DASH and NRS, all three PROMs may not be necessary for the assessment of wrist OA. The PRWE has been shown to be easier for patients to complete, quicker to administer and easier to score than DASH [47]. Consequently, PRWE might be recommended to be used as a sole PROM in clinical practice for assessing functional outcome of wrist OA.

Strengths and limitations

One of the main strengths of our study is the investigation of test–retest reliability and construct validity for a specific patient group, wrist OA, since a measure’s psychometric properties are closely linked to the population they are intended for [17, 18].

According to the COSMIN checklist, the number of participants in this study (n = 50) is an adequate sample size. However, a sample size ≥ 100 would have been considered as very good [18, 48]. The time interval between the repeated ratings in our study is approximately two weeks, which is recommended [18]. However, as there were a few participants that took longer than two weeks to respond to T2, they needed to be reminded, which can be seen as a minor limitation. None of the participants were actively seeking care for their wrist problems during the data collection since data was collected retrospectively, but we cannot fully ensure that the patients were completely stable in the interim period. Most patients were men (80%), and SLAC wrist was the most represented diagnosis (76%). This is characteristic of patients with wrist OA [4, 49], but it can limit the generalization to the whole wrist OA population.

This study provides novel data on the psychometric properties of three important PROMs used to assess perceived symptoms and disabilities in patients with wrist OA, which can improve the evaluation of different treatments for this group of patients. The recommended sole use of PRWE could also increase the use of PROMs in clinical practice. For future research, it would also be valuable to evaluate the responsiveness of these PROMs in patients with wrist OA.

Conclusions

The NRS, DASH and PRWE demonstrate excellent test–retest reliability and moderate to high construct validity in patients with wrist OA. These PROMs are highly related, but they also differ. Therefore, they complement each other in ensuring a comprehensive evaluation of perceived disability in wrist OA. As PRWE showed the highest test–retest reliability and the highest relation to the other PROMs, the sole use of the PRWE can be recommended in clinical practice.

Availability of data and materials

The datasets generated and/or analysed during the current study are not publicly available but available from corresponding author on reasonable request. Public access to data is restricted by the Swedish government (Public Access to Information and Secrecy Act; https://www.government.se/information-material/2009/09/public-access-to-information-and-secrecy-act/). However, data may be available for researchers upon special review and includes approval of the research project by both an Ethics Committee at national level and governmental data safety committees.

Abbreviations

OA:

Osteoarthritis

ROM:

Range of Motion

PROM:

Patient-Reported Outcome Measure

NRS:

Numeric Rating Scale

VAS:

Visual Analogue Scale

DASH:

Disabilities of the Arm, Shoulder and Hand

PRWE:

Patient-Rated Wrist Evaluation

CMC:

Carpometacarpal

T1:

Test occasion 1

T2:

Test occasion 2

COSMIN:

COnsensus-based Standards for the selection of health Measurement INstruments

SLAC:

Scapholunate Advanced Collapse

SNAC:

Scaphoid Non-union Advanced Collapse

ICC:

Intraclass Correlation Coefficient

IQR:

Interquartile Range

CI:

Confidence Interval

SD:

Standard Deviation

References

  1. Weiss AP. Osteoarthritis of the wrist. Instr Course Lect. 2004;53:31–40.

    PubMed  Google Scholar 

  2. Laulan J, Marteau E, Bacle G. Wrist osteoarthritis. Orthop Traumatol Surg Res. 2015;101(1 Suppl):S1-9.

    Article  CAS  PubMed  Google Scholar 

  3. Larsson S, Carlsson IK, Rosberg HE, Björkman A, Brogren E. Patients’ experiences before and after total wrist fusion or total wrist arthroplasty: A qualitative study of patients with wrist osteoarthritis. J Hand Ther. 2022;35(1):41–50.

    Article  PubMed  Google Scholar 

  4. Kozin SH, Michlovitz SL. Traumatic arthritis and osteoarthritis of the wrist. J Hand Ther. 2000;13(2):124–35.

    Article  CAS  PubMed  Google Scholar 

  5. Gagnier JJ. Patient reported outcomes in orthopaedics. J Orthop Res. 2017;35(10):2098–108.

    Article  PubMed  Google Scholar 

  6. Makhni EC. Meaningful clinical applications of patient-reported outcome measures in orthopaedics. J Bone Joint Surg Am. 2021;103(1):84–91.

    Article  PubMed  Google Scholar 

  7. Shafiee E, MacDermid J, Farzad M, Karbalaei M. A systematic review and meta-analysis of Patient-Rated Wrist (and Hand) Evaluation (PRWE/PRWHE) measurement properties, translation, and/ or cross-cultural adaptation. Disabil Rehabil. 2021:1–15. online ahead of print

  8. McPhail SM, Bagraith KS, Schippers M, Wells PJ, Hatton A. Use of condition-specific patient-reported outcome measures in clinical trials among patients with wrist osteoarthritis: a systematic review. Adv Orthop. 2012;2012: 273421.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Visser AW, Boyesen P, Haugen IK, Schoones JW, van der Heijde DM, Rosendaal FR, et al. Instruments measuring pain, physical function, or patient’s global assessment in hand osteoarthritis: a systematic literature search. J Rheumatol. 2015;42(11):2118–34.

    Article  PubMed  CAS  Google Scholar 

  10. Stjernberg-Salmela S, Karjalainen T, Juurakko J, Toivonen P, Waris E, Taimela S, et al. Minimal important difference and patient acceptable symptom state for the Numerical Rating Scale (NRS) for pain and the Patient-Rated Wrist/Hand Evaluation (PRWHE) for patients with osteoarthritis at the base of thumb. BMC Med Res Methodol. 2022;22(1):127.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Hudak PL, Amadio PC, Bombardier C. Development of an upper extremity outcome measure: the DASH (disabilities of the arm, shoulder and hand) [corrected]. The Upper Extremity Collaborative Group (UECG). Am J Ind Med. 1996;29(6):602–8.

    Article  CAS  PubMed  Google Scholar 

  12. De Smet L. The DASH questionnaire and score in the evaluation of hand and wrist disorders. Acta Orthop Belg. 2008;74(5):575–81.

    PubMed  Google Scholar 

  13. Atroshi I, Gummesson C, Andersson B, Dahlgren E, Johansson A. The disabilities of the arm, shoulder and hand (DASH) outcome questionnaire: reliability and validity of the Swedish version evaluated in 176 patients. Acta Orthop Scand. 2000;71(6):613–8.

    Article  CAS  PubMed  Google Scholar 

  14. MacDermid JC. Development of a scale for patient rating of wrist pain and disability. J Hand Ther. 1996;9(2):178–83.

    Article  CAS  PubMed  Google Scholar 

  15. MacDermid JC, Turgeon T, Richards RS, Beadle M, Roth JH. Patient rating of wrist pain and disability: a reliable and valid measurement tool. J Orthop Trauma. 1998;12(8):577–86.

    Article  CAS  PubMed  Google Scholar 

  16. Bobos P, MacDermid JC, Boutsikari EC, Lalone EA, Ferreira L, Grewal R. Evaluation of the content validity index of the Australian/Canadian osteoarthritis hand index, the patient-rated wrist/hand evaluation and the thumb disability exam in people with hand arthritis. Health Qual Life Outcomes. 2020;18(1):302.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Lexell JE, Downham DY. How to assess the reliability of measurements in rehabilitation. Am J Phys Med Rehabil. 2005;84(9):719–23.

    Article  PubMed  Google Scholar 

  18. Streiner DLNGR. Health Measurements Scales- A practical guide to their development and use. Fourth edition ed. Oxford: Oxford University Press; 2008.

  19. Gagnier JJ, Lai J, Mokkink LB, Terwee CB. COSMIN reporting guideline for studies on measurement properties of patient-reported outcome measures. Qual Life Res. 2021;30(8):2197–218.

    Article  PubMed  Google Scholar 

  20. Hjermstad MJ, Fayers PM, Haugen DF, Caraceni A, Hanks GW, Loge JH, et al. Studies comparing Numerical Rating Scales, Verbal Rating Scales, and Visual Analogue Scales for assessment of pain intensity in adults: a systematic literature review. J Pain Symptom Manage. 2011;41(6):1073–93.

    Article  PubMed  Google Scholar 

  21. Downie WW, Leatham PA, Rhind VM, Wright V, Branco JA, Anderson JA. Studies with pain rating scales. Ann Rheum Dis. 1978;37(4):378–81.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Williamson A, Hoggart B. Pain: a review of three commonly used pain rating scales. J Clin Nurs. 2005;14(7):798–804.

    Article  PubMed  Google Scholar 

  23. Carlsson IK, Ekstrand E, Åström M, Stihl K, Arner M. Construct validity, floor and ceiling effects, data completeness and magnitude of change for the eight-item HAKIR questionnaire: a patient-reported outcome in the Swedish National Healthcare Quality Registry for hand surgery. Hand Therapy. 2021;26(1):3–16.

    Article  Google Scholar 

  24. Dowrick AS, Gabbe BJ, Williamson OD, Cameron PA. Outcome instruments for the assessment of the upper extremity following trauma: a review. Injury. 2005;36(4):468–76.

    Article  PubMed  Google Scholar 

  25. Schoneveld K, Wittink H, Takken T. Clinimetric evaluation of measurement tools used in hand therapy to assess activity and participation. J Hand Ther. 2009;22(3):221–35 (quiz 36).

    Article  PubMed  Google Scholar 

  26. Wilcke MT, Abbaszadegan H, Adolphson PY. Evaluation of a Swedish version of the patient-rated wrist evaluation outcome questionnaire: good responsiveness, validity, and reliability, in 99 patients recovering from a fracture of the distal radius. Scand J Plast Reconstr Surg Hand Surg. 2009;43(2):94–101.

    Article  PubMed  Google Scholar 

  27. Mokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL, et al. The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: an international Delphi study. Qual Life Res. 2010;19(4):539–49.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Sim J, Wright CC. The kappa statistic in reliability studies: use, interpretation, and sample size requirements. Phys Ther. 2005;85(3):257–68.

    Article  PubMed  Google Scholar 

  29. Cohen J. A coefficient of agreement for nominal scales. Educ Psychol Meas. 1960;20:37–46.

    Article  Google Scholar 

  30. Cohen J. Weighted kappa: nominal scale agreement provision for scaled disagreement or partial credit. Psychol Bull. 1968;70(4):213.

    Article  CAS  PubMed  Google Scholar 

  31. Fleiss JL. Statistical methods for rates and proportions. 2nd ed. New York: John Wiley; 1981.

    Google Scholar 

  32. Mukaka M. A guide to appropriate use of correlation coefficient in medical research. Malawi Med. 2012;24(3):69–71.

    CAS  Google Scholar 

  33. Shrout PE, Fleiss JL. Intraclass correlations: uses in assessing rater reliability. Psychol Bull. 1979;86(2):420–8.

    Article  CAS  PubMed  Google Scholar 

  34. de Raadt A, Warrens MJ, Bosker RJ, Kiers HAL. A Comparison of Reliability Coefficients for Ordinal Rating Scales. J Classif. 2021;38:519–43.

    Article  Google Scholar 

  35. Mehta SP, Mhatre B, MacDermid JC, Mehta A. Cross-cultural adaptation and psychometric testing of the Hindi version of the patient-rated wrist evaluation. J Hand Ther. 2012;25(1):65–77 (quiz 8).

    Article  PubMed  Google Scholar 

  36. Hawker GA, Mian S, Kendzerska T, French M. Measures of adult pain: visual analog scale for pain (VAS Pain), numeric rating scale for pain (NRS Pain), McGill Pain questionnaire (MPQ), short-form McGill Pain questionnaire (SF-MPQ), chronic pain grade scale (CPGS), short form-36 bodily pain scale (SF-36 BPS), and measure of intermittent and constant osteoarthritis pain (ICOAP). Arthritis Care Res (Hoboken). 2011;63(Suppl 11):S240–52.

    Article  Google Scholar 

  37. Creamer P. Current perspectives on the clinical presentation of joint pain in human OA. Novartis Found Symp. 2004;260:64–74 (discussion -8, 100-4, 277-9).

    PubMed  Google Scholar 

  38. Beaton DE, Katz JN, Fossel AH, Wright JG, Tarasuk V, Bombardier C. Measuring the whole or the parts? Validity, reliability, and responsiveness of the Disabilities of the Arm, Shoulder and Hand outcome measure in different regions of the upper extremity. J Hand Ther. 2001;14(2):128–46.

    Article  CAS  PubMed  Google Scholar 

  39. Raven EE, Haverkamp D, Sierevelt IN, van Montfoort DO, Pöll RG, Blankevoort L, et al. Construct validity and reliability of the disability of arm, shoulder and hand questionnaire for upper extremity complaints in rheumatoid arthritis. J Rheumatol. 2008;35(12):2334–8.

    Article  PubMed  Google Scholar 

  40. Lee JY, Lim JY, Oh JH, Ko YM. Cross-cultural adaptation and clinical evaluation of a Korean version of the disabilities of arm, shoulder, and hand outcome questionnaire (K-DASH). J Shoulder Elbow Surg. 2008;17(4):570–4.

    Article  PubMed  Google Scholar 

  41. Orfale AG, Araujo PM, Ferraz MB, Natour J. Translation into Brazilian Portuguese, cultural adaptation and evaluation of the reliability of the Disabilities of the Arm, Shoulder and Hand Questionnaire. Braz J Med Biol Res. 2005;38(2):293–302.

    Article  CAS  PubMed  Google Scholar 

  42. Changulani M, Okonkwo U, Keswani T, Kalairajah Y. Outcome evaluation measures for wrist and hand: which one to choose? Int Orthop. 2008;32(1):1–6.

    Article  PubMed  Google Scholar 

  43. Boeckstyns ME, Merser S. Psychometric properties of two questionnaires in the context of total wrist arthroplasty. Dan Med J. 2014;61(11):A4939.

    PubMed  Google Scholar 

  44. Kim JK, Kang JS. Evaluation of the Korean version of the patient-rated wrist evaluation. J Hand Ther. 2013;26(3):238–43 (quiz 44).

    Article  PubMed  Google Scholar 

  45. Imaeda T, Uchiyama S, Wada T, Okinaga S, Sawaizumi T, Omokawa S, et al. Reliability, validity, and responsiveness of the Japanese version of the Patient-Rated Wrist Evaluation. J Orthop Sci. 2010;15(4):509–17.

    Article  PubMed  Google Scholar 

  46. Wah JW, Wang MK, Ping CL. Construct validity of the Chinese version of the Patient-rated Wrist Evaluation Questionnaire (PRWE-Hong Kong Version). J Hand Ther. 2006;19(1):18–26 (quiz 7).

    Article  PubMed  Google Scholar 

  47. MacDermid JC, Tottenham V. Responsiveness of the disability of the arm, shoulder, and hand (DASH) and patient-rated wrist/hand evaluation (PRWHE) in evaluating change after hand therapy. J Hand Ther. 2004;17(1):18–23.

    Article  PubMed  Google Scholar 

  48. Terwee CB, Bot SD, de Boer MR, van der Windt DA, Knol DL, Dekker J, et al. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007;60(1):34–42.

    Article  PubMed  Google Scholar 

  49. Watson HK, Ballet FL. The SLAC wrist: scapholunate advanced collapse pattern of degenerative arthritis. J Hand Surg Am. 1984;9(3):358–65.

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

The authors are grateful to the individuals who volunteered to participate.

Funding

Open access funding provided by Lund University. The study was supported by grants from Lund University and Skåne University Hospital and by governmental funding of clinical research within the NHS (National Health Services).

Author information

Authors and Affiliations

Authors

Contributions

SL, EB and EE contributed to the design of the study and SL and EE drafted the manuscript. All authors participated in the interpretation of the results and revised the manuscript. All authors have read and approved the final manuscript. In addition, SL collected the data and performed the statistical analysis together with EE and EB evaluated the radiographs.

Corresponding author

Correspondence to Sara L. Larsson.

Ethics declarations

Ethics approval and consent to participate

The study was performed in accordance with the ethical guidelines stated in the Declaration of Helsinki and approved by the Swedish Ethical Review Authority (Dnr 2019–02437). Prior to inclusion, information about the purpose of the study was provided, and each person gave their written informed consent to participate.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s note

Springer nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Larsson, S.L., Brogren, E., Dahlin, L.B. et al. Psychometric properties of patient-reported outcome measures (PROMs) in wrist osteoarthritis: test–retest reliability and construct validity. BMC Musculoskelet Disord 23, 558 (2022). https://doi.org/10.1186/s12891-022-05511-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12891-022-05511-6

Keywords