- Research article
- Open Access
- Open Peer Review
Measure of activity performance of the hand (MAP-Hand) questionnaire: linguistic validation, cultural adaptation and psychometric testing in people with rheumatoid arthritis in the UK
BMC Musculoskeletal Disorders volume 19, Article number: 275 (2018)
Developed in the Norway, the Measure of Activity Performance of the Hand (MAP-Hand) assesses 18 activities performed using the hands. It was developed for people with rheumatoid arthritis (RA) using patient generated items, which are scored on a 0–3 scale and summarised into a total score range (0 to 54). This study reports the development and psychometric testing of the British English MAP-Hand in a UK population of people with RA.
Recruitment took place in the National Health Service (NHS) through 17 Rheumatology outpatient clinics. Phase 1 (cross-cultural adaptation) involved: forward translation to British English; synthesis; expert panel review and cognitive debriefing interviews with people with RA. Phase 2 (psychometric testing) involved postal completion of the MAP-Hand, Health Assessment Questionnaire (HAQ), Upper Limb HAQ (ULHAQ), Short-Form 36 (SF-36v2) and Disabilities of the Arm Shoulder Hand (DASH) to measure internal consistency (Cronbach’s alpha); concurrent validity (Spearman’s correlations) and Minimal Detectable Difference (MDC95). The MAP-Hand was repeated three-weeks later to assess test-retest reliability (linear weighted kappa and Intra-Class Correlations (ICC (2,1)). Unidimensionality (internal construct validity) was assessed using (i) Confirmatory Factor Analysis (CFA) (ii) Mokken scaling and (iii) Rasch model. The RUMM2030 software was used, applying the Rasch partial credit model.
In Phase 1, 31 participants considered all items relevant. In Phase 2, 340 people completed Test-1 and 273 (80%) completed Test-2 questionnaires. Internal consistency was excellent (α = 0.96). Test-retest reliability was good (ICC (2,1) = 0.96 (95% CI 0.94, 0.97)). The MAP-Hand correlated strongly with HAQ20 (rs = .88), ULHAQ (rs = .91), SF-36v2 Physical Functioning (PF) Score (rs = −.80) and DASH (rs = .93), indicating strong concurrent validity. CFA failed to support unidimensionality (Chi-Square 236.0 (df 120; p < 0.001)). However, Mokken scaling suggested a probabilistic ordering. There was differential item functioning (DIF) for gender. Four testlets were formed, resulting in much improved fit and unidimensionality. Following this, testlets were further merged in pairs where opposite bias existed. This resulted in perfect fit to the model.
The British English version of the MAP-Hand has good validity and reliability in people with RA and can be used in both research and clinical practice.
Rheumatoid arthritis (RA) is a chronic autoimmune disorder affecting joints and surrounding tissues . Most commonly, RA results in swollen, hot and painful joints and generalised stiffness, which worsens with rest. The hands and wrists are the most commonly affected joints. Typically, metacarpophalangeal (MCP) and proximal interphalangeal (PIP) joints become swollen and painful. As a result, people struggle with daily activities requiring gripping, pinching and carrying. If unresolved, these difficulties may lead to activity limitation, participation restriction and loss of independence in later life . Therefore, early recognition and rehabilitation of hand pain and problems may help to improve people with RA’s future health and quality of life.
The National Institute for Clinical Excellence (NICE) Guidelines for the Management of Adults with RA  recommend that patients should have access to specialist occupational therapy if they have difficulties with hand function. To maintain or improve these abilities, occupational therapists need to effectively identify individual’s difficulties and evaluate therapy outcomes. To this end, valid, reliable patient-informed Patient Reported Outcome Measures (PROMs) that are relevant to the interventions rheumatology occupational therapists provide are necessary, but there is only one appropriate measure is currently validated for use in the UK . Several self-reported measures of hand function are available for use in RA [5,6,7,8] however these did not involve patients in their development . It is increasingly recognised that patients should inform the development of such measurement tools  to ensure that issues most relevant and important to them are included.
The Measure of Activity Performance of the Hand (MAP-Hand) questionnaire is an 18-item PROM of hand activity performance, which was developed and rigorously tested in Norway. It has good evidence of reliability and validity in Norwegian people with RA (n = 134) . To ensure items were representative of normal hand function, items were matched to the eight main handgrips using the Sollerman handgrip classification . Rasch analysis was used to finalise the 18-itemstructure representing a range of item difficulty. The scale is unidimensional and has a high person separation index of 0.93. Test-retest reliability is good (ICC = 0.94) although only conducted with 34 people. The MAP-Hand significantly correlated with the AIMS2 hand and finger function (r = 0.78) and arm scales (r = 0.66) . Following testing the MAP-Hand was translated into North American English in accordance with the recommended translation procedures for scale development . The MAP-Hand developers recommended further testing in different countries to establish its psychometric properties and cultural validity.
Linguistic translation of self-administered questionnaires for use in different cultural contexts is insufficient [12, 13]. Researchers must also ensure cross-cultural adaptation to establish items are relevant and understandable to the population of interest, and whether additional items need including to avoid systematic bias . Once adapted, further psychometric testing is required to ensure content validity and reliability is retained across different cultures [12, 13]. Beaton et al.  published guidelines for cross-cultural adaptation of self-report measures to standardise this process. A decade following this, Consensus-based Standards for the selection of health Measurement Instruments (COSMIN) checklist was developed to evaluate the methodological quality of the studies reporting measurement qualities [14, 15]. More recently COSMIN methodology for evaluating the content validity of patient-reported outcome measures were proposed as a rating system to summarise the evidence of a PROM’s content validity . This is deemed to be more detailed, standardised, and transparent than earlier published guidelines, including the previous COSMIN standards .
The overall aim of this study was to develop a British English version of the MAP-Hand following the recommended linguistic and cultural adaptation guidelines and test its psychometric properties (internal consistency, construct and concurrent validity, test-retest reliability and minimal detectable difference) using both the classical testing theory and item response theory in a UK population of people with RA.
Participants were recruited from rheumatology and occupational therapy departments in 17 National Health Service (NHS) Hospitals across the UK.
Within a test-retest design it is important to ensure participants’ disease status is clinically stable at two time points to avoid risk of bias . Therefore a recent change in medication may result in changes of the participant’s hand function and/or physical and mental health status. Patients were screened at the rheumatology outpatient clinics by research nurses and occupational therapists using an eligibility checklist and excluded if they are about to or recently started (during the last 3-months) or increased dose of a biologic or disease modifying anti-rheumatic drugs (DMARDs), low dose oral steroids or received an intra-muscular steroid injection, had cognitive impairment affecting ability to understand and complete the study questionnaire; had another health condition(s) which is moderately to severely affecting their ability to participate in activities and/or hand function; had mental health problem(s) or terminal illness and it was inappropriate to request participation; or were unable to provide informed, written consent. However, people with Fibromyalgia, Osteoarthritis or other conditions that is secondary to RA (e.g. heart disease) were not excluded from the study. Those who were aged 18 years and above: able to read, write and understand English, and diagnosed with RA by a rheumatology consultant were included in the study providing written informed consent was obtained.
Phase 1: Cross-cultural adaptation
The linguistic validation and cross-cultural adaptation guidelines were followed [12,13,14]. As a North American English version of the MAP-Hand was already available from the Norwegian developers, backward translation was not required (Additional file 1). Two native British English speakers forward translated the MAP-Hand; one of whom was unfamiliar with health outcome measures and not involved in health care; and the other was a rheumatology health professional. Translators synthesised translations to resolve discrepancies and an Expert Panel reviewed translations to agree a pre-final British English MAP-Hand. The Expert Panel included health professionals (occupational therapists and a physiotherapist), native English speakers, a methodologist and a layperson with RA. The panel reviewed the MAP-Hand for semantic (i.e. do words mean the same thing?), idiomatic (e.g. presence of colloquialism or idioms), experiential and conceptual equivalence.
Cognitive de-briefing interviews
A purposive sample of participants with wide range of demographic characteristics and health status were identified from the past participants of the EDAQ [Evaluation of Daily Activity Questionnaire] study [17, 18] within five rheumatology outpatient departments in the North West of England. They were mailed an invitation letter, participant information sheet, reply form and a FREEPOST envelope. Upon receipt of a positive reply, they were telephoned by an occupational therapist to go through the eligibility checklist, explain what the study involves and provide an opportunity to answer any questions they may have prior to deciding to take part. Consenting participants were booked in to partake in a cognitive de-briefing interview and mailed the Phase 1 questionnaire booklet to complete at home, in their own time no sooner than 1 week prior to the arranged telephone or face-to-face interview. These semi-structured interviews were aimed to ascertain whether the participant found the MAP-Hand items relevant, understandable and comprehensive (i.e. did they adequately reflect the most common daily activity difficulties experienced when using their hands and whether any additional items should be included). Such interviews are recommended during PROM development to ensure participants’ understanding of their content matches the intended use . The interviewer used a five-point rating scale to assess the relevancy and ease of comprehension of each item in the MAP-Hand (relevancy was measured as 1 = not relevant to 5 = very relevant; and comprehension was measured as 1 = very easy to understand to 5 = very difficult to understand). Interviews were audio-recorded, transcribed and analysed to identify the need for recommended changes and/or the inclusion of new items. A summary of the findings was reviewed by the Expert Panel to decide whether further changes were required. Following this, a detailed report of the linguistic validation and the cross-cultural adaptation process taken place and the finalised British English MAP-Hand were submitted to the Norwegian developers for review and approval.
Phase 2: Psychometric testing
It is highly recommended that culturally adapted questionnaires should be further tested following the cross-cultural validation process to ensure the new version demonstrated the psychometric properties needed for the intended application . Therefore Phase 2 consisted of psychometric testing of the British English MAP-Hand Questionnaire.
During Phase 2, participants were recruited from rheumatology outpatient clinics within 17 NHS hospitals across the UK. These included both rural and urban populations and a wide mix of socio-demographics.
Participants were mailed a Test 1 questionnaire booklet, which included demographic and health data (e.g. age, gender, marital, educational and employment status, disease duration, medication) and following outcome measures: the (i) Health Assessment Questionnaire (HAQ) which includes ability to perform 20 daily activities rated on a 0–3 scale (0 = not at all difficult; 3 = unable to do)  and Upper Limb HAQ (ULHAQ)[7 Upper Limb HAQ items] ; the Medical Outcomes Survey 36 item Short-Form 36 (SF-36v2) from which sub-scale of Physical Function was selected [21, 22]; British English Disabilities of the Arm Shoulder Hand (DASH) which consists of 30 items, measured using five-point Likert scales, 21 daily activity ability items, five symptom items, three participation items, and one of self-image ; and Symptom Numeric Rating Scales (NRS) from the EDAQ Part 1, rating hand and wrist pain and arthritis severity on a 10-point scale [17, 18].
Participants were mailed a repeat [Test 2] questionnaire booklet two to 3 weeks later to complete at home to conduct the test-retest reliability of the British English MAP-Hand. Test 2 questionnaire booklet only included brief items on basic demographics (i.e. date of birth, gender and postcode); two single items about current health status and functioning (i.e. “Considering all the ways that your condition affects you, how have you been over the past month?” and “Overall, how much your arthritis is troubling you now compared to when you last completed this questionnaire a few weeks ago?”) and the British English MAP-Hand for repeat testing (Additional file 2).
The sample size calculation suggested that, for Rasch analysis a sample size of 243 will give 99% confidence of the person estimate being within ±0.5 logits, irrespective of whether or not the scale is well targeted to the patients . A minimum of 79 sets of repeated responses were required to demonstrate that a test-retest correlation of 0.7 differs from a background correlation (constant) of 0.45, with 90% power and 99% significance.
The MAP-hand is reported to be a unidimensional extant scale . As such, confirmation of its structure from a classical test perspective would follow from a Confirmatory Factor Analysis (CFA), where a priori there is evidence that the item set constitutes one factor . Although the Rasch model assumes unidimensionality, and this can be tested post-hoc, it can still be informative to examine the scale through a CFA, particularly as Mokken scaling also has this assumption. Following Kline, fit is determined by a non-significant chi square statistic . Approximate (or ancillary) fit statistics include the Root Mean Square Error of Approximation (RMSEA) where a value less than 0.06 would be appropriate, the Comparative Fit Index (CFI), a comparison of final model and baseline model and the Tucker Lewis Index (TLI), another incremental fit Index which adds penalties for increasing the parameters. Both indices would suggest good fit with values above 0.95. Thus in the present study the item set is fit to a CFA model in Mplus using a polychoric correlation matrix .
The Mokken scale is a non-parametric probabilistic model that utilises Loevinger’s H coefficient to determine the ‘scalability’ of a set of items. ‘H’ is a measure of the degree to which the score is able to discriminate between persons in the given sample . It has been argued that Mokken scaling is a natural starting point for item analysis, and it is used here in that context, to identify if any items from the MAP-Hand display a level of discrimination inconsistent with the expectations of the Rasch model, as represented by low values (< 0.3) of H. In the present study Mokken scaling is examined through the msp procedure in STATA 13 .
The Rasch model is widely applied to PROMs to ascertain if a quantitative structure is present for the domain(s) measured . A practical realisation of additive conjoint measurement, where data are shown to meet the model expectations, it allows the transformation of ordinal data into an interval level latent estimate [30,31,32,33,34]. The model expectations are associated with a series of assumptions, or requirements, including the stochastic ordering of items (or fit), unidimensionality and local independence . Fit is evaluated by several fit statistics, including chi-square statistics for items and in total (which should be non-significant, Bonferroni adjusted), standardised item and person residuals (within a range ± 2.5), and summary residuals with a mean of zero and standard deviation of one where data have perfect fit to the model. Local response dependency can be examined through the residual correlation matrix . When the local independence assumption is violated, items can be grouped into ‘testlets’ which absorb the dependency . When data are made into testlets, this delivers a bi-factor solution for the latent estimate, where any unique non-error variance is discarded .
A post-hoc test of unidimensionality was also undertaken following the approach described by Smith . Finally, within the Rasch model framework, emphasis is placed upon the invariance of comparisons between groups, such that at the same level of the trait being measured (e.g. hand function), the probability of response to an item should be equal across groups, otherwise Differential Item Functioning (DIF) is present and will require adjustment [40, 41]. Consequently the process of fitting data to the Rasch model, widely referred to as Rasch analysis, consists of a series of tests related to the assumptions of the model, and adjustments to accommodate deviations from those expectations. This process, in relation to the measurement of health outcomes, is described in detail elsewhere .
In the current application, emphasis is placed upon fit to the model expectations, and invariance by contextual group, in this case by age, gender, employment and marital status, duration of disease and magnitude of disability as expressed by the HAQ. The analysis used the RUMM2030 software utilising the partial credit parameterisation of the Rasch model [43, 44].
The MAP-Hand scores were compared with comparative health measures, specifically the HAQ-20 and ULHAQ ; SF-36v2 (Physical Functioning Score Norm-Based; General Health and Physical Component) [21, 22]; British English DASH ; numeric rating scales of hand and wrist pain (i.e. pain in the hand and wrist past week) and arthritis severity (i.e. effect of arthritis in the past month; pain when resting; pain when moving) . Concurrent validity was measured using Spearman’s correlations between the MAP-Hand and these comparative health measures.
Internal consistency was measured using Cronbach’s Alpha (α) and the Person Separation Index (PSI) which, should the data have a normal distribution, is equivalent to Cronbach’s Alpha .
Test-retest reliability of the MAP-Hand was assessed using linear weighted kappas from Test-1 and Test-2 items and at scale level using Rasch transformed estimates and Intra Class Correlations (ICC) (2,1). Analyses were conducted using IBM SPSS Statistics v20 and MedCalc Statistical Software.
Measurement error was assessed by transforming the MAP-Hand scores into logits and linearly transforming them to produce an interval-scale. Following this, Standard Error of Measurement (SEM) and the minimal detectable change (MDC95) score were calculated [46, 47]. If ≥15% of responders achieved either the lowest or highest score, floor and ceiling effects were considered to be present [48, 49].
Phase-1: Cross-cultural adaptation
Cognitive debriefing interviews were conducted with 31 participants. Participants’ socio-demographic and health characteristics are detailed in Table 1.
Overall, the interviews showed that the MAP-Hand items were both understandable and relevant, and take 2 minutes to complete. Specific items that were highlighted as potentially problematic were:
Item 10 (slicing bread using a knife); as participants often bought sliced bread, this item was not applicable to most (n = 24). However, most responded to the item by recalling the last time they sliced bread, such as baguettes, as it is instructed.
Item 16 (type on a computer) was not applicable to some participants (n = 7) as they didn’t use computers. As the ‘not applicable’ option is not available in the response options, these participants either left this item blank or guessed their ability based on other activities require similar input (e.g. using a mobile phone to text) to answer.
Nevertheless, all items were deemed to be relevant by the participants and expert panel and therefore retained in the British English MAP-Hand. Cultural adaptations included making small changes in the wording of three items i.e. the item 7 was reworded as “opening screw top bottles” instead of “opening bottle screw tops”; the item 8 was reworded as “opening cans (any type)” instead of “opening hermetic cans” and the item 12 was reworded as “stirring food in a pan” instead of “stirring food in a pot” (Additional file 2).
Phase 2: Psychometric testing
In Phase 2, 340 participants completed the Test 1 questionnaire and re-test was completed by the 80% (n = 273) of the responders. The recruitment progress is summarised in Fig. 1 and the participants’ socio-demographic and health characteristics are detailed in Table 1.
Construct validity (Mokken and Rasch models)
A CFA failed to support the unidimensional structure of the 18-item set of the British English MAP-Hand (Chi-Square 236.0 (df 120; p < 0.001). RMSEA was 0.53 (90%CI:0.43–0.63); CFI 0.995; TLI 0.994. Mokken scaling suggested that all 18 items showed a probabilistic ordering with a moderate scaling level of 0.61. Initial fit of the 18 items of the MAP-Hand to the Rasch model showed some misfit to the model (Table 2, Analysis 1), and a significant breach of the unidimensionality assumption. DIF was largely absent across all contextual factors, but was present for gender for the item 3 ‘ tying shoelaces’, the item 7 ‘opening screw top bottles’ and the item 18 ‘carrying heavy objects’. At any level of hand function, males were more likely to score higher (worse) than females for tying shoelaces, and females were more likely to score higher than males with opening screw top bottles. Reliability was high, but possibly inflated by the local dependency.
Clusters of locally dependent items could be observed. For example, button, socks and laces, any form of opening jars or cans and carrying bags or heavy objects. Consequently four testlets were formed from the item set and the data refitted to the model. Here fit was much improved and the unidimensionality assumption held (Analysis 2). The average latent correlation between the four testlets was 0.91, and the amount of common non-error variance in the latent estimate was 0.97, meaning that just 3% of the non-error variance was discarded. This suggests the earlier breach of the unidimensionality assumption was caused by clusters of locally dependent items. Nevertheless, some gender DIF persisted. As earlier it was noted that some items favoured males, and others favoured females, the testlets were further merged in to pairs where opposite bias existed. This resulted in perfect fit to the model, and no DIF (Analysis 3). The scale and patients were slightly off-target in that the latter were more able (less difficulties) than the average of the scale (Fig. 2). However, the floor effect was minimal (5.6%). Table 3 provides the ordinal raw score- interval scale transformation.
A significant gradient of the transformed metric is seen across groups of functional limitation as defined by the HAQ (Table 4, Fig. 3) (ANOVA F = 217.1; p < 0.001). Females also showed more limitations in hand function than males (t-test; t = 3.1; p = 0.002). Duration of disease also showed a significant difference, mainly due to the group with duration over 21 years (ANOVA post-hoc tests). There was no significant difference by age group (ANOVA F = 0.254; p 0.851).
MAP-HAND correlated strongly with HAQ20 (rs = .88), ULHAQ (rs = .91), SF-36v2 (PF) Score (rs = −.80) and DASH (rs = .93), indicating strong concurrent validity (Table 5).
Internal consistency (reliability)
The Map-Hand showed a high Person Separation Index reliability (PSI), even after adjustment for local dependency (PSI range: 0.95–0.92). Reliability measured by Cronbach Alpha (α) was also excellent (α = 0.96).
Test-retest reliability was good: at item-level linear weighted kappa scores were good (range 0.61–0.75); at scale level, the ICC (2,1) score was 0.96 (95% CI 0.94, 0.97).
The SEM was 1.44 and the MDC95 score was 3.99. There was no floor or ceiling effects present.
Summary of the results
The MAP-Hand questionnaire was linguistically validated and culturally adapted using the recommended guidelines [12,13,14] in a UK population of adults aged ≥18 years with RA . The British English version of the MAP-Hand retained all original 18 items, with some changes to the phrasing of these and the instructions provided to make it easily understandable by British English speakers. The British English MAP-Hand was then psychometrically tested using both classical test theory and item response theory to provide quantitative assessments of the validity and reliability of this PROM in a UK population of adults with RA. The results of the analyses support the validity and the reliability of the British English Map-Hand [50, 51]. The raw score is a sufficient statistic for hand function, and an interval scale metric is provided on Table 3 .
The British English MAP-Hand is a brief, valid and reliable measure of hand activity performance, which can be completed in an average of 2 minutes by people with RA. Due to its ease of use and precision it is an ideal questionnaire to utilise in busy clinical environments, such as the NHS outpatient rheumatology and hand therapy clinics. As a psychometrically robust measure, it can be used to evaluate clinical outcomes, for research purposes, and to describe the patterns and extent of hand activity performance in people with RA in the UK.
Implications for clinical and research practice
The British English MAP-Hand correlated highly with the HAQ20 and ULHAQ . Although these outcomes are measuring similar constructs, the MAP-Hand was developed using patient generated items and this is reflected in the way in which the functional limitation is defined and scores are calculated. For instance, the hand activity performance is measured in the MAP-Hand at a person-level functioning (activity)  with or without the use of aids and adaptations (i.e. gadgets). This means that the MAP-Hand functional limitation score does not increase when the responder appraise their ability to do an activity such as “Writing by hand” (item 15) as “No difficulty” due to their use of pen grip to reduce pain and ease function. People with RA are increasingly encouraged to self-manage their condition, which includes adherence to joint protection advice, that may require behavioural modifications and the use of aids and environmental adaptations . In the hands PIP, thumb interphalangeal (IP) and Distal Interphalangeal (DIP) joints are most involved in executing tasks, hence the use of gadgets such as adapted cutlery, can and bottle opener, zip puller and button hooks are recommended by occupational therapists to encourage joint protection and decrease dependency on others to help with such activities (i.e. the need for help and assistance), decrease pain and prevent joint damage. Therefore use of aids and environmental adaptations are viewed as enablers of function and independence, rather than a marker of disability. Nevertheless, the scoring system used within the HAQ [21, 22] increases with the use of special device by 1 point, if the patient ‘needs’ help from others to do an activity by 2 points and if the patient usually needs both a special device and help from another person by 3 points . Moreover, permanent adaptations of the person’s environment (e.g. changing faucets in the bedroom or kitchen, using Velcro closures on clothing) are also counted as aids and devices . This means, even if the responder has chosen to appraise their ability to do an activity as “without any difficulty” [scored 0] due to their use of aids and adaptations to enable function, their disability score will increase when they disclose the use of aids to help their functioning, unlike how the MAP-Hand is scored.
Another comparative scale that is commonly used in clinical and research practice and recently culturally validated for use in the UK is DASH . DASH scores were also highly correlated with the MAP-Hand scores in this study (Table 6). DASH is a considerably larger, comprehensive scale of upper limb function, which also includes optional modules for assessing upper limb performance at work (WORKDASH) and measuring abilities and symptoms of athletes and performing artists (SPAMDASH). The optional modules are scored separately . As the MAP-Hand, the DASH scores are calculated based on the responder’s ability, regardless of how they do the task. However, unlike the MAP-Hand the DASH includes items measuring both activity limitation (person-level) and participation restriction (societal-level) and as well as the activities performed using hands, the arm and shoulder function is also measured. Although DASH is a comprehensive assessment including 30 items, QUICKDASH is available as a shorter version and consists of 11 items (6 daily activity ability; two symptoms (pain and tingling); and three participation) [4, 55]. Therefore, although the MAP-Hand and the DASH questionnaires appear to measure the same construct at a first glance, their remits differ at conceptual and measurement level and clinicians and researchers should take these differences into consideration if they are having to choose the use of one measure over the other.
In this study the unidimensionality of the MAP-Hand was challenged by the CFA, but supported by the Rasch analysis. In both cases, substantial adjustments had to be made to accommodate the local dependency of the item set. The clusters of locally dependent items made clinical sense, grouping items with similar functional requirements, such as opening jars or cans.
Differential item function may also have contributed to the disturbance of dimensionality. The presence of DIF is not uncommon in health status measures of functioning . The fact that, at any level of hand function males had more difficulty tying shoelaces may simply reflect that women are less likely to wear shoes with laces. Also that at any level of hand function, women have more problem opening jars may simply be a function of men having stronger grip. However at the scale level, the item DIF was cancelled out and, so as long as all 18 items are administered, the total score should remain unaffected.
In the analysis of the local independence, assumption took prominence, which was not reported in the original validation. This can inflate reliability, although in this case only marginally, and the level of reliability remained high, consistent with individual use. By using a testlet solution to absorb the local dependency, a satisfactory fit was achieved. The metric conversion that follows good fit will allow for an appropriate calculation of change scores, as well as aspects such as the minimal important difference, the calculations of which are invalid on ordinal scales .
We only tested the British MAP-Hand in people with RA. Further testing of the British MAP-Hand in other conditions is needed to ensure the scale has validity and reliability for use in these conditions. In addition, further studies should consider longitudinal design with multiple follow-up points to test the British English MAP-Hand’s ability to detect change in hand function over time (i.e. Responsiveness).
The British English version of the MAP-Hand has been linguistically and culturally validated, and found to be a valid and reliable measure of hand function for people with RA in the UK. The British English MAP-Hand meets the COSMIN standards for evaluating the quality of PROM items  and can be used in both clinical practice and research.
Confirmatory Factor Analysis
Comparative Fit Index
Disabilities of the Arm Shoulder Hand
Differential Item Functioning
Disease Modifying Anti-Rheumatic Drugs
Health Assessment Questionnaire
The Measure of Activity Performance of the Hand
- MDC95 :
Minimal Detectable Difference
National Health Service
The National Institute for Clinical Excellence
Numeric Rating Scales
Patient Reported Outcome Measures
Person Separation Index
Standard Error of Measurement
- SF-36v2 :
Tucker Lewis Index
Upper Limb HAQ
Imboden JB, Hellman DB, Stone JH. Current Dagnosis & Treatment Rheumatology. 3rd ed. USA: The McGraw-Hill Companies; 2013.
Hammond A, Prior Y. The effectiveness of home hand exercise programmes in rheumatoid arthritis: a systematic review. Br Med Bull. 2016:1–14. https://doi.org/10.1093/bmb/ldw024.
National Collaborating Centre for Chronic Conditions. In: Rheumatoid arthritis: national clinical guideline for management and treatment in adults. Royal College of Physicians; 2009. https://www.ncbi.nlm.nih.gov/pubmedhealth/PMH0009576/. Accessed 29 Mar 2018.
Hammond A, Prior Y, Tyson S. Linguistic validation, Validity and Reliability of the British English versions of the Disabilities of the Arm, Shoulder and Hand (DASH) questionnaire and QuickDASH in people with rheumatoid arthritis. BMC Musculoskelet Disord. 2018; 19–118 doi: https://doi.org/10.1186/s12891-018-2032-8. The British English DASH is available at: http://www.dash.iwh.on.ca/sites/dash/public/translations/DASH_English_UK.pdf Accessed 29 Mar 2018.
Durez P, Fraselle V, Houssiau F, Thonnard J-L, Nielens H, Penta M. Validation of the ABILHAND questionnaire as a measure of manual ability in patients with rheumatoid arthritis. Ann Rheum Dis. 2007;66:1098–105.
Duruoz MT, Poiraudeau S, Fermanian J, Menkes CJ, Amor B, Dougadoz M, et al. Development and validation of a rheumatoid hand functional disability scale that assesses functional handicap. J Rheumatol. 1996;23:1167–72.
Leeb BF, Sautner J, Andel I, Rintelen B. SACRAH: a score for assessment and quantification of chronic rheumatic affections of the hands. Rheumatology (Oxford, England). 2003;42:1173–8.
Chung KC, Pillsbury MS, Walters MR, Hayward RA, Arbor A. Reliability and validity testing of the Michigan hand outcomes questionnaire. J Hand Surg [Am]. 1998;23:575–87.
Paulsen T, Grotle M, Garratt A, Kjeken I. Development and psychometric testing of the patient-reported measure of activity performance of the hand (MAP-Hand) in rheumatoid arthritis. J Rehabil Med. 2010;42:636–44.
Kirwan JR, Hewlett SE, Heiberg T, Hughes RA, Carr M, Hehir M, et al. Incorporating the patient perspective into outcome assessment in rheumatoid arthritis: progress at OMERACT 7. J Rheumatol 2005; 32:2250–2256.
Sollerman C, Ejeskar A. Sollerman hand function test. A standardised method and its use in tetraplegic patients. Scand J Plast Reconstractive Surg Hand Surg. 1995;29:167–76.
Beaton DE, Bombardier C, Guillemin F, Ferraz MB. Guidelines for the process of cross-cultural adaptation of self-report measures. Spine. 2000;25(24):3186–91.
Acquadro C, Joyce CRB, Patrick DL, Ware JE, Wu AW. Linguistic validation manual for patient-reported outcomes (PRO) instruments. Mapi Res Trust. 2004; https://store.mapigroup.com/. Accessed 29 Mar 2018
Mokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL et al. The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: an international Delphi study. Qual Life Res 2010;19:539–549.
Mokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL, Bouter LM, de Vet HCW. International consensus on taxonomy, terminology, and definitions of measurement properties: results of the COSMIN study. J Clin Epidemiol. 2010;63:737–45.
Terwee CB, Prinsen CAC, Chiarotto A, Westerman MJ, Patrick DL, Alonso J, Bouter LM, de Vet HCW, Mokkink LB. COSMIN methodology for evaluating the content validity of patient-reported outcome measures: a Delphi study. Qual Life Res. 2018; https://doi.org/10.1007/s11136-018-1829-0.
Hammond A, Tyson S. Prior Y, Hawkins R, Tennant a, Nordenskiold U, Thyberg I, Sandqvist G, Cederlund R. Linguistic validation and cultural adaptation of an English version of the evaluation of daily activity questionnaire in rheumatoid arthritis. Health Qual Life Outcomes. 2014;12(143)
Hammond A, Tennant A, Tyson A, Nordenskiold U, Hawkins R, Prior Y. The reliability and validity of the English version of the evaluation of daily activity questionnaire for people with rheumatoid arthritis. Rheumatology. 2015;54(9):1605–15.
Kirwan JR, Reeback JS. Stanford health assessment questionnaire modified to assess disability in British patients with rheumatoid arthritis. Br J Rheumatol. 1986;25(2):206–9.
Johnsson P, Eberhardt K. Hand deformities are important signs of disease severity in patients with early rheumatoid arthritis. Rheumatology. 2009;48:1398–401.
Ware JE Jr, Sherbourne CD. The MOS 36-item short-form health survey (SF-36). Conceptual framework and item selection. Med Care. 1992;30(6):473–83.
Ware JE Jr. SF-36 health survey update. Spine. 2000;25(24):3130–9.
Linacre JM. Sample Size and Item calibration stability. Rasch Meas Trans. 1994;7(4):328.
Brown TA. Confirmatory factor analysis for applied research: 2nd ed. London: Guildford Press; 2006.
Kline RB. Principles and practice of structural equation modeling third edition edn. New York: Guildford Press; 2011.
Muthén LK, Muthén BO. Mplus User's Guide. 6th ed. Los Angeles: Muthén & Muthén; 2011.
Christensen KB, Kreiner S. Monte Carlo tests of the Rasch model based on scalability coefficients. Br J Math Stat Psych. 2010;63:101–11.
StataCorp: Stata Statistical Software: Release 13. In. College Station: TX: StataCorp LP; 2013.
Leung YY, Png ME, Conaghan P, Tennant A. A systematic literature review on the application of Rasch analysis in musculoskeletal disease - a special interest group report of OMERACT 11. J Rheum. 2014;41(1):159–64.
Rasch G. Probabilistic models for some intelligence and attainment tests. Chicago: University of Chicago Press; 1960.
Luce RDTJ. Simultaneous conjoint measurement: a new type of fundamental measurement. J Math Psych. 1964;1:1–27.
Fischer GH, Molenaar IW. Rasch models: foundations, recent developments, and applications. New York: Springer; 1995.
Bond TG, Fox CM: Applying the Rasch Model. Fundamental Measurement in the Human Sciences. 2nd ed. Mahwah: University of Toledo; 2007.
Newby VA, Conner GR, Grant CP, Bunderson CV. The Rasch model and additive conjoint measurement. J Appl Meas. 2009;10(4):348–54.
Gustafsson JE. Testing and obtaining fit of data to the Rasch model. Br J Math Stat Psychol. 1980;33:205–33.
Marais I, Andrich D. Formalizing dimension and response violations of local independence in the unidimensional Rasch model. J Appl Meas. 2008;9(3):200–15.
Wainer HKG. Item clusters and computer adaptive testing: a case for testlets. J Educ Meas. 1987;24:185–202.
Andrich D. Cronbach’s alpha in the presence of subscales. In: International conference on outcomes measurement. Maryland: Bathesda. p. 2010.
Smith EV Jr. Detecting and evaluating the impact of multidimensionality using item fit statistics and principal component analysis of residuals. J Appl Meas. 2002;3(2):205–31.
Tennant A, Penta, M., Tesio, L., Grimby, G., Thonnard, J-L., Slade, A. et al. Assessing and adjusting for cross cultural validity of impairment and activity limitation scales through differential item functioning within the framework of the Rasch model: the pro-ESOR project. Med Care 2004; 42(Suppl 1):37–48.
Terresi JA, Kleinman M, Ocepek-Welikson K. Modern psychometric methods for detection of differential item functioning: application to cognitive assessment measures. Stat Med. 2000;19:1651–83.
Tennant A, Conaghan PG. The Rasch measurement model in rheumatology: what is it and why use it? When should it be applied, and what should one look for in a Rasch paper? Arthritis Rheum. 2007;57(8):1358–62.
Masters G. A Rasch model for partial credit scoring. Psychometrika. 1982;47:149–74.
Andrich D, Sheridan BED, Luo G. RUMM2030: Rasch unidimensional models for measurement. Western Australia: RUMM. Laboratory. 2009;
Cronbach IJ. Coefficient alpha and the internal structure of tests. Psychometrika. 1951;16:297–333.
Stratford PW. Getting more from the literature: estimating the standard error of measurement from reliability studies. Physiother Can. 2004;56:27–30.
Donoghue D. Physiotherapy research and older people (PROP) group, stokes EK: how much change is true change? The minimum detectable change of the berg balance scale in elderly people. J Rehabil Med. 2009;41:343.
Terwee CB, Bot SDM, de Boer MR, et al. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol 2007; 60:34–42.
Fitzpatrick R, Davey C, Buxton MJ, Jones DR. Evaluating patient-based outcome measures for use in clinical trials. Health Technol Assess. 1998;2(i-iv):1–74.
Prior Y, Hammond A, Tyson S, Tennant A. Development and testing of the British English measure of activity performance of the HAND (MAP_HAND) questionnaire in rheumatoid arthritis. Ann Rheum Dis. 2015;74(Suppl 2):1324. https://doi.org/10.1136/annrheumdis-2015-eular.3404.
Prior Y, Tennant A, Hammond A, Tyson S. Psychometric testing of measure of activity performance in the hand (Map-Hand) questionnaire in rheumatoid arthritis: Rasch analysis. Clin Rehabil. 2015;29(10):1014.
International Classification of Functioning. Disability, and health. ICF. Geneva: World Health Organization; 2001.
Hammond A, Bryan J, Hardy A. Effects of a modular behavioural arthritis education programme: a pragmatic parallel-group randomized controlled trial. Rheumatology. 2008;47:1712–8.
The health assessment questionnaire (HAQ) disability index (di) of the Clinical health assessment questionnaire. https://www.niehs.nih.gov/research/resources/assets/docs/haq_instructions_508.pdf. Accessed 29 03 2018.
Kennedy CA, Beaton DE, Solway S, McConell S, Bombardier C. The DASH and QuickDASH outcome measure user’s manual. 3rd ed. Institute for Work & Health: Toronto; 2011.
Randall M, Imms C, Carey LM, Pallant JF. Rasch analysis of the Melbourne assessment of unilateral upper limb function. Dev Med Child Neurol. 2014;56(7):665–72.
Rouquette A, Blanchin M, Sebille V, Guillemin F, Cote SM, Falissard B, Hardouin JB. The minimal clinically important difference determined using item response theory models: an attempt to solve the issue of the association with baseline score. J Clin Epidemiol. 2014;67(4):433–40.
The authors wish to thank all the study participants; Robert Peet and Kate Woodward-Nut, Centre for Health Sciences, University of Salford, for assistance with data collection and data entry; and all the Principal Investigators, rheumatology consultants, rheumatology and research nurses and occupational therapists assisting with participant recruitment and study support at the participating sites: Prof Terry O’Neill, Ann McGovern, Jennifer Green, Angharad Walker (Salford Royal Hospital); Prof Ian Bruce, Lindsey Barnes, Elizabeth Beswick, Sarah Evans (Manchester Royal Infirmary); Dr. Leena Dass, Dr. Sophia Naz, Lorraine Lock (North Manchester General Hospital); Dr. Chris Deighton, Alison Booth, Jo Morris (Royal Derby Hospital); Prof David Walsh, Debbie Wilson, Jayne Smith (Kings Mill Hospital, Sherwood Forest Hospitals NHS Foundation Trust); Dr. Chetan Mukhtyer, Loretta Dean, Susan Rowell (Norfolk and Norwich Hospitals); Dr. Bela Szenbenyi, Carol Gray (Diana Princess of Wales, Grimsby); Dr. Mike Green, Anne Gill, Lisa Carr (York Hospital); Dr. Kirsten Mackay, Julie Easterbrook, Liz Burnett (Torbay Hospital); Dr. Mike Green, Alison Miernik, Rachel Bailey-Hague (Harrogate District Hospital); Dr. Atheer Al-Ansari, Jayne Edwards, Julia Nicholas (Robert Jones & Agnes Hunt Hospital, Oswestry); Dr. Wendy Holden, Janet Cushnaghan, Angie Dempster, Hayley Paterson (Basingstoke and North Hampshire Hospital); Mr. David Johnson, Lindsey Barber, Jan Smith (Stepping Hill Hospital); Dr. Karen Douglas, Lucy Kadiki, Chitra Ramful, Daljit Kaur (Russell Hall Hospital, Dudley); Dr. Anca Ghiurlic, Christine Graver (Royal Hampshire Hospital, Winchester); Dr. Frank McKenna, Jane McConiffe (Trafford Hospitals); Dr. Sophia Naz and Lorraine Lock (Fairfield Hospital).
This paper presents independent research funded by the Arthritis Research UK [Grant No: 20031]. The views expressed of the authors are not necessarily of those of the NHS or Arthritis Research UK. The sponsor and the funding source (Arthritis Research UK) had no role in the design of this study, its execution, analyses, interpretation of the data, or decision to submit results, apart from study oversight.
Availability of data and materials
Data and materials can be accessed through a request from the lead author.
AH conceived the study and was the Chief Investigator. AH, YP, AT and ST initiated the study design. AT, AH and YP conducted the statistical analysis. IK was an advisor and TSC member. All authors contributed to refinement of the study protocol and approved the final manuscript.
Ethics approval and consent to participate
Ethical approval was obtained from the NRES Committee North West (Greater Manchester North) [12/NW/0841] and the University of Salford Research Ethics Panel prior to the start of the study. Approvals have been obtained from the Research and Development departments and therapy service managers at each hospital. Good Clinical Practice (GCP) and informed consent training were completed by all Occupational Therapists and research facilitators taking part in this study. Eligible participants had provided written, informed consent prior to the commencement of the study.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
- Patient reported outcome measures
- Hand activity performance
- Hand function
- Hand pain
- Psychometric testing
- Rasch analysis