- Research article
- Open Access
- Open Peer Review
Minimal clinically important decline in physical function over one year: EPOSA study
BMC Musculoskeletal Disordersvolume 20, Article number: 227 (2019)
The Australian/Canadian hand Osteoarthritis Index (AUSCAN) and the Western Ontario and McMaster Universities knee and hip Osteoarthritis Index (WOMAC) are the most commonly used clinical tools to manage and monitor osteoarthritis (OA). Few studies have as yet reported longitudinal changes in the AUSCAN index regarding the hand. While there are published data regarding WOMAC assessments of the hip and the knee, the two sites have always evaluated separately. The current study therefore sought to determine the minimal clinically important difference (MCID) in decline in the AUSCAN hand and WOMAC hip/knee physical function scores over 1 year using anchor-based and distribution-based methods.
The study analysed data collected by the European Project on Osteoarthritis, a prospective observational study investigating six adult cohorts with and without OA by evaluating changes in the AUSCAN and WOMAC physical function scores at baseline and 12–18 months later. Pain and stiffness scores, the performance-based grip strength and walking speed and health-related quality of life measures were used as the study’s anchors. Receiver operating characteristic curves and distribution-based methods were used to estimate the MCID in the AUSCAN and WOMAC physical function scores; only the data of those participants who possessed paired (baseline and follow up-measures) AUSCAN and WOMAC scores were included in the analysis.
Out of the 1866 participants who were evaluated, 1842 had paired AUSCAN scores and 1845 had paired WOMAC scores. The changes in the AUSCAN physical function score correlated significantly with those in the AUSCAN pain score (r = 0.31). Anchor- and distribution-based approaches converged identifying 4 as the MCID for decline in the AUSCAN hand physical function. Changes in the WOMAC hip/knee physical function score were significantly correlated with changes in both the WOMAC pain score (r = 0.47) and the WOMAC stiffness score (r = 0.35). The different approaches converged identifying two as the MCID for decline in the WOMAC hip/knee physical function.
The most reliable MCID estimates of decline over 1 year in the AUSCAN hand and WOMAC hip/knee physical function scores were 4 and 2 points, respectively.
The Australian/Canadian Hand Osteoarthritis Index (AUSCAN)  and the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) [2, 3] scales are self-report instruments measuring pain, stiffness and physical function linked to osteoarthritis (OA), and have been used by the European Project on Osteoarthritis (EPOSA) to assess personal and societal variables affected by OA, such as quality of life (QoL), social participation, and health care use in several ageing European cohorts. The individuals enrolled in the project were receiving treatment for severe OA, had undiagnosed or untreated OA or did not have OA at all .
The Minimum Clinically Important Difference (MCID) is defined as the smallest change in a score that a patient perceives as beneficial or detrimental . There are different types of MCID, depending on whether there has been an improvement or a worsening in the variable being measured and on the external standard being employed .
Until now, to our knowledge, the MCID in the AUSCAN hand and WOMAC hip/knee physical function scales has received scarce attention. Specifically few studies report longitudinal changes in the AUSCAN . While some studies have investigated the WOMAC scales [7,8,9], the two sites of hip and knee have always been evaluated separately [10,11,12,13,14,15]. Moreover, the MCID has almost always been considered from an improvement perspective, as the majority of studies have aimed to examine the efficacy of pharmacological interventions [11, 13], rehabilitation programs [8, 9], and/or of surgical treatments [10, 12, 14, 15].
The aim of the current study was therefore to estimate the MCID in the AUSCAN and WOMAC physical function subscales using distribution-based and anchor-based methods for longitudinal changes. We postulated that the AUSCAN and the WOMAC physical function scores would worsen [16, 17] with time (i.e., there would be a rise in both) and that the changes in the AUSCAN and WOMAC physical function scores would correlate significantly with changes in other well-established OA health variables or performance-based measures [18,19,20,21,22,23,24,25].
The current study analysed data collected by the European Project on OSteoArthritis (EPOSA), a population-based study involving cohorts living in Germany, Italy, the Netherlands, Spain, Sweden, and the UK, that recruited 2942 adults between the ages of 65–85. All the participants gave written informed consent; the study design and methodology are outlined in detail elsewhere . The study design was granted approval by the appropriate local ethics committees (Germany: Universitat Ulm Ethikkommission [312/08]. Italy: Comitato Etico Provinciale Treviso [XLIV-RSA/AULSS7]. The Netherlands: Medisch Ethische Toetsingscommissie Vrije Universiteit Amsterdam [2002/141]. Spain: Comité Ético de Investigación Clínica del Hospital Universitario La Paz Madrid [PI-1080]. Sweden: Till forskningsetikkommittén vid Karolinska Instituted Stockholm [00–132]. UK: Hertfordshire Research Ethics Committee [10/H0311/59]).
The project aimed to evaluate the participants once at baseline (between November 2010 and November 2011) and a second time 12–18 months later. During the assessment the participants underwent a clinical examination and were interviewed at home or in a health care centre by trained physicians and nurses using a standardized questionnaire.
Physical function, pain and stiffness of hand OA were assessed at baseline and 12–18 months later using the three subscales of the AUSCAN (15 items grouped into 3 scales: pain (5 items), stiffness (1 item), and physical function (9 items)) that utilized a 5-point Likert scale (responses ranged from none to extreme; 0 = none, 1 = mild, 2 = moderate, 3 = severe, and 4 = extreme) .
Physical function, pain and stiffness of hip and/or knee OA were measured at baseline and 12–18 months later using the three subscales of the WOMAC (24 items grouped into 3 scales: pain (5 items), stiffness (2 items), and physical function (17 items)) that utilized a 5-point Likert scale. Hip/knee pain and stiffness were defined as the maximum value of two joints [2, 3].
The Grip strength of both hands at baseline and 12–18 months later was measured twice, using a strain gauge dynamometer and the result (the highest values of the right and left hands were summed and divided by 2) is expressed in kilograms .
Walking speed was measured by time, registered in seconds, for a 3-m course marked out on the floor with no obstructions for an additional 2 ft at both ends.
Anxiety and depression were evaluated at baseline and 12–18 months later using the Hospital Anxiety and Depression Scales (HADS), a 14-item self-assessment instrument that measures anxiety and/or depression separately . Scores that are 8 or higher (scores range between 0 and 21) on each/either of the subscales indicate altered states.
Health-related QoL was measured at baseline and 12–18 months later using: the 5-level EQ-5D, consisting of a descriptive system comprising five dimensions (mobility, self care, usual activities, pain/discomfort, anxiety/depression) and the EQ VAS, a vertical visual analogue scale . The scores of the EQ-5D were converted into a single index, based on general population surveys, using the time trade-off (TTO) valuations from the general population of the UK; scores between − 0.594 and 1. 1 indicate good or satisfying health. Scores of the EQ-VAS range between 0 and 100, with higher scores indicating better health.
Clinical diagnosis of OA was formulated on the basis of the participant’s medical history and a physical examination (only at baseline), utilising algorithms in accordance with the clinical criteria developed by the American College of Rheumatology (ACR)  and the recommendations of the European League Against Rheumatism .
Clinical hand OA (classified as present vs absent) was diagnosed using specific AUSCAN sections : the cut-off in the algorithm for hand pain was ≥3 and it was ≥1 for stiffness. At least 2 of the following criteria were required for a diagnosis of hand OA: a) hard tissue enlargement of two or more joints, b) hard tissue enlargement of two or more distal inter-phalangeal joints, c) deformity of at least one hand joint. Swelling of the metacarpophalangeal joints criteria was a variable that was assessed only in the English and German participants.
Clinical hip/knee OA, defined as the presence of OA in at least one or both of these joints, was diagnosed on the basis of the outcome of specific WOMAC sections. Pain in the hip/knee on at least one side [2, 3] was evaluated during the physical examination using a cut-off ≥3. For the participants, to be diagnosed with knee OA, at least two of the following were necessary: a) a morning stiffness score from mild to extreme; b) crepitus with active motion on at least one side at the physical examination; c) bone tenderness on at least one side at the physical examination; d) bone enlargement at the physical examination on at least one side; e) no palpable warmth of synovium at the physical examination in either knees. All of the following were needed for a positive hip OA assessment: a) pain in the hip on at least one side associated with restricted hip internal rotation at a physical examination; b) morning stiffness of the hip lasting < 60 min, evaluated using the stiffness section of the WOMAC with a score from mild to extreme.
Data analyses and graphical presentations were carried out using SAS software (SAS System, SAS Institute Inc., Cary, NC), version 9.4. Data were analysed using a set of weights calculated per sex and per 5-year age class with respect to the 2010 Standard European Population .
The changes over times (in the 12–18 months between the baseline evaluation and the follow-up one) were evaluated as continuous variables using the non-parametric signed rank test. Spearman’s correlation was used to compare the changes in the AUSCAN and WOMAC physical function scales and the changes in the other variables; the Cronbach α coefficient was used to measure the scales’ reliability (internal consistency) (values of α ≥ 0.7 reflect a good reliability) .
Only the data of the participants whose assessments were considered complete, that is they had completed both the baseline and follow-up assessments, were included in the statistical analysis. The MCID was calculated by measuring the changes from basal to follow-up measurements scores. Since the MCID for subject-reported outcome measures may vary in different populations and depending on the context, as recommended by Revicki et al. , we used multiple approaches to estimate the MCID in the AUSCAN and WOMAC physical function scores to triangulate on a single value or on a small range of values.
For anchor-based estimation of MCID we used the receiver operating characteristic (ROC) curve on the change score in the anchor. The variables assessed as possible anchors for the AUSCAN hand OA physical function score were: the AUSCAN for hand OA Pain, the AUSCAN for hand OA Stiffness, the Grip strength, the HADS anxiety, the HADS depression, the EQ-5D-5 L, and the EQ VAS. The variables evaluated as possible anchors for the WOMAC for hip/knee OA physical function score were: the WOMAC for hip/knee OA Pain, the WOMAC for hip/knee OA Stiffness, the Walking-test time, the HADS anxiety, the HADS depression, the EQ-5D-5 L, and the EQ VAS.
An anchor should be chosen because of a significant correlation between the change in the physical function score and the change in the anchor and a correlation coefficient ≥ |0.30| .
A ROC curve was constructed for those participants showing stable or worsened anchor scores; the area under the curve (AUC) summarizes the instrument’s ability to distinguish between individuals who have or do not have a minimal clinically important difference in functionality. The criteria used to calculate the probability of an optimal cut-off were: the Youden index (J) , the Euclidean distance (D), and the equality sensitivity and specificity (S). The percentage of participants exceeding the MCID were estimated for each cut-off value.
The following were considered for the distribution based-methods:
The standard error of measurement (SEM)  of the changes, considering a 63% confidence interval (CI) (SEM63), a 90% CI (SEM90), and a 95% CI (SEM95).
The Edwards-Nunnally (EN) method , at the 90% CI (EN90), and at the 95% CI (EN95).
Anchor-based and distribution-based-methods were used to determine the MCID, and on that basis the participants were divided into two categories: worse/no worse functionality at the second assessment point. Triangulation was used to examine multiple values from different approaches to converge on a single value, with Cohen’s k (range from − 1 to 1, with one indicating a perfect agreement).
Out of the original 2942 participants who completed the baseline evaluation (Fig. 1), 2455 (83%) agreed to undergo a follow-up evaluation 12–18 months later. It was not possible to re-evaluate 487 participants (16.6% of the baseline sample) because they had died, were untraceable, or declined to participate. The non-completers were significantly older, more likely to be female, less educated, and predominantly Italian with respect to the completers.
Since information from the German group was incomplete, data from that country (n = 336, 14% of 2455), were not analysed. The data that was analysed and upon which our results are therefore based represents 1866 participants who completed both of the study’s evaluations. One thousand, eight hundred forty-two of these had paired AUSCAN measurements and 1845 had paired WOMAC measurements.
Table 1 (weighted data) shows that approximately 17% of the participants had clinical hand OA and more than 22% had clinical hip/knee OA. The median changes in the physical function scores detected using the AUSCAN hand and WOMAC hip/knee subscales were significant, as were the median changes in the WOMAC stiffness score. There were no significant changes in the AUSCAN pain and stiffness subscales and in the WOMAC pain subscale over time.
All other changes in grip strength, walking-test time, the HADS scales, and the EQ-5D-5 L were significant over time.
Table 2 shows the correlation coefficient in the change in the AUSCAN and the WOMAC physical function scores and in the change in the other measures that were considered as possible anchors. For the hand, only the changes in the AUSCAN pain scores were significantly correlated with a coefficient greater than |0.3| (r = 0.31) with the changes in the AUSCAN physical function scores. For the hip/knee, both the changes in the WOMAC pain scores and the changes in the WOMAC stiffness scores were correlated with the changes in the WOMAC physical function score with respectively a correlation coefficient of 0.47 and 0.35 (greater than |0.3|).
The AUSCAN and WOMAC physical function scores showed a very good reliability of coefficient (Cronbach’s α of 0.92 and 0.94 respectively).
AUSCAN physical function estimates of MCID
Using hand pain as an external anchor, the estimates of the MCID in the AUSCAN hand physical function were consistent and equal to one. The only divergent criteria was the Youden index according to which the estimated MCID for the hand was four. Using distribution-based methods, the estimate for significant worsening in the AUSCAN physical function score ranged from 1 to 8.
Based on these cut-offs, the participants were divided into worse vs not worse in functionality 12–18 months after baseline (Fig. 2).
When the percentage of values obtained with distribution-based MCID methods were compared with those produced by anchor-based methods, the two sets agreed most strongly according to Cohen’s k. The first set (k values at approximately 0.97) formed by the ROC D Euclidean distance, the ROC S for equal sensitivity and specificity, and the SRM2, identified approximately 34–35% of the participants with clinically significant physical function decline at the 12–18 month follow-up evaluation. The second set (k values ranging from 0.95 to 1) which was formed by the ROC J Youden index, the SRM5, and the SEM63, uncovered that 24% of the participants had a clinically significant decline. In view of the concordance and the recommendation to privilege the anchor-based methods , we compared the MCID based on the ROC J/SEM63 and the one based on the ROC D/S. Out of the 639 worse participants identified by the ROC D/S criterion, 453 were the same ones identified by ROC J/SEM63. The MCID based on the ROC J/SEM63, which estimated a change of 4 points, was found to be the most reliable criterion to analyse the loss of hand functionality at 12–18 months.
WOMAC physical function estimates of MCID
Using hip/knee pain and stiffness scales as external anchors, the estimates of the MCID of the WOMAC hip/knee physical function were consistent, although the magnitude of the correlations can only be considered moderate. The ROC analysis of the anchor responses to the WOMAC pain and stiffness scales estimated that the MCID was almost always one. Once again, the divergent criteria was the Youden index with stiffness as the anchor that estimated a two point MCID for hip/knee physical function.
Using distribution-based methods, the estimate for significant worsening in the WOMAC physical function score ranges from 1 to 9. Figure 3 shows the percentage of participants who had worse hip/knee functionality 12–18 months after baseline according to the different methods utilized. The MCIDs for hip/knee physical function decline that showed the highest degree of agreement were: those based on the SRM2, those that used the WOMAC pain score as the anchor minimized the Euclidean distance and the equality sensitivity and specificity as well as those that maximized the Youden index (k = 0.94). The SRM2 also agreed with those that, using the WOMAC stiffness score as the anchor, minimized the Euclidean distance and the equality sensitivity and specificity (k = 0.94), or maximized the Youden index (k = 1). These methods identified 30 and 33% of participants with clinically significant hip/knee physical function decline 12–18 months after baseline respectively. Finally, there was a strong agreement (k = 0.89) between the SEM63 and the Youden index with the stiffness score used as an anchor; they respectively detected 26 and 30%.of the participants.
As the highest degree of agreement was found between the Youden index (using ROC with stiffness as the anchor) and the SRM2, the MCID based on these criteria seemed to be the most suitable one to analyse the loss of hip/knee functionality at 12–18 months. Both criteria were consistent in identifying two as the best discriminating WOMAC physical function change cut-off.
While it is true that the AUSCAN and WOMAC scales are the most commonly used clinical tools to manage and monitor OA patients, to our knowledge the MCID for decline picked up by these measures has never been evaluated. Only the study of Angst et al. , which focused on patients with OA of the lower extremities after a rehabilitation intervention, reported a MCID showing worsening in the WOMAC hip/knee physical function of approximately 1.33, based on a scale from 0 to 10. The study’s initial premise that hand and hip/knee physical function would deteriorate significantly over a year’s time was confirmed by our data showing higher AUSCAN and WOMAC physical function scores.
Although the relevance of the MCID approach remains controversial and despite the fact that physical function values can depend on the population being examined, the context, the time and methods used , it remains an important assessment instrument. The magnitude of the MCID was inferior in the participants studied here using the WOMAC instrument with respect to other studies [7,8,9,10,11,12,13,14,15]. As those studies focused on patients before and after interventions, the differences in the magnitude of the MCID might be connected to patient expectations regarding surgical interventions, as compared to non-surgical interventions . Other factors that might explain the differing MCID values could be: the severity of the participant’s baseline health status, the length of the period being examined, the accuracy of the measurement instruments, and the direction in the change in the MCID (i.e. towards improvement or worsening).
Other studies have demonstrated that changes in the AUSCAN and WOMAC physical function scores correlate significantly with changes in other generic, adapted, or performance-based measures used to gauge pain and function in the hand and hip/knee OA [18,19,20,21,22,23,24,25]. Our study did not, however, find any correlations between the changes in the AUSCAN and WOMAC physical function and the changes in other more generic measures such as the Hospital Anxiety and Depression Scale and the European QoL Surveys. The anchors with the strongest correlations were the pain-specific questionnaires (AUSCAN and WOMAC pain subscale), presumably because they are basically measures of pain during physical activities rather than unspecific pain measures .
The estimates of the MCID in both the AUSCAN hand physical function and the WOMAC hip/knee physical function according to the ROC analysis using different anchor responses and criteria, were consistent. The only divergent criteria was the Youden index that overestimated the MCIDs.
But as explained above, besides an anchor-based method, we also used a distribution-based approach to estimate the MCID, given that the two are complementary . Distribution-based approaches, which are based on the statistical characteristics of the samples studied and reliable measures, generated a wide range of different estimates of the MCID for the AUSCAN hand and WOMAC hip/knee physical function that were greater than the anchor-based estimates. Both methods converged to a common result.
While distribution-based estimates are able to furnish supportive information when the change is significant, they do not provide a direct measure of minimum clinically important difference. That is why precedence was shown to the anchor based estimates. Moreover, since the MCIDs estimated using distribution-based methods were greater than the mean change reported 12–18 months after baseline, it is possible that the data from the distribution-based methods provide information about clinical significance but might overestimate the true MCID.
This study has several limitations. First, the changes in outcome measures could hypothetically be associated with baseline levels. Second, there is the possibility that the participants selected did not experience much or any change over the 12–18 month study period. Third, there may be even important differences in the populations studied and in the cut-off values of the MCID physical function decline. Indeed, estimates can differ depending on the instrument, domain, country, and condition, at least for condition-specific measures, and further research is required before the estimates presented here can be generalized to other instruments .
The study’s most important strength was undeniably its large population base: the participants were randomly selected from older community-dwelling European populations. Not only persons with OA but also large numbers of healthy individuals not affected with OA were analysed. The methodology used was the same in all of the countries, and OA was diagnosed in accordance with standardized international guidelines . Our study was based on valid standardized globalized measures (the WOMAC and AUSCAN Indexes) suggested by guidance documents [41, 42], and, in fact, they proved to be quite reliable. The data are longitudinal in nature. Another study strength was that it describes the decision-making process leading to the selection of a single value from a range of different MCID cut offs by comparing the percentages of change scores exceeding the MCID. The process considered the major concordance between those based on anchor-based methods and those based on distribution-based approaches, privileging those based on the former [40, 43, 44], and evaluated the differences in terms of clinical OA.
To conclude, the study shows that the AUSCAN hand and WOMAC hip/knee physical function scores are indeed sensitive to the effects of OA. The data analysed using various health and physical performance measures as external anchors showed that the minimally important decline over 1 year in the AUSCAN and WOMAC physical function scores was four and two points respectively. Further research is required to confirm the robustness of these estimates and to evaluate their temporal consistency and country-dependency.
Australian/Canadian Hand Osteoarthritis hand Index
EN with 90% CI
EN with 95% CI
European Project on OSteoArthritis
- EQ VAS:
Health status using the visual analogue scale
- EQ-5D-5 L:
Health status using five dimensions
Hospital Anxiety and Depression Scales
Minimum Clinically Important Difference
Receiver operating characteristic
Standard Error Measurement
SEM with 63% CI
SEM with 90% CI
SEM with 95% CI
standardized response mean
SRM with Cohen’s threshold 0.20
SRM with Cohen’s threshold 0.50
SRM with Cohen’s threshold 0.80
Western Ontario and McMaster Universities knee and hip Index
Bellamy N, Campbell J, Haraoui B, Gerecz-Simon E, Buchbinder R, Hobby K, et al. Clinimetric properties of the AUSCAN osteoarthritis hand index: an evaluation of reliability, validity and responsiveness. Osteoarthr Cartil. 2002;10:863–9.
Bellamy N, Buchanan WW, Goldsmith CH, Campbell J, Stitt LW. Validation study of WOMAC: a health status instrument for measuring clinically important patient relevant outcomes to antirheumatic drug therapy in patients with osteoarthritis of the hip or knee. J Rheumatol. 1988;15:1833–40.
Roorda L, Jones C, Waltz M, Lankhorst G, Bouter L, van der Eijken JW, et al. Satisfactory cross cultural equivalence of the Dutch WOMAC in patients with hip osteoarthritis waiting for arthroplasty. Ann Rheum Dis. 2004;63:36–42.
van der Pas S, Castell MV, Cooper C, Denkinger M, Dennison EM, Edwards MH, et al. European project on osteoarthritis: design of a six-cohort study on the personal and societal burden of osteoarthritis in an older European population. BMC Musculoskelet Disord. 2013;14:138–48.
Jaeschke R, Singer J, Guyatt GH. Measurement of health status. Ascertaining the minimal clinically important difference. Control Clin Trials. 1989;10:407–15.
King MT. A point of minimal important difference (MID): a critique of terminology and methods. Expert Rev Pharmacoecon Outcomes Res. 2011;11:171–84.
Bellamy N, Hochberg M, Tubach F, Martin-Mola E, Awada H, Bombardier C, et al. Development of multinational definitions of minimal clinically important improvement and patient acceptable symptomatic state in osteoarthritis. Arthritis Care Res (Hoboken). 2015;67:972–80.
Angst F, Aeschlimann A, Michel BA, Stucki G. Minimal clinically important rehabilitation effects in patients with osteoarthritis of the lower extremities. J Rheumatol. 2002;29:131–8.
Angst F, Aeschlimann A, Stucki G. Smallest detectable and minimal clinically important differences of rehabilitation intervention with their implications for required sample sizes using WOMAC and SF-36 quality of life measurement instruments in patients with osteoarthritis of the lower extremities. Arthritis Rheum. 2001;45:384–91.
Quintana JM, Escobar A, Bilbao A, Arostegui I, Lafuente I, Vidaurreta I. Responsiveness and clinically important differences for the WOMAC and SF-36 after hip joint replacement. Osteoarthr Cartil. 2005;13:1076–83.
Tubach F, Ravaud P, Baron G, Falissard B, Logeart I, Bellamy N, et al. Evaluation of clinically relevant changes in patient reported outcomes in knee and hip osteoarthritis: the minimal clinically important improvement. Ann Rheum Dis. 2005;64:29–33.
Escobar A, Quintana JM, Bilbao A, Aróstegui I, Lafuente I, Vidaurreta I. Responsiveness and clinically important differences for the WOMAC and SF-36 after total knee replacement. Osteoarthr Cartil. 2007;15:273–80.
Ornetti P, Dougados M, Paternotte S, Logeart I, Gossec L. Validation of a numerical rating scale to assess functional impairment in hip and knee osteoarthritis: comparison with the WOMAC function scale. Ann Rheum Dis. 2011;70:740–6.
Escobar A, García Pérez L, Herrera-Espiñeira C, Aizpuru F, Sarasqueta C, Gonzalez Sáenz de Tejada M, et al. Total knee replacement; minimal clinically important differences and responders. Osteoarthr Cartil. 2013;21:2006–12.
Clement ND, Bardgett M, Weir D, Holland J, Gerrand C, Deehan DJ. What is the minimum clinically important difference for the WOMAC index after TKA? Clin Orthop Relat Res. 2018;476(10):2005–14.
Zhang Y, Niu J, Kelly-Hayes M, Chaisson CE, Aliabadi P, Felson DT. Prevalence of symptomatic hand osteoarthritis and its impact on functional status among the elderly: the Framingham study. Am J Epidemiol. 2002;156:1021–7.
Dekker J, van Dijk GM, Veenhof C. Risk factors for functional decline in osteoarthritis of the hip or knee. Curr Opin Rheumatol. 2009;21:520–4.
Ostendorf M, van Stel HF, Buskens E, Schrijvers AJ, Marting LN, Verbout AJ, et al. Patient-reported outcome in total hip replacement: a comparison of five instruments of health status. J Bone Joint Surg Br. 2004;86:801–8.
Maly MR, Costigan PA, Olney SJ. Determinants of self-report outcome measures in people with knee osteoarthritis. Arch Phys Med Rehabil. 2006;87:96–104.
Bellamy N, Buchanan WW. A preliminary evaluation of the dimensionality and clinical importance of pain and disability in osteoarthritis of the hip and knee. Clin Rheumatol. 1986;5:231–41.
Barthel HR, Peniston JH, Clark MB, Gold MS, Altman RD. Correlation of pain relief with physical function in hand osteoarthritis: randomized controlled trial post hoc analysis. Arthritis Res Ther. 2010;12:R7.
Louie GH, Ward MM. Association of measured physical performance and demographic and health characteristics with self-reported physical function: implications for the interpretation of self-reported limitations. Health Qual Life Outcomes. 2010;8:84.
Altman RD. Hand Function in Osteoarthritis In: Duruöz MT, editor. Hand function. A practical guide to assessment. New York: Springer Science+Business Media; 2014. p. 63–9.
Liu R, Damman W, Kaptein AA, Rosendaal FR, Kloppenburg M. Coping styles and disability in patients with hand osteoarthritis. Rheumatology (Oxford). 2016;55:411–8.
van Dijk GM, Veenhof C, Spreeuwenberg P, Coene N, Burger BJ, van Schaardenburg D, et al. Prognosis of limitations in activities in osteoarthritis of the hip or knee: a 3-year cohort study. Arch Phys Med Rehabil. 2010;91:58–66.
Roberts HC, Denison HJ, Martin HJ, Patel HP, Syddall H, Cooper C, et al. A review of the measurement of grip strength in clinical and epidemiological studies: towards a standardised approach. Age Ageing. 2011;40:423–9.
Zigmond A, Snaith R. The hospital anxiety and depression scale. Acta Psychiatr Scand. 1983;67:361–70.
Brooks R, Rabin R, De CF. The measurement and valuation of health status using EQ-5D: a European perspective: Kluwer Academic Publishers; 2003. https://www.springer.com/gp/book/9781402012143.
Altman RD. Classification of disease: osteoarthritis. Semin Arthritis Rheum. 1991;20(Suppl 2):40–7.
Zhang W, Doherty M, Leeb BF, Alekseeva L, Arden NK, Bijlsma JW, et al. EULAR evidence-based recommendations for the diagnosis of hand osteoarthritis: report of a task force of ESCISIT. Ann Rheum Dis. 2009;68:8–17.
Terwee CB, Bot SD, de Boer MR, van der Windt DA, Knol DL, Dekker J, et al. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007;60:34–42.
Revicki D, Hays RD, Cella D, Sloan J. Recommended methods for determining responsiveness and minimally important differences for patient-reported outcomes. J Clin Epidemiol. 2008;61:102–9.
Perkins NJ, Schisterman EF. The Youden index and the optimal cut-point corrected for measurement error. Biom J. 2005;47:428–41.
Stucki G, Liang MH, Fossel AH, Katz JN. Relative responsiveness of condition-specific and generic health status measures in degenerative lumbar spinal stenosis. J Clin Epidemiol. 1995;48:1369–78.
Cohen J. Statistical power analysis for the behavioral sciences. New York: Academic Press; 1977.
Wyrwich KW, Tierney WM, Wolinsky FD. Further evidence supporting an SEM-based criterion for identifying meaningful intra-individual changes in health-related quality of life. J Clin Epidemiol. 1999;52:861–73.
Speer DC. Clinically significant change: Jacobson and Truax (1991) revisited. J Consult Clin Psychol. 1992;60:402–8 Erratum in: J Consult Clin Psychol. 1993;61:27.
Devji T, Guyatt GH, Lytvyn L, Brignardello-Petersen R, Foroutan F, Sadeghirad B, Buchbinder R, Poolman RW, Harris IA, Carrasco-Labra A, Siemieniuk RAC, Vandvik PO. Application of minimal important differences in degenerative knee disease outcomes: a systematic review and case study to inform BMJ rapid recommendations. BMJ Open. 2017;7(5):e015587. https://doi.org/10.1136/bmjopen-2016-015587.
Gandek B. Measurement properties of the Western Ontario and McMaster universities osteoarthritis index: a systematic review. Arthritis Care Res (Hoboken). 2015;67:216–29.
Crosby RD, Kolotkin RL, Williams GR. Defining clinically meaningful change in health-related quality of life. J Clin Epidemiol. 2003;56:395–407.
Altman R, Brandt K, Hochberg M, Moskowitz R, Bellamy N, Bloch DA, et al. Design and conduct of clinical trials in patients with osteoarthritis: recommendations from a task force of the osteoarthritis research society. Results from a workshop. Osteoarthr Cartil. 1996;4:217–43.
Maheu E, Altman RD, Bloch DA, Doherty M, Hochberg M, Mannoni A, et al. Design and conduct of clinical trials in patients with osteoarthritis of the hand: recommendations from a task force of the osteoarthritis research society international. Osteoarthr Cartil. 2006;14:303–22.
Wells G, Beaton D, Shea B, Boers M, Simon L, Strand V, et al. Minimal clinically important differences: review of methods. J Rheumatol. 2001;28:406–12.
Wells G, Anderson J, Beaton D, Bellamy N, Boers M, Bombardier C, et al. Minimal clinically important difference module: summary, recommendations, and research agenda. J Rheumatol. 2001;28:452–4.
The authors would like to acknowledge the study’s primary contributor, Thorsten Nikolaus, MD, a researcher who worked at the Bethesda Geriatric Clinic of the University of Ulm, Germany and died in September of 2013.
We would also like to express our thanks to all of the individuals who participated in any and all ways in the EPOSA study.
Appreciation is also expressed to Linda Inverso Moretti for her assistance in editing the manuscript.
EPOSA Research Group
Nikolaus T, Peter R, Denkinger MD, Herbolsheimer F, Maggi S, Zambon S, Limongi F, Noale M, Siviero P, Deeg DJ, van der Pas S, Schaap LA, van Schoor NM, Timmermans EJ, Otero A, Castell MV, Sanchez-Martinez M, Quieipo R, Pedersen NL, Broumandi R, Dennison EM, Cooper C, Edwards MH, Parsons C.
This work was supported by a non-commercial private funder.
The Indicators for Monitoring COPD and Asthma - Activity and Function in the Elderly in Ulm study (IMCA - ActiFE) was supported by the European Union  and the Ministry of Science, Baden-Württemberg. The Italian cohort study is part of the National Research Council Project on Aging (PNR). The Longitudinal Aging Study Amsterdam (LASA) is financially supported by the Dutch Ministry of Health, Welfare and Sports. The Peñagrande study was partially supported by the National Fund for Health Research (Fondo de Investigaciones en Salud) of Spain [FIS PI 05/1898; FIS RETICEF RD06/0013/1013 and FIS PS09/02143]. The Swedish Twin Registry is supported in part by the Swedish Ministry of Higher Education. The Hertfordshire Cohort Study is funded by the Medical Research Council of Great Britain, Arthritis Research UK, the British Heart Foundation and the International Osteoporosis Foundation.
Availability of data and materials
The data that support the findings of this study are available from https://www.eposa.org/ but restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. Data are however available from the authors upon reasonable request and with permission of the EPOSA Research Group.
Ethics approval and consent to participate
All the participants gave written informed consent and the study design and procedures, outlined in detail elsewhere , were granted approval by local ethics committees (Germany: Universitat Ulm Ethikkommission [312/08]. Italy: Comitato Etico Provinciale Treviso [XLIV-RSA/AULSS7]. The Netherlands: Medisch Ethische Toetsingscommissie Vrije Universiteit Amsterdam [2002/141]. Spain: Comité Ético de Investigación Clínica del Hospital Universitario La Paz Madrid [PI-1080]. Sweden: Till forskningsetikkommittén vid Karolinska Instituted Stockholm [00–132]. UK: Hertfordshire Research Ethics Committee [10/H0311/59]).
Consent for publication
CC: consultancy, lecture fees and honoraria from Alliance for Better Bone Health, Amgen, Eli Lilly, GSK, Medtronic, Merck, Novartis, Pfizer, Roche, Servier, Takeda, and UCB (less than $ 10,000 each).
ED: speaking fees from Eli Lilly (less than $ 10,000).
No competing interests were reported by the other authors.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.