Skip to main content

Development of a brief core set for knee dysfunction based on the International Classification of Functioning, Disability and Health: assessing construct validity and measurement potential

Abstract

Background

The comprehensive core set for knee dysfunction was developed to classify the functioning of people with any knee dysfunction. To be used as a clinical instrument to measure the functioning of people with knee dysfunction, the construct validity of the core set still needs to be assessed. The purpose of this study was to analyze the construct validity of the comprehensive core set for knee dysfunction as an instrument to measure functioning.

Methods

A cross-sectional study with 200 participants with knee dysfunction with or without clinical diagnosis of knee pathology, with or without complaint of pain, with or without instability, and/or with or without knee movement restriction of any type. Participants were assessed using the comprehensive core set for knee dysfunction with 25 categories, the subjective form from the International Knee Documentation Committee scale, and measures of self-perceived general health and functioning. The construct validity of the core set was assessed by Rasch analysis, and the external construct validity was assessed by correlation between the score of the brief core set for knee dysfunction with the subjective form from the International Knee Documentation Committee scale, and scores of self-perception of health and functioning.

Results

Twelve categories were consistent with a unidimensional construct, with no difference in the response pattern for age, sex, educational level, and time of complaint. These categories were included in the brief core set for knee dysfunction. The mean score of the brief core set was 37 ± 21 points, a value classified as moderate impairment regarding functioning. Correlations with the subjective form from the International Knee Documentation Committee scale and scores of self-perception were adequate (p < 0.01; r > 0.5).

Conclusion

The brief core set for knee dysfunction, a set with 12 categories, can be used as a clinical instrument to measure and score the functioning of people with knee dysfunction, aged between 18 and 89 years, with adequate construct validity.

Peer Review reports

Background

Knee dysfunction is a complex interaction of multiple factors [1]. Knee dysfunction can be due to a pathological process at the knee joint [1], or changes in body functions, activities, and social participation [2]. Previous injuries or knee structure changes are not always observed [2]. Since knee dysfunction can lead to limitations and disabilities that negatively impact quality of life and functioning during activities of daily living [3], it is important to assess the functioning of patients using an instrument with a comprehensive perspective, as recommended by the International Classification of Functioning, Disability and Health (ICF).

The ICF was developed to broaden the perspective on people’s functioning, including components such as body functions and structures, activities, participation, environmental, and personal factors [4]. To increase the viability and operational use of the ICF, the core sets project was created [5, 6]. The purpose of a core set is to establish a selection of the ICF categories that describe the functioning of a person in a specific health condition or in a specific care context [6]. This selection and subsequent reduction in the number of categories in the core sets, as opposed to the use of the entire ICF for the assessment of a health condition, encourages clinical use [5, 6].

A preliminary core set for knee dysfunction with 24 categories capable of assessing the functioning and health of people with knee dysfunction was created based on the opinion of patients and then using a statistical model by regression analyses [7]. Afterwards, a panel of experts participated in the content validity assessment of this preliminary core set [8]. After a 4-round Delphi, 25 categories showed adequate content validity to compose the comprehensive core set for knee dysfunction [8]. This comprehensive core set indicates the main ICF categories that should be considered when assessing patients with knee dysfunction [8].

The ICF is based on a conceptual model in which functioning is an umbrella term for body functions and structures as well as for activities and participation [4, 9, 10]. A clinical measure of functioning based on the integration of information obtained from an ICF core set can be considered a unidimensional model [9,10,11]. Rasch analysis allows this unidimensional analysis by checking the data variance, which is the sum of the squares of the values predicted by the analysis around their central values [12]. Rasch analysis can also test whether the order of qualifiers in the comprehensive core set for knee dysfunction is adequate to represent the degree of knee dysfunction [13]. In addition, the consistency between factors such as age, sex, educational level, and time of complaint in the core set [14] allows comparing the functioning in different contexts and populations. Therefore, if the categories of the comprehensive core set for knee dysfunction reflect a unidimensional instrument for assessing body functions and structures, activities and participation, and environmental factors and if the categories are consistent in populations, well-directed, and not redundant, this core set can become a clinical instrument to measure the functioning of people with any knee dysfunction.

Thus, the primary objective of this study was to analyze the construct validity of the comprehensive core set for knee dysfunction as a potential instrument for measuring functioning. The secondary objective was to verify whether there is a difference in the construct validity of the comprehensive core set for knee dysfunction when applied during an interview (conducted by a person trained in the use of the ICF) or self-administered.

Methods

Study design

Cross-sectional study to assess construct validity. All methods were carried out in accordance with relevant guidelines and regulations.

Setting

Face-to-face data collection with an interview was carried out at all levels of care. The interview settings were: Primary Health Care Unit of Bonsucesso (Guarapuava, Paraná), Physical Therapy School of Universidade Estadual do Centro-Oeste (Guarapuava, Paraná), Center for Excellence in Clinical Research in Physical Therapy at Universidade Cidade de São Paulo (São Paulo, São Paulo), and São Vicente de Paulo Charity Hospital (Guarapuava, Paraná). Due to the Covid-19 pandemic, which resulted in social distancing, remote assessments were carried out with a self-administered questionnaire in all levels of care and in several cities in Brazil.

Participants

Participants were aged 18 years or older and had subjective complaints of pain, instability, and/or knee movement restriction of any type. Clinical diagnosis of knee pathology and undergoing treatment were not mandatory for inclusion, as well as having prior surgery was not considered for exclusion. Exclusion criteria were lower limb amputation and congenital and/or acquired malformation distal to the knee. Eligibility was assessed by a standardized set of questions.

Two hundred urban participants were recruited and assessed: 100 were recruited by verbal invitation at the study settings for face-to-face interview assessments, and 100 were identified in social networks of associations of physiotherapists and groups of professionals involved with knee rehabilitation, and in the general community for the remote assessments with a self-administered questionnaire. The assessments were conducted from October 2019 to June 2020.

Variables and measurement

Sample characteristics were assessed with a questionnaire including questions on sociodemographic details and clinical conditions. After that, the comprehensive core set for knee dysfunction was applied using the description of each category as a question and the qualifiers as answer options. The comprehensive core set for knee dysfunction had 25 categories, including 11 categories from the component body functions, three representing body structures, 10 activities and participation, and one environmental factor [8]. The qualifiers for the components body functions, body structures, and activities and participation had five possible answers, ranging from 0 to 4, with 0 being no problem and 4 being a major problem. Additional qualifiers for the body structures and activities and participation components were not used. The qualifiers for the environmental factors component had nine possible answers, ranging from facilitators (+ 1 to + 4, mild facilitator to complete facilitator, respectively) to barriers (1 to 4, mild barrier to complete barrier, respectively) or no barriers/no facilitators (qualifier 0). For all components, qualifier 8 meaning “not specific” and qualifier 9 meaning “not applicable” could also be used [4].

The subjective knee assessment form of the International Knee Documentation Committee (IKDC) [15] was also applied. The IKDC had 10 questions that evaluate symptoms, sports activities, and functioning. The total score ranged from 0 to 100, in which 100 indicated that the patient had no limitations with daily living or sports activities, as well as the absence of symptoms [16].

Finally, to assess self-perception of general health and functioning, patient answered two questions: one on general health status and one on functioning, using a numerical scale from 0 to 10 as answers, in which 0 meant poor general health and/or functioning, and 10 meant excellent general health and/or functioning [17].

Procedures

For the face-to-face interviews, patients and their caregivers who met the eligibility criteria were approached and assessed. Clinical and resident physiotherapists previously trained in the use of the ICF conducted the interview by applying the sociodemographic and clinical questionnaire and the measurement instruments (comprehensive core set for knee dysfunction [8], IKDC [16], and self-perception of general health and functioning [17]). For remote assessments, the same questionnaires and instruments were sent via an online form for the participants recruited via social networks. The online form was assessed through a link and had an initial invitation text with the eligibility criteria to participate in the study and the informed consent form. Additionally, the form was answered by the participants themselves without an interview.

Statistical analysis

Descriptive statistics were used to characterize the sample. The sociodemographic and clinical characteristics of the participants assessed by a face-to-face interview and a self-administered questionnaire were compared. For continuous data with normal distribution, the t-test for unpaired data was applied. For data with non-normal distribution, the Mann-Whitney test was applied. For categorical data, the chi-square test was applied. For these analyses, Statistical Package for the Social Sciences version 23.0 was used, and p < 0.05 was considered significant.

Rasch model was used to examine whether the categories of the comprehensive core set for knee dysfunction constitute a unidimensional instrument with adequate construct validity to assess the functioning of people with knee dysfunction. For this analysis, Winsteps version 4.5.5 was used.

For all components, the qualifiers ranged from 0 to 4. To make this ordinal scale feasible for the environmental factors component, the qualifiers that indicated a facilitator were recoded as the qualifier 0 (no barriers). As qualifiers 8 and 9 were not integrated into the ordinal scale of qualifiers, they were considered as missing values, because Rasch model allowed analysis with missing values [18, 19].

The construct validity for the components body functions and structures, activities and participation, and environmental factors was assessed using Rasch model for polytomous categories. The Partial Credit Model was applied, which was valid without the assumption of equidistance between limits between the categories of the ICF [20]. For this study, the variable “person” was the study participant, while the variable “item” was the ICF category.

Initially, the separation coefficient between the participants and the categories was calculated. This coefficient is analogous to Cronbach’s alpha, a statistical calculation that determines the scale’s reliability level [21]. In addition, the separation coefficient between the participants produces a less misleading estimate, because Cronbach’s alpha always exceeds the maximum possible value for the reliability of the measures resulting from Rasch model [12].

The two primary fit statistics for Rasch model were considered: the information-weighted (or inlier-sensitive) fit statistic (infit), which was more sensitive to responses in the categories close to the participants’ functioning levels, and the outlier sensitive fit statistic (outfit) which was more influenced by responses in the categories that were lighter or more severe than the participant’s functioning [22, 23]. Thus, these statistical calculations quantified residues (MnSq) for the items in relation to the model [12, 23]. The ideal adjustment value for MnSq infit or MnSq outfit was 1, with values between 0.50 and 1.50 being considered acceptable [12]. MnSq values greater than 1.5 indicated that the qualifiers in these categories were unpredictable or erratic. In other words, unexpectedly participants with lower severity were identified with greater severity in one category or vice versa. This indicated that the category did not match the other categories to define a continuum as far as functioning is concerned or that there were problems in its definition, requiring revision. In contrast, MnSq values less than 0.5 indicated little variability of qualifiers in that category, that is, the response pattern was predictable or deterministic. The first result represented a major threat to construct validity; the second indicated that the item did not differentiate people with different functioning levels, contributing little to the definition of the construct [24].

To assess the unidimensional characteristics of the core set for knee dysfunction, we verified whether the first Rasch dimension explained most of the data variance, considering the functioning classified by the ICF. To guarantee the unidimensional characteristics of the scale, the variance explained by the measure in the first dimension should be greater than 50% (R2 > 0.50) [12]. Correlations between residues of items with r above 0.30 were also assessed, which indicated local dependence and possible violations of unidimensional characteristics [25].

The scale for measures of functioning and condition severity of the ICF categories was a scale of logits, which ranged from − 3.00 to + 3.00 [23]. The estimates of difficulty of each item in Rasch model had their average set at 0, with negative values indicating that the category was easier (representing a less intense level of the construct continuum) than the average of the set of categories and conversely, a category with a positive logit was more difficult than the set average [23]. Thus, the greater the participant’s functioning impairment was, the greater their likelihood of receiving high qualifiers in all categories, easy or difficult. The easier the category was, the more likely the person was to receive a low qualifier. When all items in a test met these expectations, it meant that the test fitted the model, and the likelihood was that participants with more functioning had lower scores than those with less functioning [24].

A pattern of uniform responses when considering variation in age, sex, educational level, and time of complaint was also assessed using analyses of Differential Item Functioning (DIF). The bias in the items of the instruments, known as DIF, occurred when there was a difference in the probability of response to an item for two individuals belonging to different population groups (for example, men and women), but who had the same level in the continuum of the measure [26]. Thus, the presence of DIF was indicative that the variation in the qualifiers was not only influenced by the construct investigated by the instrument, but also by a second characteristic, which interacted with the property that was the target of the evaluation [27]. DIF was an undesirable aspect because it could produce differences in meaning between the scores of different population groups, even when there were no real differences, underestimating or overestimating the assessed category [28]. DIF was assessed by analyzing the variance of each category, thus comparing the scores according to age, sex, educational level, and time of complaint [14]. For these analyses, age and time of complaint were dichotomized below and above the mean and median, respectively. The educational level was also dichotomized into below and above 11 years of study, considering high school completion as a cut-off point. We compared the differences in the distribution of responses in each category between groups (face-to-face and remote assessment). The Mantel-Haenszel chi-square was used to test the statistical significance of this difference. We opted for a conservative significance level of p < 0.01, due to the large number of statistical tests without previous hypotheses, which increased the chance of type I error. The categories that presented adequate results in all steps of the analysis were included in the brief core set for knee dysfunction, which could be considered as a measuring tool.

Finally, the external validity of the construct was determined by the correlation of the brief core set for knee dysfunction with the IKDC and with the measures of self-perception of general health and functioning. To that end, we created a score calculated by adding the qualifier scores and dividing it by the number of categories answered and multiplying the result by 25. Thus, the total score varied from 0 to 100, allowing the percentage of the ICF qualifiers to be used as a reference [4]. Scores ranging from 0 to 4 indicated that there was no impairment in functioning, scores from 5 to 24 indicated mild impairment of functioning, scores from 25 to 49 indicated moderate impairment in functioning, scores from 50 to 95 indicated severe impairment in functioning, and scores above 96 indicated total impairment in functioning. Data with normal distribution were analyzed using Pearson’s correlation coefficient and data with non-normal distribution were analyzed using Spearman’s correlation coefficient. Given that the IKDC and the self-perception of functioning measured a construct similar to the core set, a negative correlation with IKDC ≥ 0.50 was considered adequate, and a positive correlation with the measure of self-perception of functioning ≥ 0.50 was considered adequate. For the measure of self-perception of general health, a negative correlation with the brief core set for knee dysfunction between 0.30 and 0.50 was considered appropriate, as they measure different, but related constructs [29].

Results

Table 1 presents the clinical and sociodemographic characteristics of the patients. As the groups showed significant differences in most variables (except sex, knee as the most affected condition, knee surgery, and time of surgery), we decided to analyze the groups separately.

Table 1 Sample characteristics

Initially, we analyzed the construct validity of the comprehensive core set for knee dysfunction applied face-to-face via interview. After analysis of the MnSq infit and MnSq outfit, 11 categories were excluded (Appendix A). The categories b134 “sleep functions”, b260 “proprioceptive function”, b710 “mobility of joint functions”, b715 “stability of joint functions”, b760 “control of voluntary movement functions”, s7500 “structure of thigh”, s7502 “structure of ankle and foot”, d240 “handling stress and other psychological demands”, d540 “dressing”, and e150 “design, construction and building products and technology of buildings for public use” presented an unpredictability problem (MsSq infit and outfit MnSq > 1.5). Regarding the predictability of the categories (infit and outfit MnSq < 0.5), only the category d455 “moving around” was excluded. No residual correlations were observed. The categories that remained in the core set presented an adjustment of the separation reliability coefficient between participants of 0.88, which indicated high reliability. In addition, data variance explained by the measure greater than 50% was found in the first Rasch dimension (55.5%), a criterion that satisfied the unidimensional characteristic of the scale. The categories that remained in the core set and were submitted to DIF analysis are shown in Table 2.

Table 2 Estimates of difficulty and adjustment to Rasch model of the comprehensive core set for knee dysfunction applied face-to-face via interview conducted by a clinical or resident physiotherapist trained in the use of ICF

The analysis of the response pattern of the face-to-face via interview assessments, considering the variation of age, sex, educational level, and time of complaint through DIF analyses, is shown in Table 3. Two categories showed a difference in the probability of response and were excluded from the comprehensive core set for knee dysfunction: b770 “gait pattern functions” and d450 “walking”. Thus, the brief core set for knee dysfunction with 12 categories to be applied face-to-face via interview has adequate construct validity.

Table 3 Differential Item Functioning of the categories of the comprehensive core set for knee dysfunction applied face-to-face via interview conducted by a clinical or resident physiotherapist trained in the use of ICF that did not present adjustment problem, Mantel-Haenszel Chi-square statistics (p value)

The construct validity of the comprehensive version of the remote self-administered core set for knee dysfunction was also analyzed. After the MnSq infit and MnSq outfit analysis, only the category s7502 “structure of ankle and foot” was excluded from the comprehensive core set for knee dysfunction due to an unpredictability problem (MnSq outfit > 1.5). The e150 “design, construction and building products and technology of buildings for public use” category was also excluded for showing a measure of the participants’ functioning and difficulty of the categories (logits) above 3 (Appendix B).

Despite the removal of these two categories, the core set presented a unidimensional characteristic problem, as only 42.9% of the data variance was explained by the measure. Thus, possible residual correlations (r > 0.30) were assessed. Two correlations were identified: between categories b235 “vestibular functions” (balance) and b760 “control of voluntary movement functions” (r = 0.51) and between categories b260 “proprioceptive function” and b760 “control of voluntary movement functions” (r = 0.46). Even after removing these categories, either alone or in combination, the comprehensive core set for knee dysfunction with a remote self-administered questionnaire still presented a unidimensional characteristic problem, because in none of the analyses the variance explained by the measure was above 43.7%, hindering the subsequent analyses.

Considering the results of the analyses separated by group, we decided not to carry out the analysis with the entire sample (n = 200), because the results of the analyses for the remote self-administered questionnaire were not adequate to build a measurement tool. Thus, 12 categories of the core set for knee dysfunction applied face-to-face via interview showed adequate results to compose a measurement tool. These 12 categories were considered the brief core set for knee dysfunction. Five categories belong to the body functions component, one category to the body structures component, and six categories to the activities and participation component (Table 3). This version was subjected to the external construct validity test. The qualifier scores for each participant were added up, and the total was divided by the number of categories answered and multiplied by 25. The mean score was 37 points and the standard deviation was 21 points, classified as moderate impairment in functioning, and the minimum and maximum values were 0 and 85, respectively. Adequate and significant correlations (p < 0.01) were observed in all analyses. The correlation with the IKDC was − 0.75 (95% confidence interval [95% CI]: -0.83 to -0.65), with the self-perception of functioning was − 0.66 (95% CI: -0.76 to -0.52), and with the self-perception of general health was − 0.60 (95% CI: -0.71 to -0.45). The brief core set for knee dysfunction with instructions for application and scoring is presented in the Appendix C.

Discussion

This study assessed the construct validity of the comprehensive core set for knee dysfunction. The core set with a remote self-administered questionnaire did not show adequate results and should not be used. When the comprehensive core set for knee dysfunction was applied face-to-face via interview by a researcher, 12 of the 25 categories showed adequate consistency with a unidimensional construct, free of DIF for age, sex, educational level, and time of complaint, and can be used to assess functioning. Furthermore, we built an accurate and useful measurement tool based on the comprehensive core set for knee dysfunction, which included not only the symptoms and affected structure, but the body functions and activities and social participation.

The proposal for a new core set is a process with many steps that involves the participation of patients and health professionals [6]. For this reason, we decided to validate a general core set that can be used in various clinical conditions of the knee and not a core set based on a specific condition. Therefore, functioning and disability are the focus of assessment and not a particular condition. In addition, the clinical implementation of the ICF depends on the development of practical tools that allow not only classification but measurement of functioning [30].

This study used psychometric and statistical analysis to develop a stable and generalizable tool. The comprehensive core set for knee dysfunction, with 25 categories, remains an applicable instrument for clinical practice and research, as it indicates the main categories that should be considered in an assessment. The current findings suggest that the brief core set for knee dysfunction, with 12 categories, indicates which categories should be included in the calculation of a score. The other 13 categories provide extra information about the patient’s functioning, but do not add measurement accuracy. This variation in the number of categories that should be used to calculate a score was also observed in the core set for breast cancer, which has 43 categories for classification and, according to Rasch analysis, only 30 can be used as a measurement tool [31] to calculate a score.

The brief core set was based on a Rasch model that provides an interval scale and calibrates the functioning of a participant and the limitation in a category in the same analysis. Thus, through Rasch analysis, the brief core set was developed from the categories of the comprehensive core set for knee dysfunction that met all the premises of the analysis [14], based exclusively on the criteria defined for Rasch model. Although it is possible to maintain a marginally adequate or even slightly inadequate category in the analysis of construct validity for a tool, even if it affects statistical properties such as item adjustment or limit order [19], in our study we did not extrapolate the values defined for Rasch analysis. Other studies have included categories with adjustment problems during the assessment of construct validity, but authors did not justify the reason for this decision [25, 31].

The brief core set for knee dysfunction is unidimensional, indicating that a participant’s response to a category was explained by their own functional level in that category and not by other factors [32]. All categories from the brief core set for knee dysfunction were included adhering to a single construct: functioning. The brief core set also showed high scores for the separation reliability coefficient between participants and categories, which point to a tool with high reliability.

Although the ICF recognizes environmental factors as a component of the multidimensional assessment of people with some dysfunction [33], the only category in the comprehensive core set for knee dysfunction (e150 “design, construction and building products and technology of buildings for public use”) was removed from the brief version as it had an adjustment problem. For this component, an adjustment of the qualifiers to maintain the ordinal scale of the qualifiers from the other components was needed. This fact may have influenced the results because all qualifiers that indicated a facilitator were recoded as no barrier (qualifier 0). The difficulty in assessing the construct of environmental factors has already been reported in another study [14], in which authors reported that the nine levels of response do not represent a cumulative measure of the impact of these environmental factors, and it is necessary to re-code the qualifier of this component. Furthermore, there is still no consensus on the best strategy for this process of re-coding environmental factors [14, 25, 31].

Other core sets have also had construct validity assessed using Rasch analysis, such as the core sets for osteoarthritis [14], low back pain [25], breast cancer [31], and stroke [10]. The core set for generalized chronic pain had its construct validity assessed for patients with fibromyalgia [19]. One study on chronic obstructive pulmonary disease reported the construct validity of only one component of the core set [34], which had adequate construct validity for the activities and participation component. In the core set for low back pain, a self-applicable version of the activities and participation component was proposed [35], which presented adequate construct validity through Rasch analysis. Although Rasch analysis is used on a recurring basis to assess the construct validity of the ICF, there is a divergence between studies regarding the parameters adopted for the analysis, as well as for the results of the analysis that must be presented. In this study, we chose to present the measurement values (logit), infit MnSq and outfit MnSq of all categories and not just the means of the results of these analyses.

The Covid-19 pandemic brought about social isolation measures, thus limiting and subsequently preventing face-to-face evaluations via interviews. As a result, the evaluations with a remote self-administered questionnaire were carried out, and the participants rated their own functioning. It was expected that the core set could be used in this context of social restriction, however, the results were unfavorable. In the analysis of the data with a remote self-administered questionnaire, a unidimensional characteristic problem was observed, which compromised the results of the total sample. A possible explanation for this result may be how categories were presented to participants, because the online forms in which the core set categories were sent included the description of categories and what they included, without any additional information. As the ICF is written in scientific and medical terms, it may be necessary to adjust some domains to make the language easier to understand [31]. One study has already suggested developing self-administered questionnaires from a core set [36]. However, the difficulty in converting the ICF categories into a questionnaire has already been reported in the study investigating the construct validity of the core set for breast cancer [31]. In addition, a cross-cultural adaptation may be necessary to increase the understanding of categories [37]. The results of this study also highlight the importance of a trained professional in using the ICF to conduct the assessment.

As a strength of the present study, a clinical instrument to measure the functioning of people with any knee dysfunction was developed, considering the multiple aspects of functioning recommended by the ICF. As we proposed a measurement tool, a weakness of the study was the failure to evaluate other measurement properties for the instrument to have all the scientifically required parameters. The results of this study demonstrated the potential of the brief core set for knee dysfunction as a scale. Future studies can evaluate other measurement properties of the brief core set for knee dysfunction, such as inter- and intra-rater reliability and responsiveness, in addition to checking for possible ceiling and floor effects.

Regarding the limitations of this study, first, the sample was limited to only two cities in Brazil. Therefore, different results could be found in other countries or regions. Although patients were asked about other comorbidities, we did not assess whether these comorbidities impacted functioning. Furthermore, lifestyle habits and the use of orthoses were not recorded. Regarding educational level, other studies [14, 25] conducted the analysis considering the average schooling time in the country. However, in our study, educational level was assessed as a categorical variable, not allowing analysis by the average schooling time of the Brazilian population, which is 9.4 years [38]. Finally, the face-to-face sample may not represent all people with knee dysfunction, because they are relatively less educated, older, and retired. Although the study identifies which categories can be used as a measurement tool, scores from qualifiers can be considered provisional until repeated evidence in larger samples supports the current interpretation.

Conclusion

From the comprehensive core set for knee dysfunction, 12 categories showed adequate construct validity to compose the brief core set for knee dysfunction to be applied face-to-face via interview by a professional. These 12 categories cover body functions and structures and activities and participation and can be used to measure the functioning of people with any knee dysfunction, aged between 18 and 89 years.

Data availability

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

Abbreviations

ICF:

International Classification of Functioning, Disability and Health

IKDC:

International Knee Documentation Committee

Infit:

Information-weighted (or inlier-sensitive) fit statistic

Outfit:

Outlier sensitive fit statistic

MnSq:

Mean-square

DIF:

Differential item functioning

References

  1. Reiman MP, Bolgla LA, Lorenz D. Hip functions influence on knee dysfunction: a proximal link to a distal problem. J Sport Rehabil. 2009;18(1):33–46.

    Article  PubMed  Google Scholar 

  2. Urquhart DM, Phyomaung PP, Dubowitz J, Fernando S, Wluka AE, Raajmaakers P, Wang Y, Cicuttini FM. Are cognitive and behavioural factors associated with knee pain? A systematic review. Semin Arthritis Rheum. 2015;44(4):445–55.

    Article  PubMed  Google Scholar 

  3. Alkan BM, Fidan F, Tosun A, Ardicoglu O. Quality of life and self-reported disability in patients with knee osteoarthritis. Mod Rheumatol. 2014;24(1):166–71.

    Article  PubMed  Google Scholar 

  4. World Health Organization. ICF - international classification of Functioning, disability and health. Geneva: World Health Organization; 2001.

    Google Scholar 

  5. Stucki G. G Grimby 2004 Applying the ICF in medicine. J Rehabil Med 44 Suppl 5–6.

    Article  Google Scholar 

  6. Selb M, Escorpizo R, Kostanjsek N, Stucki G, Ustun B, Cieza A. A guide on how to develop an international classification of Functioning, disability and Health Core Set. Eur J Phys Rehabil Med. 2015;51(1):105–17.

    CAS  PubMed  Google Scholar 

  7. Fréz AR, Alouche SR, Binda AC, Vieira GW, Bueno BAM, Cabral CMN. Development of a core set for knee dysfunction based on the International Classification of Functioning, disability and health: a cross-sectional study. Arch Phys Med Rehabil. 2021;102(4):571–81.

    Article  PubMed  Google Scholar 

  8. Fréz AR, Alouche SR, Binda AC, Nunes Cabral CM. Content validity of the International Classification of Functioning, disability and health core set for knee dysfunction: a Delphi study. Physiother Theory Pract. 2024;40(1):110–7.

  9. Cieza A, Hilfiker R, Chatterji S, Kostanjsek N, Ustün BT, Stucki G. The International classification of Functioning, disability, and Health could be used to measure functioning. J Clin Epidemiol. 2009;62(9):899–911.

    Article  PubMed  Google Scholar 

  10. Algurén B, Bostan C, Christensson L, Fridlund B, Cieza A. A multidisciplinary cross-cultural measurement of functioning after stroke: Rasch analysis of the brief ICF core set for stroke. Top Stroke Rehabil. 2011;18(Suppl 1):573–86.

    Article  PubMed  Google Scholar 

  11. Cieza A, Hilfiker R, Boonen A, van der Heijde D, Braun J, Stucki G. Towards an ICF-based clinical measure of functioning in people with ankylosing spondylitis: a methodological exploration. Disabil Rehabil. 2009;31(7):528–37.

    Article  CAS  PubMed  Google Scholar 

  12. A user’s guide to Winsteps. Minisep Rasch-model computer programs [https://www.winsteps.com/a/Winsteps-Manual.pdf].

  13. Svensson E. Guidelines to statistical evaluation of data from rating scales and questionnaires. J Rehabil Med. 2001;33(1):47–8.

    Article  CAS  PubMed  Google Scholar 

  14. Kurtaiş Y, Oztuna D, Küçükdeveci AA, Kutlay S, Hafiz M, Tennant A. Reliability, construct validity and measurement potential of the ICF comprehensive core set for osteoarthritis. BMC Musculoskelet Disord. 2011;12:255.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Rodriguez-Merchan EC. Knee instruments and rating scales designed to measure outcomes. J Orthop Traumatol. 2012;13(1):1–6.

    Article  PubMed  PubMed Central  Google Scholar 

  16. Irrgang JJ, Anderson AF, Boland AL, Harner CD, Kurosaka M, Neyret P, Richmond JC, Shelborne KD. Development and validation of the international knee documentation committee subjective knee form. Am J Sports Med. 2001;29(5):600–13.

    Article  CAS  PubMed  Google Scholar 

  17. Xie F, Lo NN, Lee HP, Cieza A, Li SC. Validation of the International Classification of Functioning, disability, and Health (ICF) brief core set for osteoarthritis. Scand J Rheumatol. 2008;37(6):450–61.

    Article  CAS  PubMed  Google Scholar 

  18. Andrich D, Luo G. Conditional pairwise estimation in the Rasch model for ordered response categories using principal components. J Appl Meas. 2003;4(3):205–21.

    PubMed  Google Scholar 

  19. Prodinger B, Salzberger T, Stucki G, Stamm T, Cieza A. Measuring functioning in people with fibromyalgia (FM) based on the international classification of functioning, disability and health (ICF)--a psychometric analysis. Pain Pract. 2012;12(4):255–65.

    Article  PubMed  Google Scholar 

  20. Masters GN. A Rasch model for partial credit scoring. Psychometrika. 1982;47(2):149–74.

    Article  Google Scholar 

  21. Linacre J, Wright B. Winsteps: multiple-choice, rating scale, na partial credit Rasch analysis. Chicago: MESA; 2000.

    Google Scholar 

  22. Chien CW, Bond TG. Measurement properties of fine motor scale of Peabody developmental motor scales-second edition: a Rasch analysis. Am J Phys Med Rehabil. 2009;88(5):376–86.

    Article  PubMed  Google Scholar 

  23. Bond TG, Fox CM. Applying the Rasch model: fundamental measurement in the human sciences. 2nd ed. New York: Routledge; 2007.

    Google Scholar 

  24. Wright BD, Linacre JM. Observations are always ordinal; measurements, however, must be interval. Arch Phys Med Rehabil. 1989;70(12):857–60.

    CAS  PubMed  Google Scholar 

  25. Røe C, Sveen U, Geyh S, Cieza A, Bautz-Holter E. Construct dimensionality and properties of the categories in the ICF Core Set for low back pain. J Rehabil Med. 2009;41(6):429–37.

    Article  PubMed  Google Scholar 

  26. Embretson SE, Reise SP. Item response theory for psychologists. Mahway: NJ: Lawrence Erlbaum Associates; 2000.

    Google Scholar 

  27. Fidalgo AM, Scalon JD. Using mantel-haenszel methods for detecting differential item functioning. Psicologia Reflexão E Crítica. 2012;25(1):60.

    Article  Google Scholar 

  28. Hauck Filho N, Teixeira MAP. Análise de rasch de uma medida de problemas decorrentes do beber em estudantes universitários. Psico. 2013;44(2):3.

    Google Scholar 

  29. Prinsen CAC, Mokkink LB, Bouter LM, Alonso J, Patrick DL, de Vet HCW, Terwee CB. COSMIN guideline for systematic reviews of patient-reported outcome measures. Qual Life Res. 2018;27(5):1147–57.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Stucki G, Kostanjsek N, Ustün B, Cieza A. ICF-based classification and measurement of functioning. Eur J Phys Rehabil Med. 2008;44(3):315–28.

    CAS  PubMed  Google Scholar 

  31. Yang EJ, Shin EK, Shin HI, Lim JY. Psychometric properties of scale constructed from the International Classification of Functioning, disability and health (ICF) core set for breast cancer based on Rasch analysis. Support Care Cancer. 2014;22(10):2839–49.

    Article  PubMed  Google Scholar 

  32. Cheng KK, Lee J, Leung SF, Liang RH, Tai JW, Yeung RM, Thompson DR. Use of Rasch analysis in the evaluation of the oropharyngeal mucositis quality of life scale. Nurs Res. 2011;60(4):256–63.

    Article  PubMed  Google Scholar 

  33. Alvarelhao J, Silva A, Martins A, Queiros A, Amaro A, Rocha N, Lains J. Comparing the content of instruments assessing environmental factors using the International classification of Functioning, disability and health. J Rehabil Med. 2012;44(1):1–6.

    Article  PubMed  Google Scholar 

  34. Guo C, Liu Y, Hao S, Xie L, Xiang G, Wu Y, Li S. The reliability and validity of the activity and participation component in the brief ICF core set for chronic obstructive pulmonary diseases based on Rasch analysis. Int J Chron Obstruct Pulmon Dis. 2020;15:1191–8.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Bagraith KS, Strong J, Meredith PJ, McPhail SM. Rasch analysis supported the construct validity of self-report measures of activity and participation derived from patient ratings of the ICF low back pain core set. J Clin Epidemiol. 2017;84:161–72.

    Article  PubMed  Google Scholar 

  36. Rogers SN, Forgie S, Lowe D, Precious L, Haran S, Tschiesner U. Development of the International Classification of Functioning, Disability and Health as a brief head and neck cancer patient questionnaire. Int J Oral Maxillofac Surg. 2010;39(10):975–82.

    Article  CAS  PubMed  Google Scholar 

  37. Tschiesner U, Sabariego C, Linseisen E, Becker S, Stier-Jarmer M, Cieza A, Harreus U. Priorities of head and neck cancer patients: a patient survey based on the brief ICF core set for HNC. Eur Arch Otorhinolaryngol. 2013;270(12):3133–42.

    Article  PubMed  Google Scholar 

  38. Pesquisa Nacional por Amostra de Domicílios Contínua - Educação. 2019 [https://biblioteca.ibge.gov.br/index.php/biblioteca-catalogo?view=detalhes&id=2101736].

Download references

Acknowledgements

Not applicable.

Funding

Not applicable.

Author information

Authors and Affiliations

Authors

Contributions

ARF, BBP, and ACB performed patient recruitment and data collection. ARF, GRC, and CMNC performed data analysis and interpretation. ARF and ACB performed the literature review and primary manuscript preparation. ARF, ACB, and CMNC assisted with initial drafting of the manuscript, as well as editing and final draft preparation. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Andersom Ricardo Fréz.

Ethics declarations

Ethics approval and consent to participate

All methods were carried out in accordance with relevant guidelines and regulations. The study was approved by the Research Ethics Committee of Universidade Estadual do Centro-Oeste (process number 2.008.701 and 4.845.845). All eligible participants received clear information and those who agreed to participate in the study signed an informed consent form.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Appendix A

Appendix B

Appendix C

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Fréz, A.R., Coelho, G.R., de Barros Pereira, B. et al. Development of a brief core set for knee dysfunction based on the International Classification of Functioning, Disability and Health: assessing construct validity and measurement potential. BMC Musculoskelet Disord 25, 512 (2024). https://doi.org/10.1186/s12891-024-07635-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12891-024-07635-3

Keywords