Skip to main content

Estimating individual health-related quality of life changes in low back pain patients

Abstract

Background

There is a need to evaluate different options for estimating individual change in health-related quality of life for patients with low back pain.

Methods

Secondary analysis of data collected at baseline and 6 weeks later in a randomized trial of 749 adults with low back pain receiving usual medical care (UMC) or UMC plus chiropractic care at a small hospital at a military training site or two large military medical centers. The mean age was 31; 76% were male and 67% were White. The study participants completed the Patient-Reported Outcomes Measurement Information System (PROMIS®)-29 v 1.0 physical function, pain interference, pain intensity, fatigue, sleep disturbance, depression, anxiety, satisfaction with participation in social roles, physical summary, and mental health summary scores (T-scored with mean = 50 and standard deviation (SD) = 10 in the U.S. general population).

Results

Reliability estimates at the baseline ranged from 0.700 to 0.969. Six-week test–retest intraclass correlation estimates were substantially lower than these estimates: the median test–retest intraclass correlation for the two-way mixed-effects model was 0. 532. Restricting the test–retest reliability estimates to the subset who reported they were about the same as at baseline on a retrospective rating of change item increased the median test–retest reliability to 0.686. The amount of individual change that was statistically significant varied by how reliability was estimated, and which SD was used. The smallest change needed was found when internal consistency reliability and the SD at baseline were used. When these values were used, the amount of change needed to be statistically significant (p < .05) at the individual level ranged from 3.33 (mental health summary scale) to 12.30 (pain intensity item) T-score points.

Conclusions

We recommend that in research studies estimates of the magnitude of individual change needed for statistical significance be provided for multiple reliability and standard deviation estimates. Whenever possible, patients should be classified based on whether they 1) improved significantly and perceived they got better, 2) improved significantly but did not perceive they were better, 3) did not improve significantly but felt they got better, or 4) did not improve significantly or report getting better.

Peer Review reports

Background

Patient-reported outcome measures provide essential information about the effects of interventions on functioning and well-being [1]. The importance of supplementing group-level mean differences with estimates of responders to treatment is increasingly recognized [2]. The reliable change index (RCI) is most often used to evaluate individual change from one time point (e.g., baseline) to a follow-up [3]: individual change/(\(\sqrt2\;SEM)\), where SEM (standard error of measurement) is: \(SD \sqrt{1-{reliability}}\), and SD is the standard deviation. However, the reliability and SD can be estimated in different ways that effect the estimated RCI and classification of whether an individual has gotten worse, stayed the same, or gotten better.

For simple-summated scales, reliability can be estimated as internal consistency reliability [4] or test–retest reliability [5]. For a measure that is a weighted combination of scale scores (i.e., a weighted composite), reliability can be estimated using Mosier’s formula [6] or test–retest reliability. Test–retest reliability can be estimated using either two-way mixed effects or random effects analysis of variance [7]. The mixed effects formula is (MSbetween – MSinteraction)/MSbetween, where MSbetween is the mean square between respondents and MSiinteraction is the mean square for the interaction of respondents and timepoint (test, retest). The random effects model is N (MSbetween – MSinteraction)/(N MSbetween, + MStime – MSinteraction), where N is the number of respondents and MStime is the mean square for the main effect of the timepoint. Qin et al. [8] argued for using a "two-way mixed effect" ANOVA with interaction for absolute agreement that is equivalent to the two-way random effects model. The intraclass correlation variant of these formulas yields the estimated reliability for a single assessment.

The SD at baseline or the SD of change can be used in the RCI denominator. The choice is analogous to different denominators for responsiveness to group-level change indices [9]. The SD of change within-subjects [10] is perhaps the most consistent epistemologically with evaluating individual change. The SD of change can be estimated from the baseline and follow-up SD and the correlation between baseline and follow-up [11]:

$${SD}_{change}=\sqrt{{SD}_{baseline }^{2}{+ SD}_{followup}^{2}-(2 \times Corr \times {SD}_{baseline} \times {SD}_{followup})}$$

Significant individual change can also be estimated by using “typical error” for the standard error estimate: SDchange/\(\sqrt{2}\) [12].

In summary, multiple possible reliability and SD estimates can be used in estimating individual change. Researchers and clinicians need to understand how the choice of reliability and SD estimates impacts the classification of individual change based on the RCI. We compare different ways of estimating significant individual change for the Patient-Reported Outcomes Measurement Information System (PROMIS®)-29 v1.0 profile instrument using data from a longitudinal study of U.S. service members with low back pain [13].

Methods

This is a secondary analysis of data collected at a small hospital in a military training site (Naval Hospital in Pensacola, Florida) and two large military medical centers in major metropolitan areas: 1) Walter Reed National Military Medical Center in Bethesda, Maryland; and 2) Naval Medical Center in San Diego, California. Study participants were randomized to usual medical care (UMC) or UMC plus chiropractic care. The active treatment period for the study was 6 weeks which served as the primary end point for the study outcomes. The clinical trial did not dictate the care to be delivered. Care was determined by the patient and their clinician. Participants in the UMC group were asked to refrain from seeking chiropractic care during the 6-week treatment period.

The PROMIS-29 v1.0 [14] was administered at baseline and 6-weeks later. It includes a single pain intensity item and 7 multi-item scales with 4 items each (physical function, pain interference, fatigue, sleep disturbance, depression, anxiety, satisfaction with participation in social roles). In addition, a pain composite (combination of pain intensity and pain interference), emotional distress composite (combination of depression and anxiety), physical health summary score, and mental health summary score can be estimated [15]. Extensive support for the reliability and validity of the PROMIS-29 profile measure has been published [14, 16, 17]. Statistically significant mean differences favoring UMC plus chiropractic care over UMC alone on all PROMIS®-29 v1.0 scales were previously reported [18]. All PROMIS®-29 v1.0 scale scores were estimated using existing calibrations (T-score metric: mean: 50, SD: 10 in U.S. general population).

A retrospective rating of change in pain was administered at the 6-week post-baseline assessment: “Compared to your first visit, your low back pain is: much worse, a little worse, about the same, a little better, moderately better, much better, or completely gone?” This item was used to identify patients who perceived that their low back had not changed during these 6 weeks.

Analysis plan

We computed internal consistency reliability [4] for the multi-item scales, Mosier’s [6] reliability estimate for the PROMIS®-29 v1.0 physical and mental health summary scores, and test–retest (intraclass) correlations using analysis of variance [5]. We estimated the SD at baseline for the UMC group (SD1) and for the subset of the UMC group that reported they were about the same at 6 weeks compared to baseline (SD1*). In addition, we estimated the SD of change between baseline and 6 weeks for the UMC group (SD2) and the subgroup of the sample that reported at 6 weeks that they were about the same as at baseline (SD2*). Finally, we estimated the SD of change within subjects (SD3).

We estimate the magnitude of individual change between baseline and 6 weeks later needed to be significant at p < 0.05 using the coefficient of repeatability (CR). The CR is a re-expression of the RCI and is also known as the minimally detectable change, smallest real difference, or the smallest detectable change: CR for p < 0.05: 1.96 \(\sqrt{2}\) SEM. We compare six different estimates of the CR: 1) CR1 (based on internal consistency reliability and SD1); 2) CR2 (based on internal consistency reliability and SD1*); 3) CR3 (based on random effects test–retest intraclass correlations and SD2); 4) CR4 (based on random effects test–retest intraclass correlations and SD2*); 5) CR5 (based on the SD of change within subjects) and 6) CR6 (based on the typical error method). Table 1 provides the six CR formulas. These CRs cover all the relevant possibilities of SDs and reliability estimates.

Table 1 Coefficient of repeatability formulas for p-value < 0.05

Results

The average age of the 749 study participants was 31; 76% were male and 67% White. Most participants reported low back pain for more than 3 months (chronic low back pain, 51%), 38% had acute and 11% had subacute low back pain.

Internal consistency and weighted composite reliability estimates ranged from 0.700 to 0.969 (Table 2). Six-week test–retest intraclass correlation estimates were substantially lower than these estimates. The median test–retest reliability estimate for the two-way mixed effects model was 0.532 and ranged from 0.359 (pain composite) to 0.647 (emotional distress composite) in the UMC group overall. The estimated median test–retest reliability was 0.686 and reliabilities ranged from 0.550 (satisfaction with participation in social roles) to 0.765 (physical health summary) within the subset of the sample who reported they were about the same compared to baseline on the retrospective rating of change item. The test–retest reliability estimates based on the random effects model were similar but tended to be a little lower than those based on the mixed effects model.

Table 2 Reliability of PROMIS-29 v. 1.0 scales

Table 3 provides the SD and CR estimates. The smallest SDs were found for the standard deviation of change within the subgroup that reported they didn’t change from baseline to 6 weeks later (SD3). The smallest CRs tended to be those derived from SD1 in combination with internal consistency reliability estimates (CR1). These smallest CRs ranged from 3.33 (mental health summary scale) to 12.30 (pain intensity item).

Table 3 Coefficient of Repeatability (CR) using different reliability and standard deviation estimates

Discussion

This study shows varying estimates of the CR when using different ways of estimating reliability and the SD. The smallest CR was obtained when internal consistency reliability and the SD at baseline for the UMC sample were used. The different SDs used to evaluate individual change are analogous to options for estimating responsiveness of measures to group-level change [19]. Responsiveness indices include group mean change in the numerator and the same SDs examined in this study for the denominator: effect size uses SD1, the standardized response mean uses SD2, and the responsiveness statistic uses SD2*. These results provide concrete information that the way that the RCI and CR are estimated impacts whether an individual is deemed to have stayed the same or changed over time on patient-reported outcome measures.

While some have suggested that test–retest reliability and the SD of change provide the cleanest estimates for use in evaluating within change from baseline to follow-up, there are practical challenges in using them. Reeve et al. [20]:

“noted practical concerns regarding test–retest reliability, primarily that some populations studied in PCOR are not stable and that their HRQOL can fluctuate. This phenomenon would reduce estimates of test–retest reliability, making the PRO measure look unreliable when it may be accurately detecting changes over time. In addition, memory effects will positively influence the test–retest reliability when the two survey points are scheduled close to each other.”

But the impact of different reliability and SD estimates on the CR depends on the context. Test–retest reliability estimates were all below the 0.90 threshold for use of measures to assess individuals [7]. These were likely underestimates of reliability because of the 6-week interval between assessments in a sample of individuals with chronic back pain. Future studies are needed that use shorter intervals of time for test–retest estimates. Caution is warranted in generalizing from a sample of active-duty members of the U.S. military. Further comparison of the SD alternatives is needed in other samples and with different measures. It also may be informative to assess the same issues with different individual change indices such as the standard error of prediction (SEP), which uses the (\({SD}_{1} \sqrt{1-{reliability}^{2}}\)) in the denominator [21]. In addition, future studies should consider using item response theory standard error estimates rather than one reliability estimate applied to every individual [22].

Significant individual change is conceptually different from group-level estimates of the minimally important change (MIC) for patient-reported outcome measures. Classifying individuals as changed using MIC estimates is inappropriate and results in overly optimistic estimates of responders to treatment [2]. However, concerns about the seemingly large amount of individual change needed to be significant at p < 0.05 have been raised [23, 24]. Lower levels of confidence may be appropriate to monitor short-term change when a trend is expected to continue over time [25]. Donaldson [23] suggested that a less stringent confidence interval than 95% could be used to classify people as likely having changed or staying the same on a patient-reported outcome measure. Doing this results in a smaller CR and a test of significance that is more sensitive but less specific to perceived change by patients. In this study CR1 was smaller than CR2 (Table 3). Sensitivity to retrospectively reported improvement in low back pain (a little better, moderately better, much better, or completely gone) was higher and specificity lower for CR1 than CR2. For example, with the physical function scale the sensitivity of CR1 to retrospective reports of improvement was 46% compared to 29% for CR2 but the specificity of CR1 to reported improvement was 85% versus 98% for CR2.

In addition to whether change is statistically significant, where the individual is at follow-up may be important in clinical practice. That is, the focus could be on bringing the patient to the normal range of a clinical parameter. For example, a clinician might focus on whether their therapy takes someone who starts with hypertension to within the normal range. Similarly, for patient-reported outcomes, a clinician might be interested in whether the patient who is clinically depressed at baseline is no longer depressed at follow-up.

Conclusions

We recommend that the sensitivity of results be evaluated for different reliability and SD estimates in research studies evaluating individual change. For assessing whether individuals have changed in clinical practice, we suggest clinicians estimate significant individual change for simple summated scales using CR1 (internal consistency reliability and the SD at baseline). If possible, they should also ask individuals at follow-up if they have changed. Having information about significant individual change on the patient-reported outcome measure and the individual’s perception of whether they have changed, the clinician can classify an individual patient as: 1) improved significantly and perceived they got better (i.e., reported their low back pain was a little better, moderately better, much better, or completely gone), 2) improved significantly but did not perceive they were better (i.e., reported their low back pain was about the same, a little worse, or much worse), 3) did not improve significantly but perceived they got better, and 4) did not improve significantly and did not perceive they were better.

Availability of data and materials

This secondary analysis uses an existing dataset. The study was registered at ClinicalTrials.gov (NCT01692275). Requests for the data can be directed to: Patricia Herman, ND, PhD, RAND Corporation, 1776 Main Street, Santa Monica, CA 90401 (pherman@rand.org).

Abbreviations

Alpha:

Internal consistency reliability

CR:

Coefficient of repeatability

CR1 :

Coefficient of repeatability (CR) based on internal consistency reliability and SD1

CR2 :

Coefficient of repeatability (CR) based on internal consistency reliability and SD1*

CR3 :

CR based on random effects test–retest intraclass correlation and SD2

CR4 :

CR based on random effects test–retest intraclass correlation and SD2*

CR5 :

CR based on SD3

CR6 :

CR based on 1.96 * SD2* /\(\sqrt{2 }\)

ICC:

Intraclass correlation

MSbetween :

Mean square between respondents

MSnteraction :

Mean square for the interaction of respondents and timepoint (test, retest)

MStime :

Mean square for main effect of timepoint

N:

The number of respondents

PROMIS:

Patient-Reported Outcomes Measurement Information System

RCI:

Reliable change index

SD:

Standard deviation

SD1 :

Standard deviation at baseline for the usual medical care group

SD1* :

SD baseline for subgroup of the usual medical care group reporting “about the same” on retrospective change item

SD2 :

Standard deviation of change for the usual medical care group

SD2* :

Standard deviation of change for subgroup of the usual medical care group reporting “about the same” on retrospective change item

SD3 :

Standard deviation of change within subjects for subgroup of the usual medical care group reporting “about the same” on retrospective change item

SEM:

Standard error of measurement

UMC:

Usual medical care

References

  1. Alcantara DC, Nazarenko AL, Ohm J, Alcantara J. The use of the patient reported outcomes measurement information system and the RAND VSQ9 to measure the quality of life and visit-specific satisfaction of pregnant patients under chiropractic care utilization the Webster technique. J Altern Complement Med. 2018;24(1):90–8. https://doi.org/10.1089/acm.2017.0162.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Hays RD, Peipert JD. Between-group minimally important change versus individual treatment responders. Qual Life Res. 2021;30(10):2765–72. https://doi.org/10.1007/s11136-021-02897-z.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Jacobson NS, Truax P. Clinical significance: a statistical approach to defining meaningful change in psychotherapy research. J Consult Clin Psychol. 1991;59(1):12–9. https://doi.org/10.1037//0022-006x.59.1.12.

    Article  PubMed  CAS  Google Scholar 

  4. Cronbach LJ. Coefficient alpha and the internal structure of tests. Psychometrika. 1951;16:297–334. https://doi.org/10.1007/BF02310555.

    Article  Google Scholar 

  5. Shrout PE, Fleiss JL. Intraclass correlations: uses in assessing rater reliability. Psychol Bull. 1979;86(2):420–8. https://doi.org/10.1037//0033-2909.86.2.420.

    Article  PubMed  CAS  Google Scholar 

  6. Mosier CI. On the reliability of a weighted composite. Psychometrika. 1943;8:161–8. https://doi.org/10.1007/BF02288700.

    Article  Google Scholar 

  7. Hays RD, Hubble D, Jenkins F, Fraser A, Carew B. Methodological and statistical considerations for the National Children’s Study. Front Pediatr. 2021;20(9):595059. https://doi.org/10.3389/fped.2021.595059.

    Article  Google Scholar 

  8. Qin S, Nelson L, McLeod L, Eremenco S, Coons SJ. Assessing test-retest reliability of patient-reported outcome measures using intraclass correlation coefficients: recommendations for selecting and documenting the analytical formula. Qual Life Res. 2019;28(4):1029–33. https://doi.org/10.1007/s11136-018-2076-0.

    Article  PubMed  Google Scholar 

  9. Guyatt G, Walter S, Norman G. Measuring change over time: assessing the usefulness of evaluative instruments. J Chronic Dis. 1987;40(2):171–8. https://doi.org/10.1016/0021-9681(87)90069-5.

    Article  PubMed  CAS  Google Scholar 

  10. Bland JM, Altman DG. Measurement error. British Medical Journal. 1996;29;312(7047):1654. https://doi.org/10.1136/bmj.312.7047.1654. Corrected and republished in: BMJ. 1996;21;313(7059):744.

  11. Higgins JPT, Li T, Deeks JJ. Choosing effect measures and computing estimates of effect. In: Higgins JPT, Thomas J, Chandler J, Cumpston M, Li T, Page MJ, Welch VA, editors. Cochrane handbook for systematic reviews of interventions Version 6.3 2022. Available from: www.training.cochrane.org/handbook.

  12. Swinton PA, Hemingway BS, Saunders B, Gualano B, Dolan E. A statistical framework to interpret individual response to intervention: paving the way for personalized nutrition and exercise prescription. Front Nutr. 2018;28(5):41. https://doi.org/10.3389/fnut.2018.00041.

    Article  Google Scholar 

  13. Goertz CM, Long CR, Vining RD, Pohlman KA, Kane B, Corber L, et al. Assessment of chiropractic treatment for active duty, U.S. military personnel with low back pain: study protocol for a randomized controlled trial. Trials. 2016;17:70. https://doi.org/10.1186/s13063-016-1193-8.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  14. Cella D, Choi SW, Condon DM, et al. PROMIS® adult health profiles: efficient short-form measures of seven health domains. Value in Health. 2019;22(5):537–44. https://doi.org/10.1016/j.jval.2019.02.004.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Hays RD, Spritzer KL, Schalet BD, Cella D. PROMIS®-29 v2.0 profile physical and mental health summary scores. Qual Life Res. 2018;27(7):1885–91. https://doi.org/10.1007/s11136-018-1842-3.

    Article  PubMed  PubMed Central  Google Scholar 

  16. McMullen K, Bamer A, Ryan CM, McMullen K, Barner A, Ryan CM, et al. Validation of PROMIS-29 domain scores among adult burn survivors: a national institute on disability, independent living, and rehabilitation research burn model system study. J Trauma Acute Care Surg. 2022;92(1):213–22. https://doi.org/10.1097/TA.0000000000003365.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Pennings JS, Khan I, Davidson CA, Freitag R, Bydon M, Asher AL, et al. Using PROMIS-29 to predict Neck Disability Index (NDI) scores using a national sample of cervical spine surgery patients. Spine Journal. 2020;20(8):1305–15. https://doi.org/10.1016/j.spinee.2020.04.028.

    Article  Google Scholar 

  18. Hays RD, Shannon ZK, Long CR, Spritzer KL, Vining RD, Coulter ID, et al. Health-related quality of life among United States service members with low back pain receiving usual care plus chiropractic care plus usual care vs usual care alone: secondary outcomes of a pragmatic clinical trial. Pain Med. 2022;23(9):1550–9. https://doi.org/10.1093/pm/pnac009.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Revicki DA, Cella D, Hays RD, Sloan JA, Lenderking WR, Aaronson NK. Responsiveness and minimal important differences for patient reported outcomes. Health Qual Life Outcomes. 2006;4:70. https://doi.org/10.1186/1477-7525-4-70.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Reeve BB, Wyrwich KW, Wu AW, Velikova G, Terwee CB, Snyder CF, et al. ISOQOL recommends minimum standards for patient-reported outcome measures used in patient-centered outcomes and comparative effectiveness research. Qual Life Res. 2013;22(8):1889–905. https://doi.org/10.1007/s11136-012-0344-y.

    Article  PubMed  Google Scholar 

  21. Hays RD, Slaughter ME, Spritzer KL, Herman PM. Assessing the significance of change in two samples of patients in treatment for low back pain using five different statistical indicators. J Manipulative Physiol Ther. 2022;44:699–706.

    Article  PubMed Central  Google Scholar 

  22. Hays RD, Spritzer KL, Reise SP. Using item response theory to identify responders to treatment: examples with the Patient-Reported Outcomes Measurement Information System (PROMIS®) physical function scale and emotional distress composite. Psychometrika. 2021;86(3):781–92. https://doi.org/10.1007/s11336-021-09774-1.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Donaldson G. Patient-reported outcomes and the mandate of measurement. Qual Life Res. 2008;17(10):1303–13. https://doi.org/10.1007/s11136-008-9408-4.

    Article  PubMed  Google Scholar 

  24. Hopkins WG. Measures of reliability in sports medicine and science. Sports Med. 2000;30(1):1–15. https://doi.org/10.2165/00007256-200030010-00001.

    Article  PubMed  CAS  Google Scholar 

  25. Peipert JD, Hays RD, Cella D. Likely change indexes improve estimates of individual change on patient-reported outcomes. Qual Life Res. 2022. epub. https://doi.org/10.1007/s11136-022-03200-4.

Download references

Acknowledgements

We thank Ian Coulter and Christine Goertz for providing access to the dataset analyzed in the paper. We are indebted to the active-duty US military personnel with low back pain who participated in this study.

Funding

This secondary analysis was supported by the National Center for Complementary and Integrative Health (NCCIH). Grant No. 1R01AT010402-01A1. NCCIH had no role in the design; data collection, analysis, or interpretation; or writing of this manuscript.

Author information

Authors and Affiliations

Authors

Contributions

RDH conceived and conducted the analyses and drafted the paper.  SPR and PMH provided feedback on the analyses and the draft.  RDH finalized the paper.

Corresponding author

Correspondence to Ron D. Hays.

Ethics declarations

Ethics approval and consent to participate

All methods were carried out in accordance with the Declaration of Helsinki. The trial was pre-registered in ClinicalTrials.gov (NCT01692275), and oversight was provided by an independent data and safety monitoring committee. Written informed consent was given by all study participants. The study was approved by each participating institution’s institutional review board: Palmer College of Chiropractic (#2010G137), RAND Corporation (#2010–0782), NMCSD (#NMCSD.2012.0022, IRB of record: Naval Medical Center San Diego, California), NHP (#NHPC.2012.0002, IRB of record: Naval Medical Center Portsmouth, Virginia), and WRNMMC (#369462, IRB of record: Walter Reed National Military Medical Center Bethesda, Maryland). The study protocol was also approved by the U.S. Army Medical Research and Material Command Human Research Protection Office and the Clinical Investigation Regulatory Office. All study investigators completed training in the protection of human subjects as required by the respective collaborating institutions.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hays, R.D., Reise, S.P. & Herman, P.M. Estimating individual health-related quality of life changes in low back pain patients. BMC Musculoskelet Disord 24, 961 (2023). https://doi.org/10.1186/s12891-023-07093-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12891-023-07093-3

Keywords