- Research article
- Open Access
- Open Peer Review
Predicting the outcome of conservative treatment with physiotherapy in adults with shoulder pain associated with partial-thickness rotator cuff tears – a prognostic model development study
BMC Musculoskeletal Disorders volume 19, Article number: 329 (2018)
Rotator cuff disorders represent the commonest type of painful shoulder complaints in clinical practice. Although conservative treatment including physiotherapy is generally recommended as first-line treatment, little is known about the precise treatment indications for subgroups of rotator cuff disorders, particularly people with shoulder pain associated with partial-thickness tears of the rotator cuff, PTTs: “symptomatic PPTs”. The aim of this study was to develop a prognostic model for predicting the outcome of a phase of conservative treatment primarily with physiotherapy in adults with symptomatic PTTs.
A prospective observational cohort study was conducted in an outpatient setting in Germany. Ten baseline factors were selected to evaluate nine pre-defined multivariable candidate prognostic models (each including between two and nine factors) in a cohort of adults with symptomatic atraumatic PTTs undergoing a three-month phase of conservative treatment primarily with physiotherapy. The primary outcome was change in the Western Ontario Rotator Cuff Index. The models were developed using linear regression and an information-theoretic analysis approach: Akaike’s Information Criterion (AICC).
Eight candidate models were analyzed using data from 61 participants. Two “best models” were identified: smoking & pain catastrophizing and disability & pain catastrophizing. However, none of the models had a satisfactory performance or precision.
We could not determine a prognostic model with satisfactory performance and precision. Further high-quality prognostic model studies with larger samples are needed, but should be underpinned, and thus preceded, by robust research that enhances knowledge of relevant prognostic factors.
DRKS00004462. Registered 08 April 2014; retrospectively registered (prior to the analysis).
Painful shoulder complaints are common musculoskeletal disorders in clinical practice , most being attributed to rotator cuff pathology [2, 3]. Rotator cuff pathology encompasses a range of pathologies from tendinopathy to tears, which may be partial- or full-thickness . Reported rates of symptomatic partial-thickness tears (PTTs), the condition of interest in this study, vary between 7%  and 24%  in shoulder pain populations. Of the four rotator cuff tendons (supraspinatus, infraspinatus, teres minor, subscapularis), the supraspinatus is by far the most often affected , and also usually the first to tear [8, 9]. In order to concisely label the population of interest, we use the term “symptomatic PTT” to describe people with shoulder pain in the presence of a PTT of the rotator cuff.
The clinical presentation of symptomatic PTTs is essentially that of “shoulder impingement” [7, 9, 10]. Verification of a PTT requires diagnostic imaging, commonly ultrasonography (US) or magnetic resonance imaging (MRI) .
Current guidelines for rotator cuff disorders [12, 13] recommend conservative treatment with medical care and physiotherapy as the first-line treatment; surgical intervention being mainly reserved for non-responders. Head-to-head comparisons of conservative and surgical interventions  have overall shown no clinically relevant differences. However, utilisation of surgery for rotator cuff disorders has significantly increased in many countries [15,16,17], with physiotherapy bypassed in some cases . Both unnecessary surgery and ineffective conservative treatment are undesirable. Knowledge about a patient’s likely response to conservative treatment at the point of diagnosis would save time, effort and suffering, limit exposure to the risks of surgery, and inform distribution of resources. “Understanding which patients [with rotator cuff tears] do best with non-operative treatment” has been rated a top “priority scientific research issue” (, p. 10).
The importance of predicting individuals’ responses to particular interventions is increasingly recognized , with a corresponding development in prognosis research methodology [21, 22]. One aspect of prognosis research involves the identification of single, independent factors . However, these are unlikely to predict outcomes satisfactorily. Multivariable prognostic models are better placed as they account for real-life clinical complexities [24, 25]. Estimates of prognosis are highly context-dependent, with relevant contextual factors being existing diagnostic and treatment practices, time and place.
Prognostic model research encompasses three key phases: development including internal validation; external validation; and evaluation of clinical impact . External validation is essential before a model may be usable in practice . While prospective cohort studies are generally considered the preferable design for the initial development of a prognostic model [25,26,27], evaluations of the clinical impact of a prognostic model ultimately require comparative studies.
Our systematic review of the evidence on prognostic models for predicting outcomes in adults undergoing physiotherapy for rotator cuff disorders showed a lack of clinically usable prognostic models and, crucially, of prognostic model research on PTTs . The study’s primary aim was to develop a multivariable prognostic model for the outcome of a phase of conservative treatment with physiotherapy in adults with symptomatic atraumatic PTTs. Secondary aims were to determine the incidence of tear progression and to establish participants’ perceived change of their shoulder complaints over time.
The study was based on an a priori protocol and was approved by the Teesside University School of Health and Social Care Research Governance & Ethics Committee and the Ethics Commission of the Hamburg Medical Council (Germany). It was registered in the German Clinical Trials Register (reg.no DRKS00004462). The study design was informed by the most current methodological guidance available at the time of planning [21, 22]. All deviations from protocol were discussed and recorded prior to implementation ; the only two relevant deviations are flagged up in this section. This report complies with the items required by the TRIPOD (Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis) prediction model development checklist .
Study design, setting and key dates
We conducted a prospective observational single-group cohort study set in Hamburg, Germany. All recruitment and assessments took place in a single-handed medical specialist practice led by one of the authors, AB, an orthopaedic shoulder specialist and DEGUM (German Society for Ultrasonography in Medicine) certified instructor in ultrasonographic shoulder diagnosis. The physiotherapy treatment took place in 24 collaborating physical therapy practices in the broader area of Hamburg. (In our protocol, we initially considered seven collaborating practices, but expanded their number eventually to 24 to improve recruitment). Recruitment took place between December 2012 to September 2014. Follow-up ended in January 2015.
Eligible patients were adults (≥ 18 years) presenting with shoulder pain unrelated to a traumatic event (e.g. an accident) and an ultrasonographically determined PTT who had accepted advice to undergo conservative treatment with physiotherapy (see Table 1 for the full eligibility criteria). These patients typically present with clinical signs of “shoulder impingement”, such as a painful arc or positive “impingement signs” [7, 9, 10]. We additionally determined the presence of a PTT by diagnostic ultrasonography, which is highly specific for detecting PTTs . Our intention was to recruit patients whose shoulder pain could reasonably be linked to the presence of a PTT; however, we acknowledge that the precise link between shoulder pain and the presence of a PTT (similar to other shoulder structures) is unclear . Following standard practice, the assessment involved a structured patient history, physical and ultrasonographic evaluation. The physical evaluation was based on DVSE (German Society for Shoulder and Elbow Surgery) recommendations . The ultrasonographic evaluation followed DEGUM and DGOU (German Society for Orthopaedics and Trauma) standards . An ultrasound unit within the highest DEGUM appliance class was used together with a linear transducer with a resolution of ≥10 MHZ and width of ≥40 mm. Diagnosis of a rotator cuff defect was based on alterations of structure and form, following the criteria of Hedtmann & Fett [35, 36]. In distinction to a PTT, a full-thickness tear (FTT) was marked by the absence of a depiction of the rotator cuff (discontinuity of the cuff).
Participants were followed over three months of standard conservative care with physiotherapy in one of the collaborating practices. Adjunctive medical treatment (e.g. local steroid injections), was delivered by AB where considered appropriate. The physiotherapy treatment followed a broad best-evidence protocol based on two systematic reviews [37, 38]. These reviews provided evidence supporting exercises with or without manual therapy as the first-line approach for treating patients with rotator cuff related shoulder pain including PTTs, but could not provide conclusive guidance on the optimal type or dose of treatment. Since there was no justification for restricting treatment to any specific exercises or manual techniques, the protocol was based on the broad principles that a) exercises, preferably combined with manual techniques (soft tissue and/or joint mobilisation), would be the key treatment components, and b) flexibility of the interventions and in the provision of adjunctive modalities would be allowed. In keeping with the ethos of an observational study, the specific content and amount of treatment were unregulated, i.e. individually advised. Treatment, which included the clinical follow-up appointment at three months to assess progress and need for further treatment, was delivered in compliance with German healthcare regulations and AB’s standard practice. Acceptability of the physiotherapy protocol was confirmed by all collaborating physiotherapy practices. Treatment details were documented in a purpose-designed, piloted report form.
The primary outcome, the outcome to be predicted, was the change in ‘disability’ (disability and health-related quality of life) from baseline to follow-up, measured by a validated German version of the Western Ontario Rotator Cuff Index (WORC) [39, 40]: WORCCHANGE. The WORC has been shown to be a valid, reliable and responsive patient-reported outcome measure (PROM) for use in people with rotator cuff disorders [41, 42]. It comprises 21 questions. Responses are made by putting a mark on a 100 mm visual analogue scale (VAS), with lower scores indicating less disability. Scores range from 0 to 2100 . We adjusted all WORCCHANGE values for Regression to the Mean (RTM) using methods outlined by Linden . Participants completed questionnaires at baseline and at 3 to 4 months, the study endpoint, either at AB’s clinic or at home.
As both the WORC and all prognostic factors were patient-assessed, there was no blinding of participants. Nonetheless, the WORC was completed independently and in the absence of AB and study investigators.
Secondary outcomes were tear progression, defined as the presence (yes or no) of an FTT at follow-up, and participants’ perceived overall change of their shoulder problem, measured by a 7-point Global Perceived Change (GPC) scale (from − 3 = “worse as ever” to + 3 = “completely recovered”). Lastly, physical therapy-related adverse events were monitored.
Inclusion of candidate factors was restricted to factors from the baseline assessment, regardless of their type (e.g. demographic, physical). Selection was done through a systematic, three-stage approach comprising identification of factors, critical assessment of these, and a consensus phase that aimed to select a maximum of 10 factors (see Fig. 1 for an outline of the process; a full account is available in Braun 2016 ([29, Chapter 5]). The process was informed by comprehensive literature searches of several electronic databases, including Medline, Embase and Cinahl, for primary prognostic studies, prognostic systematic reviews and expert consensus studies. We screened overall around 3900 records and identified 23 primary study reports (relating to 22 studies), one systematic review and one expert consensus study as relevant sources for informing the selection of factors for our study (a list of these articles is provided in Additional file 1). We extracted and considered 36 factors altogether (these are listed in Additional file 2, which also shows for each factor whether it was included or excluded and the reasons for exclusion). We assessed the relevance of all factors to the study population and setting, their measurement properties, practicality of use, and their applicability, and excluded those that were either not relevant to the study population and setting, not sufficiently valid and reliable, or not applicable in most clinical settings. We grouped the remaining factors according to the availability of clinical evidence and expert consensus supporting their prognostic relevance; we gave preference to the selection of those factors for which there was reasonably consistent support for their prognostic relevance, either through clinical evidence from several studies, or from both clinical evidence and expert consensus. Notably, there was reasonably consistent evidence of prognostic value from several studies pertaining to clinical outcomes of conservative treatment in adults with rotator cuff disorders for only three factors: age, disability and symptom duration. We finally agreed on 10 factors: age, sex, physical demands, disability, pain, history of shoulder pain, symptom duration, diabetes, smoking and pain catastrophizing. We gave thorough attention to factor definitions and measurements (Table 2). All factors were assessed during the patients’ baseline appointment with AB. Since the study was prospective, the assessment of prognostic factor information was inherently blinded to knowledge about the outcome.
The multivariable nature of prognostic model studies makes it difficult to estimate the required sample size . Indeed, no formal methods (based on either power calculations or adequate precision of estimation of effects) are available to determine the effective sample size, and recommendations for the sample size vary across the literature. Following work by Vittinghoff & McCulloch , we based the minimum sample size of our study on a requirement of 5 to 9 outcome events (events equate to individuals for continuous outcomes) per candidate prognostic factor in relation to the full model (i.e. the model including all 10 factors). As per our protocol, we initially planned to analyze the WORC as a binary outcome variable, but subsequently (and prior to the analysis) decided to analyze it as a continuous variable to avoid the unnecessary loss of information that would have resulted from dichotomization [45, 46]. By analyzing the WORC on a continuous scale, and setting out to study overall 10 factors, which we considered feasible, we aimed to include (5 to 9)*10 = 50 to 90 participants. Increased by 20% to allow for losses to follow-up, the recruitment target was 60 to 108 patients.
Any missing prognostic factor and outcome data were documented. The decision about the method for dealing with missing data, including whether or not to impute any missing data, was made prior to the analysis. We considered the amount and also the potential reasons for missing values, i.e. whether the reasons for missingness appeared systematic or random. We decided to limit the replacement of missing values to those missing for the two multi-item measures, the WORC (baseline and follow-up) and the Pain Catastrophizing Scale (PCS). No standard missing rule was available for the WORC in the literature; therefore, we replaced missing WORC values by the mean of the respective domain. We replaced missing PCS values by the mean of the items that were completed, as suggested by the primary originator of the scale, Prof Michael Sullivan (personal communication 02/06/2014). We did not replace any missing values where the PCS was completely missing. As the information-theoretic analysis approach we used required identical datasets, the data were analyzed on a complete-case basis. We would have considered formal testing of the effects of missing data should the amount have been bigger and should the reasons for missingness have been of concern.
Statistical analysis methods
We intended to include all 10 candidate factors in the prognostic modelling analysis. All continuous factors, WORC and PCS scores, were analyzed as continuous measurements. All non-continuous factors were binary.
We based our analysis on an information-theoretic approach, namely on a small-sample variant of Akaike’s Information Criterion (AIC) approach, AICC . Information-theoretic approaches to model selection differ from other approaches, particularly from the widely used stepwise regression approaches, in several ways. Under the AIC approach, selection is based on the comparison of multiple candidate models, which are pre-specified based on “theory”, rather than on a single global set of factors . Selection is further based on an information-theoretic criterion (e.g. AIC), which provides “numerical values that represent the scientific evidence” for a model, but no “test statistics” such as p values, thus avoiding the application of arbitrary cut-offs of “statistical significance” ( p. 64). Reflecting the perspective that models never reflect “full reality”, i.e. that they are approximations (, p. 27), the AIC value represents an estimator of the information that is inherently lost when a model is used to approximate full reality (Kullback-Leibler information) . The AIC accounts for the number of candidate factors by ‘penalizing’ models with larger numbers of factors, thereby favouring parsimony (, p. 60–1). The model with the lowest AIC value (AICMIN) represents the closest approximation and is accordingly termed the “best model” within a set of models . AIC differences (∆AIC = AIC – AICMIN) can then be calculated to rank the models by their distance to the best model [47, 48]. Burnham et al. (, p. 25) have proposed considering models with ∆AIC values < 4 to 7 as “plausible” alternatives to the best model, whereas models with higher ∆AIC values (> 9) have little to no support. AIC values are relative rather than absolute, and “on the scale of information” (, p. 84). Accordingly, their use is limited to comparing models within a defined set of models . As the AIC approach will always select a best model among a set of models, it has been suggested that the worth of the best or the global (full) model be assessed, e.g. by a goodness-of-fit test, analysis of residuals or the adjusted R2 (the percentage of variance explained) .
Following recommendations from the literature that the number of candidate models should usually be limited to a few , we decided to analyze a selection of nine candidate models. The selection of models was based on clinical and theoretical considerations, with the first model (number 1 in Table 3) including all 10 candidate prognostic factors (thus representing the “full model”). The composition of the other eight models, which included between two to eight of these factors, was based on various characteristics, as shown in Table 3. Examples of characteristics were the potential for modification (model 2) or the effort required for the assessment of prognostic factors (models 5 and 7, inclusion or exclusion of questionnaires), which would be highly relevant to clinical practitioners. The primary analysis approach was a linear regression analysis [26, 49] which we conducted in IBM SPSS Statistics 22. All continuous factors were modelled as linear. Satisfaction of the assumptions of linear regression was assessed visually for each model based on the residual plot (scatterplot of standardized residuals against standardized predicted values) .
We extracted the following statistics: the AICC value; the standard error of the estimate (SEE), as the primary measure of model precision; the adjusted coefficient of (multiple) determination (R2ADJ), as a complementary measure of model performance; the regression constant (Constant); and the unstandardized regression coefficients (B) of all factors with their 95% confidence intervals (CIs). For comparison of the different models, we extracted AICC, ∆AIC and SEE values.
Model validation and further analyses
We intended to compare the SEE of the best model with the estimate of the Minimal Important Difference (MID) of the WORC, which we intended to derive from the sample data, and to internally validate any model with an SEE substantially lower than the MID. We intended to conduct the following exploratory subgroup analyses: amount of physiotherapy (number of sessions); medical treatment (specifically provision of injections); and length of follow-up.
Figure 2 illustrates the flow of participants. Of 82 eligible participants, 70 were included, of whom 65 (representing 65 shoulders) completed the study. The baseline characteristics and prognostic factor information of these 65 participants are presented in Table 4.
The amount of missing data was small: six values (0.4% of all values) were missing for the baseline WORC; 11 (1%) for the follow-up WORC; and six (1%) for the single-item prognostic factors. The PCS was missing completely for three participants; beyond this, only one PCS value (0.1%) was missing. The distribution appeared random, thus non-systematic. Four participants had missing prognostic factor data after replacement of missing WORC and PCS values, and were consequently, in keeping with the need for identical datasets for the AIC approach , excluded from the modelling. The data of 61 participants were analyzed. The mean (SD) interval between completion of the baseline and follow-up WORC (and GPC) was 97 (17) days (n = 65 for WORC, 64 for GPC). The mean (SD) interval between the baseline and follow-up US assessment was 100 (13) days (n = 52).
All participants received conservative treatment with physiotherapy. The mean (SD) number of physiotherapy sessions was 12 (6); and the mean (SD) duration of single sessions was 28 (13) minutes. A breakdown of the physiotherapy treatment content, documented by the physiotherapists, is provided in Table 5. Treatment usually included a combination of exercises and manual techniques. Consistent with physiotherapy practice in Germany, where this study took place, all physiotherapists routinely provided advice and patient education.
Thirty-seven participants (57% of 65) received some supplementary medical treatment: i.e. subacromial steroid injection (27; of these, 24 received one injection and three received two injections), elastic tape (12) or prescription of oral medication (Metamizole, 1). No participant was put on sick leave.
The mean (SD) unadjusted WORCCHANGE score (n = 65) was − 363 (361); the range was − 1248 to 372. The mean (SD) RTM-adjusted WORCCHANGE score was − 363 (341); the range was − 1102 to 387. Tear progression to an FTT occurred in two participants (4%, n = 52). Adverse events were reported for six participants (9%, n = 65), and related exclusively to temporary exacerbations of the shoulder symptoms. Fifty-five participants (86%, n = 64) rated their shoulder problem as improved (positive GPC ratings), five (8%) as unchanged (GPC = 0), and four (6%) as deteriorated (negative GPC ratings). The MID estimate for the WORC, which we derived from the sample data using an anchor-based approach (n = 64), was − 300 (this analysis is reported in a separate article ).
There were no complexities (e.g. unit of analysis issues) in the data. We excluded diabetes from the analysis because of its very low prevalence in the sample (Table 4), and consequently excluded one two-factor model, ‘diabetes & smoking’ (Table 3). The ratio of the number of outcome events (individuals with data available for analysis) to the overall number of analyzed candidate factors approximated to 7 (61/9); the range across all models was, depending on the number of factors included in each model, approximately 7 to 31. The residual plots showed no strong evidence of a violation of the assumptions for linear regression for any of the models.
The key model statistics are shown in Table 2. The coefficient statistics for each model and each prognostic factor are provided with the supplementary materials (Additional file 3). Two models with the same AICC value (models 2 and 5) were identified as the best models. The model with the third-highest AICC value (model 9) had an ∆AICC within the range of plausible alternatives (∆AICC < 7) to the best models . The remaining models had ∆AICC values outside this range. The SEE ranged from 313 to 344, and was, for all models, higher than the estimated MID of the WORC (300). The full model (model 1) had the highest R2ADJ (the range of all models was from − 0.06 to 0.12).
Model validation and further analyses
The performance and precision of the analyzed models did not justify internal validation; nor the planned subgroup analyses.
Despite our rigorous approach and meeting our minimum sample size (relating to the full model), we did not achieve our primary aim of developing a prognostic model for the outcome of a phase of conservative treatment with physiotherapy in adults with symptomatic atraumatic rotator cuff PTTs. Of the eight models for which testing was appropriate, none had a satisfactory performance (R2ADJ) or precision (SEE).
Strengths and weaknesses of the study
The rigorous methodological design of our study helped to avoid various potential sources of bias. This included avoidance of statistical univariable selection techniques, which have been linked to biased predictions , and the analysis of continuous measurements on their continuous scale, hereby avoiding the various problems associated with the categorization of continuous measurements [45, 46]. The latter reflected our post-protocol decision to analyze the WORC on a continuous scale, instead of analyzing it as a binary outcome. By using an information-theoretic analysis approach, we purposely avoided the selection of factors within the multivariable analysis based on arbitrary cut-offs of “statistical significance”, as these, in particular stepwise regression techniques, have been linked to biased predictions [52,53,54]. Although the outcome assessment could not be blinded to the prognostic factor information, any influence of participants’ knowledge about prognostic factor information on the outcome is unlikely because the participants did not know which of the multiple baseline variables were modelled.
The ratio of outcome events to candidate factors was within the pre-specified range of 5 to 9 for the full model (and considerably higher, i.e. > 20, for some of the other models), and losses to follow-up and missing data were few. Additionally, as the reasons for missingness appeared non-systematic, we considered the data from the complete cases as representative of the whole sample. However, despite our meeting our sample size estimate, sample size is a key limitation of our study as indicated by the low precision and also by the rejection of the ‘diabetes & smoking’ model due to the low numbers of diabetic patients recruited. In the absence of any formal methods to determine the effective sample size, and without prior knowledge of the relationship between the candidate prognostic factors, it was difficult to estimate the sample size for our study (please see reviewer feedback on this aspect in Open Peer Review Reports). Considering the low precision of the analyzed models in our study, we conclude that a much larger sample size would have been needed to increase the chances of achieving satisfactory precision of the analyzed models.
Rigour was applied to the consideration of the clinical relevance, practicality of measurement and applicability of the study findings. All PTTs were diagnosed by US, which is highly specific (94%), but less sensitive (68%) for detecting PTTs . This means that, while some PTTs might have been missed, those identified were almost certainly true positives; hence, the study population was homogeneous in this respect. We aimed to enroll patients at a fairly similar state of health. Similarity of several baseline characteristics such as pain intensity, symptom duration and disability could not be guaranteed, as their restriction would have threatened recruitment, but was accounted for by candidate prognostic factors.
The physiotherapy protocol accommodated clinical autonomy within an evidence-based framework. Some of the study participants received adjunctive medical treatment, such as a local steroid injection. Arguably, the different treatments may have had an impact on the overall improvement of the participants during the three- month treatment period and also on the predictive performance of the analyzed models. We are confident, though, that this was not a relevant issue in our study. Consistent with our study question, we selected prognostic factors that were present at baseline before starting conservative treatment. The primary treatment was exercise-based physiotherapy within an evidence-based framework. The adjunctive treatments, which were provided to a minority of participants, included subacromial corticosteroid injections, elastic tapes and oral pain medication. The evidence on the effectiveness of these treatments for rotator cuff related shoulder pain is limited. Notably, for corticosteroid injection, which was the most often delivered adjunctive treatment, there is evidence of no relevant difference compared with physiotherapy . Considering this and that the majority of the participants in our study who received injections received only one injection, we consider the likely impact of corticosteroid injections was minimal. Similar considerations apply to the other adjunctive treatments, which were received by smaller numbers of participants. In this context, we consider our decision not to perform the planned exploratory subgroup analyses, which included “medical treatment (specifically provision of injections)”, was appropriate.
Although set within one country, Germany, with clinical care under one orthopaedic specialist, the study findings are broadly applicable to adults with symptomatic PTTs undergoing a three-month period of conservative treatment with exercise-based physiotherapy.
The eight analyzed models could explain only a very limited amount (up to 12%, see R2ADJ values), of the variability of the outcome, which means that most of the variability remains unexplained. This finding could be partly due to the fact that the evidence base for most of the factors identified was generally very limited. Although we cannot say what other factors may have contributed to this unexplained variability, we suggest these may be among the 36 factors listed in the supplementary table. As evidenced by their low precision (SEE), the predictions are affected by considerable uncertainty; they consequently do not provide reliable estimates of population parameters. The “natural” temptation to select out more “promising” factors, such as pain catastrophizing, which featured in the three best models, should be countered by the realization that our study was explicitly designed to explore multivariable models rather than individual factors. Thus, the presented coefficient statistics do not represent the factors’ independent contributions to the predictions.
Lastly, it should be kept in mind that generally, any prognostic model that has been developed in a single population should only be considered clinically usable after it has been externally validated and, ideally, also evaluated for clinical impact .
Comparison with other studies
As already established, this is the first study aimed at predicting the outcome of conservative treatment with physical therapy in adults with symptomatic PTTs. Comparison with studies of adults undergoing conservative treatment with physiotherapy for rotator cuff disorders, in general, would be uninformative because of heterogeneity, not least in methodological terms .
We could not determine a prognostic model with satisfactory performance and precision. Thus, the challenge remains to develop a prognostic model with a satisfactory performance and precision for predicting the outcome of a phase of conservative treatment with physiotherapy in adults with symptomatic PTTs. Further high-quality prognostic studies are needed but should be underpinned, and thus preceded, by robust research aimed at improving knowledge of relevant factors. Consensus approaches (e.g. Delphi studies) may provide guidance about which factors to prioritize for future studies. Collaborative data collection and data sharing initiatives could enhance the realization of larger studies and applicability. Further methodological research is also needed to determine the optimal methods for developing prognostic models. Investigators of future prognostic model development studies should attend to the importance of the internal and external validation of any models with a promising performance.
- ∆AICC :
- ADJ, ADJ :
Adjusted (in the context of this study: for regression to the mean, RTM)
Akaike’s Information Criterion
- AICC :
Akaike’s Information Criterion, small sample variant
- AICMIN :
Smallest AIC (AICc) value
Deutsche Gesellschaft für Ultraschall in der Medizin [German Society for Ultrasound in Medicine]
Deutsche Vereinigung für Schulter- und Ellenbogenchirurgie [German Society of Shoulder and Elbow Surgery]
Full-thickness tear (rotator cuff)
Global Perceived Change
Long head of biceps
Minimal Important Difference
Magnetic resonance imaging
Pain Catastrophizing Scale
Patient-reported outcome measure
Partial-thickness tear (rotator cuff)
- R2 :
Coefficient of (multiple) determination
Regression to the mean
Standard error of the estimate
Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (checklist)
Visual analogue scale
Western Ontario Rotator Cuff index
Change of WORC score from baseline to follow-up
Kooijman M, Swinkels I, van Dijk C, de Bakker D, Veenhof C. Patients with shoulder syndromes in general and physiotherapy practice: an observational study. BMC Musculoskelet Disord. 2013;14:128.
Östör AJK, Richards CA, Prevost AT, Speed CA, Hazleman BL. Diagnosis and relation to general health of shoulder disorders presenting to primary care. Rheumatology. 2005;44:800–5.
van der Windt DA, Koes BW, de Jong BA, Bouter LM. Shoulder disorders in general practice: incidence, patient characteristics, and management. Ann Rheum Dis. 1995;54:959–64.
Cook JL, Purdam CR. Is tendon pathology a continuum? A pathology model to explain the clinical presentation of load-induced tendinopathy. Br J Sports Med. 2009;43:409–16.
Reilly P, Macleod I, Macfarlane R, Windley J, Emery RJH. Dead men and radiologists don’t lie: a review of cadaveric and radiological studies of rotator cuff tear prevalence. Ann R Coll Surg Engl. 2006;88:116–21.
Yamaguchi K, Ditsios K, Middleton WD, Hildebolt CF, Galatz LM. The demographic and morphological features of rotator cuff disease: a comparison of asymptomatic and symptomatic shoulders. J Bone Jt Surgery, Am Vol. 2006;88A:1699–704.
Matava MJ, Purcell DB, Rudzki JR. Partial-thickness rotator cuff tears. Am J Sports Med. 2005;33:1405–17.
Beaudreuil J, Bardin T, Orcel P, Goutallier D. Natural history or outcome with conservative treatment of degenerative rotator cuff tears. Joint Bone Spine. 2007;74:527–9.
Hedtmann A. Weichteilerkrankungen der Schulter – Subakromialsyndrome. Orthopädie und Unfallchirurgie up2date. 2009;4:85–106.
Finnan RP, L a C. Partial-thickness rotator cuff tears. J Shoulder Elb Surg. 2010;19:609–16.
Lenza M, Buchbinder R, Takwoingi Y, Johnston RV, Hanchard NC, Faloppa F. Magnetic resonance imaging, magnetic resonance arthrography and ultrasonography for assessing rotator cuff tears in people with shoulder pain for whom surgery is being considered. Cochrane Database Syst Rev. 2013;9:CD009020.
Tashjian RZ. AAOS clinical practice guideline: optimizing the management of rotator cuff problems. J Am Acad Orthop Surg. 2011;19:380–3.
Beaudreuil J, Dhénain M, Coudane H, Mlika-Cabanne N. Clinical practice guidelines for the surgical management of rotator cuff tears in adults. Orthop Traumatol Surg Res. 2010;96:175–9.
Ryösä A, Laimi K, Äärimaa V, Lehtimäki K, Kukkonen J, Saltychev M. Surgery or conservative treatment for rotator cuff tear: a meta-analysis. Disabil Rehabil. 2017;39(14):1357-63.
Colvin AC, Egorova N, Harrison AK, Moskowitz A, Flatow EL. National trends in rotator cuff repair. J Bone Joint Surg Am. 2012;94:227–33.
Paloneva J, Lepola V, Äärimaa V, Joukainen A, Ylinen J, Mattila VM. Increasing incidence of rotator cuff repairs--a nationwide registry study in Finland. BMC Musculoskelet Disord. 2015;16:189.
Svendsen SW, Frost P, Jensen LD. Time trends in surgery for non-traumatic shoulder disorders and postoperative risk of permanent work disability: a nationwide cohort study. Scand J Rheumatol. 2012;41:59–65.
Ylinen J, Vuorenmaa M, Paloneva J, Kiviranta I, Kautiainen H, Oikari M, et al. Exercise therapy is evidence-based treatment of shoulder impingement syndrome. Current practice or recommendation only. Eur J Phys Rehabil Med. 2013;49:499–505.
Butler M, Forte M, Braman J, Swiontkowski M, Kane RL. Nonoperative and Operative treatments for rotator cuff tears: future research needs: identification of future research needs from comparative effectiveness review no. 22. Rockville: Agency for Healthcare Research and Quality (US). Report No.: 13-EHC050-EF. AHRQ Future Research Needs Papers. 2013. https://www.ncbi.nlm.nih.gov/pubmed/23905196.
Croft P, Altman DG, Deeks JJ, Dunn KM, Hay AD, Hemingway H, et al. The science of clinical practice: disease diagnosis or patient prognosis? Evidence about “what is likely to happen” should shape clinical practice. BMC Med. 2015;13:20.
Cochrane Prognosis Methods Group. 2018.http://prognosismethods.cochrane.org/. Accessed 21 Apr 2018.
Progress. mrc prognosis research strategy partnership. 2018.http://progress-partnership.org/. Accessed 21 Apr 2018.
Riley RD, Hayden JA, Steyerberg EW, Moons KGM, Abrams K, Kyzas PA, et al. Prognosis research strategy (PROGRESS) 2: prognostic factor research. PLoS Med. 2013;10:e1001380.
Hemingway H, Croft P, Perel P, Hayden JA, Abrams K, Timmis A, et al. Prognosis research strategy (PROGRESS) 1: a framework for researching clinical outcomes. BMJ. 2013;346:e5595.
Steyerberg EW, Moons KGM, van der Windt DA, Hayden JA, Perel P, Schroter S, et al. Prognosis research strategy (PROGRESS) 3: prognostic model research. PLoS Med. 2013;10:e1001381.
Moons KGM, Royston P, Vergouwe Y, Grobbee DE, Altman DG. Prognosis and prognostic research: what, why, and how? BMJ. 2009;338:b375.
Royston P, Moons KGM, Altman DG, Vergouwe Y. Prognosis and prognostic research: developing a prognostic model. BMJ. 2009;338:b604.
Braun C, Hanchard NC, Batterham AM, Handoll HH, Betthäuser A. Prognostic models in adults undergoing physical therapy for rotator cuff disorders: systematic review. Phys Ther. 2016;96:961–71.
Braun C. Predicting the outcome of physiotherapy in poeple with painful partial-thickness rotator cuff tears. A thesis submitted in partial fulfillment of the requirements of Teesside University for the award of the degree of Doctor of Philosophy (PhD) http://tees.openrepository.com/tees/handle/10149/621790 (2016). Accessed 21 Apr 2018.
Collins GS, Reitsma JB, Altman DG, Moons KGM. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMJ. 2015;350:g7594.
Roy J-S, Braën C, Leblond J, Desmeules F, Dionne CE, MacDermid JC, et al. Diagnostic accuracy of ultrasonography, MRI and MR arthrography in the characterisation of rotator cuff disorders: a systematic review and meta-analysis. Br J Sports Med. 2015;49:1316–28.
Khan KM, Cook JL, Maffulli N, Kannus P. Where is the pain coming from in tendinopathy? It may be biochemical, not only structural, in origin. Br J Sports Med. 2000;34:81–3.
DVSE. Untersuchungstechniken des Schultergelenks. Expertenevaluation auf der Basis einer Literaturanalyse. Obere Extermität. 2012;7(Suppl 1):3–68.
Konermann W, Gruber G. Ultraschalldiagnostik Der Bewegungsorgane. In: Kursbuch nach den Richtlinien der DEGUM und der DGOU. 2nd ed. Stuttgart: Thieme; 2007.
Hedtmann A, Fett H. Schultersonographie bei Subakromialsyndromen mit Erkrankungen und Verletzungen der Rotatorenmanschette (Sonography of the shoulder in subacromial syndromes with diseases and injuries of the rotator cuff). Orthopade. 1995;24:498–508.
Hedtmann A, Fett A. Sonographie der Rotatorenmanschette (Ultrasonographic diagnosis of the rotator cuff). Orthopade. 2002;31:236–46.
Braun C, Hanchard NCA. Manual therapy and exercise for impingementrelated shoulder pain. Phys Ther Rev. 2010;15:62–83.
Braun C, Bularczyk M, Heintsch J, Hanchard NCA. Manual therapy and exercises for shoulder impingement revisited. Phys Ther Rev. 2013;18:263–84.
Kirkley A, Alvarez C, Griffin S. The development and evaluation of a disease-specific quality-of-life questionnaire for disorders of the rotator cuff: the western Ontario rotator cuff index. Clin J Sport Med. 2003;13:84–92.
Huber W, Hofstaetter JG, Hanslik-Schnabel B, Posch M, Wurnig C. Translation and psychometric testing of the western Ontario rotator cuff index (WORC) for use in Germany. Z Orthop Ihre Grenzgeb. 2005;143:453–60.
Huang H, Grant JA, Miller BS, Mirza FM, Gagnier JJ. A systematic review of the psychometric properties of patient-reported outcome instruments for use in patients with rotator cuff disease. Am J Sports Med. 2015;43:2572–82.
St-Pierre C, Desmeules F, Dionne CE, Frémont P, MacDermid JC, Roy J-S. Psychometric properties of self-reported questionnaires for the evaluation of symptoms and functional limitations in individuals with rotator cuff disorders: a systematic review. Disabil Rehabil. 2016;38:103–22.
Linden A. Assessing regression to the mean effects in health care initiatives. BMC Med Res Methodol. 2013;13:119.
Vittinghoff E, McCulloch CE. Relaxing the rule of ten events per variable in logistic and cox regression. Am J Epidemiol. 2007;165:710–8.
Altman DG, Royston P. The cost of dichotomising continuous variables. BMJ. 2006;332(7549):1080.
Royston P, Altman DG, Sauerbrei W. Dichotomizing continuous predictors in multiple regression: a bad idea. Stat Med. 2006;25:127–41.
Anderson DA. Model based inference in the life sciences. In: A primer on evidence. New York: Springer science + business Media; 2008.
Burnham KP, Anderson DR, Huyvaert KP. AIC model selection and multimodel inference in behavioral ecology: some background, observations, and comparisons. Behav Ecol Sociobiol. 2011;65:23–35.
Burnham KP, Anderson DR. Model selection and multimodel inference. A practical information-theoretic approach. 2nd ed. New York: Springer; 2002.
Miles J, Shevlin M. Applying regression & correlation. London: Sage Publications; 2001.
Braun C, Handoll HH. Estimating the minimal important difference for the western Ontario rotator cuff index (WORC) in adults with shoulder pain associated with partial-thickness rotator cuff tears. Musculoskelet Sci Pract. 2018;35:30–3.
Harrell FE, Lee KL, Mark DB. Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Stat Med. 1996;15:361–87.
Flom PL, Cassell DL. Statistics and Data Analysis: Why stepwise and similar selection methods are bad , and what you should use (statistics and data analysis): North East SAS Users Group (NESUG) Annual Conference 2007. 2007.http://www.lexjansen.com/pnwsug/2008/DavidCassell-StoppingStepwise.pdf, https://www.lexjansen.com/nesug/. Accessed 21 Apr 2018.
Harrell FE. Regression modeling strategies. 1st ed. New York: Springer; 2001.
Mohamadi A, Chan JJ, Claessen FMAP, Ring D, Chen NC. Corticosteroid injections give small and transient pain relief in rotator cuff Tendinosis: a meta-analysis. Clin Orthop Relat Res. 2017;475:232–43.
Sullivan MJL, Bishop SR, Pivik J. The pain Catastrophizing scale: development and validation. Psychol Assess. 1995;7:524–32.
Meyer K, Sprott H, Mannion AF. Cross-cultural adaptation, reliability, and validity of the German version of the pain Catastrophizing scale. J Psychosom Res. 2008;64:469–78.
This study formed part of the work included in the PhD thesis (conferred December 2016) of CB at Teesside University, Middlesbrough, UK. We are grateful to all patients who participated in the study, and to all collaborating physiotherapists.
Availability of data and material
The datasets generated and/or analysed during the current study that involve patient data are not publicly available because this was not established, nor would it have been accepted, as part of the ethics application. Other datasets and study material that do not require participant consent for sharing are publicly available  and also available from the corresponding author on reasonable request.
This research did not receive any grant from funding agencies in the public, commercial or not-for-profit sectors.
Ethics approval and consent to participate
The study was approved by the Teesside University School of Health and Social Care Research Governance & Ethics Committee and the Ethics Commission of the Hamburg Medical Council (Germany). Written informed consent, a prerequisite for study participation, was obtained from all participants.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Braun, C., Hanchard, N.C., Handoll, H.H. et al. Predicting the outcome of conservative treatment with physiotherapy in adults with shoulder pain associated with partial-thickness rotator cuff tears – a prognostic model development study. BMC Musculoskelet Disord 19, 329 (2018) doi:10.1186/s12891-018-2239-8
- Shoulder pain
- Rotator cuff
- Conservative treatment
- Physical therapy
- Prognostic model development