
Predicting the outcome of conservative treatment with physiotherapy in adults with shoulder pain associated with partial-thickness rotator cuff tears – a prognostic model development study



Rotator cuff disorders represent the commonest type of painful shoulder complaints in clinical practice. Although conservative treatment including physiotherapy is generally recommended as first-line treatment, little is known about the precise treatment indications for subgroups of rotator cuff disorders, particularly people with shoulder pain associated with partial-thickness tears of the rotator cuff (PTTs), termed “symptomatic PTTs”. The aim of this study was to develop a prognostic model for predicting the outcome of a phase of conservative treatment primarily with physiotherapy in adults with symptomatic PTTs.


A prospective observational cohort study was conducted in an outpatient setting in Germany. Ten baseline factors were selected to evaluate nine pre-defined multivariable candidate prognostic models (each including between two and nine factors) in a cohort of adults with symptomatic atraumatic PTTs undergoing a three-month phase of conservative treatment primarily with physiotherapy. The primary outcome was change in the Western Ontario Rotator Cuff Index. The models were developed using linear regression and an information-theoretic analysis approach: Akaike’s Information Criterion (AICC).


Eight candidate models were analyzed using data from 61 participants. Two “best models” were identified: smoking & pain catastrophizing and disability & pain catastrophizing. However, none of the models had a satisfactory performance or precision.


We could not determine a prognostic model with satisfactory performance and precision. Further high-quality prognostic model studies with larger samples are needed, but should be underpinned, and thus preceded, by robust research that enhances knowledge of relevant prognostic factors.

Study registration

DRKS00004462. Registered 08 April 2014; retrospectively registered (prior to the analysis).



Painful shoulder complaints are common musculoskeletal disorders in clinical practice [1], most being attributed to rotator cuff pathology [2, 3]. Rotator cuff pathology encompasses a range of pathologies from tendinopathy to tears, which may be partial- or full-thickness [4]. Reported rates of symptomatic partial-thickness tears (PTTs), the condition of interest in this study, vary between 7% [5] and 24% [6] in shoulder pain populations. Of the four rotator cuff tendons (supraspinatus, infraspinatus, teres minor, subscapularis), the supraspinatus is by far the most often affected [7], and also usually the first to tear [8, 9]. In order to concisely label the population of interest, we use the term “symptomatic PTT” to describe people with shoulder pain in the presence of a PTT of the rotator cuff.

The clinical presentation of symptomatic PTTs is essentially that of “shoulder impingement” [7, 9, 10]. Verification of a PTT requires diagnostic imaging, commonly ultrasonography (US) or magnetic resonance imaging (MRI) [11].

Current guidelines for rotator cuff disorders [12, 13] recommend conservative treatment with medical care and physiotherapy as the first-line treatment; surgical intervention being mainly reserved for non-responders. Head-to-head comparisons of conservative and surgical interventions [14] have overall shown no clinically relevant differences. However, utilisation of surgery for rotator cuff disorders has significantly increased in many countries [15,16,17], with physiotherapy bypassed in some cases [18]. Both unnecessary surgery and ineffective conservative treatment are undesirable. Knowledge about a patient’s likely response to conservative treatment at the point of diagnosis would save time, effort and suffering, limit exposure to the risks of surgery, and inform distribution of resources. “Understanding which patients [with rotator cuff tears] do best with non-operative treatment” has been rated a top “priority scientific research issue” ([19], p. 10).

The importance of predicting individuals’ responses to particular interventions is increasingly recognized [20], with a corresponding development in prognosis research methodology [21, 22]. One aspect of prognosis research involves the identification of single, independent factors [23]. However, these are unlikely to predict outcomes satisfactorily. Multivariable prognostic models are better placed as they account for real-life clinical complexities [24, 25]. Estimates of prognosis are highly context-dependent, with relevant contextual factors being existing diagnostic and treatment practices, time and place.

Prognostic model research encompasses three key phases: development including internal validation; external validation; and evaluation of clinical impact [25]. External validation is essential before a model may be usable in practice [25]. While prospective cohort studies are generally considered the preferable design for the initial development of a prognostic model [25,26,27], evaluations of the clinical impact of a prognostic model ultimately require comparative studies.

Our systematic review of the evidence on prognostic models for predicting outcomes in adults undergoing physiotherapy for rotator cuff disorders showed a lack of clinically usable prognostic models and, crucially, of prognostic model research on PTTs [28]. The primary aim of the present study was to develop a multivariable prognostic model for the outcome of a phase of conservative treatment with physiotherapy in adults with symptomatic atraumatic PTTs. Secondary aims were to determine the incidence of tear progression and to establish participants’ perceived change of their shoulder complaints over time.


The study was based on an a priori protocol and was approved by the Teesside University School of Health and Social Care Research Governance & Ethics Committee and the Ethics Commission of the Hamburg Medical Council (Germany). It was registered in the German Clinical Trials Register (DRKS00004462). The study design was informed by the most current methodological guidance available at the time of planning [21, 22]. All deviations from protocol were discussed and recorded prior to implementation [29]; the only two relevant deviations are flagged up in this section. This report complies with the items required by the TRIPOD (Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis) prediction model development checklist [30].

Study design, setting and key dates

We conducted a prospective observational single-group cohort study set in Hamburg, Germany. All recruitment and assessments took place in a single-handed medical specialist practice led by one of the authors, AB, an orthopaedic shoulder specialist and DEGUM (German Society for Ultrasonography in Medicine) certified instructor in ultrasonographic shoulder diagnosis. The physiotherapy treatment took place in 24 collaborating physical therapy practices in the broader area of Hamburg. (In our protocol, we initially considered seven collaborating practices, but eventually expanded their number to 24 to improve recruitment.) Recruitment took place between December 2012 and September 2014. Follow-up ended in January 2015.


Eligible patients were adults (≥ 18 years) presenting with shoulder pain unrelated to a traumatic event (e.g. an accident) and an ultrasonographically determined PTT who had accepted advice to undergo conservative treatment with physiotherapy (see Table 1 for the full eligibility criteria). These patients typically present with clinical signs of “shoulder impingement”, such as a painful arc or positive “impingement signs” [7, 9, 10]. We additionally determined the presence of a PTT by diagnostic ultrasonography, which is highly specific for detecting PTTs [31]. Our intention was to recruit patients whose shoulder pain could reasonably be linked to the presence of a PTT; however, we acknowledge that the precise link between shoulder pain and the presence of a PTT (similar to other shoulder structures) is unclear [32]. Following standard practice, the assessment involved a structured patient history, physical and ultrasonographic evaluation. The physical evaluation was based on DVSE (German Society for Shoulder and Elbow Surgery) recommendations [33]. The ultrasonographic evaluation followed DEGUM and DGOU (German Society for Orthopaedics and Trauma) standards [34]. An ultrasound unit within the highest DEGUM appliance class was used together with a linear transducer with a resolution of ≥10 MHz and width of ≥40 mm. Diagnosis of a rotator cuff defect was based on alterations of structure and form, following the criteria of Hedtmann & Fett [35, 36]. In distinction to a PTT, a full-thickness tear (FTT) was marked by the absence of a depiction of the rotator cuff (discontinuity of the cuff).

Table 1 Eligibility criteria


Participants were followed over three months of standard conservative care with physiotherapy in one of the collaborating practices. Adjunctive medical treatment (e.g. local steroid injections) was delivered by AB where considered appropriate. The physiotherapy treatment followed a broad best-evidence protocol based on two systematic reviews [37, 38]. These reviews provided evidence supporting exercises with or without manual therapy as the first-line approach for treating patients with rotator cuff related shoulder pain including PTTs, but could not provide conclusive guidance on the optimal type or dose of treatment. Since there was no justification for restricting treatment to any specific exercises or manual techniques, the protocol was based on the broad principles that a) exercises, preferably combined with manual techniques (soft tissue and/or joint mobilisation), would be the key treatment components, and b) flexibility of the interventions and in the provision of adjunctive modalities would be allowed. In keeping with the ethos of an observational study, the specific content and amount of treatment were unregulated, i.e. individually advised. Treatment, which included the clinical follow-up appointment at three months to assess progress and need for further treatment, was delivered in compliance with German healthcare regulations and AB’s standard practice. Acceptability of the physiotherapy protocol was confirmed by all collaborating physiotherapy practices. Treatment details were documented in a purpose-designed, piloted report form.


The primary outcome, the outcome to be predicted, was the change in ‘disability’ (disability and health-related quality of life) from baseline to follow-up, measured by a validated German version of the Western Ontario Rotator Cuff Index (WORC) [39, 40]: WORCCHANGE. The WORC has been shown to be a valid, reliable and responsive patient-reported outcome measure (PROM) for use in people with rotator cuff disorders [41, 42]. It comprises 21 questions. Responses are made by putting a mark on a 100 mm visual analogue scale (VAS), with lower scores indicating less disability. Scores range from 0 to 2100 [39]. We adjusted all WORCCHANGE values for Regression to the Mean (RTM) using methods outlined by Linden [43]. Participants completed questionnaires at baseline and at 3 to 4 months, the study endpoint, either at AB’s clinic or at home.

As both the WORC and all prognostic factors were patient-assessed, there was no blinding of participants. Nonetheless, the WORC was completed independently and in the absence of AB and study investigators.

Secondary outcomes were tear progression, defined as the presence (yes or no) of an FTT at follow-up, and participants’ perceived overall change of their shoulder problem, measured by a 7-point Global Perceived Change (GPC) scale (from − 3 = “worse as ever” to + 3 = “completely recovered”). Lastly, physical therapy-related adverse events were monitored.

Prognostic factors

Inclusion of candidate factors was restricted to factors from the baseline assessment, regardless of their type (e.g. demographic, physical). Selection was done through a systematic, three-stage approach comprising identification of factors, critical assessment of these, and a consensus phase that aimed to select a maximum of 10 factors (see Fig. 1 for an outline of the process; a full account is available in Braun 2016 [29, Chapter 5]). The process was informed by comprehensive literature searches of several electronic databases, including Medline, Embase and Cinahl, for primary prognostic studies, prognostic systematic reviews and expert consensus studies. We screened overall around 3900 records and identified 23 primary study reports (relating to 22 studies), one systematic review and one expert consensus study as relevant sources for informing the selection of factors for our study (a list of these articles is provided in Additional file 1). We extracted and considered 36 factors altogether (these are listed in Additional file 2, which also shows for each factor whether it was included or excluded and the reasons for exclusion). We assessed the relevance of all factors to the study population and setting, their measurement properties, practicality of use, and their applicability, and excluded those that were either not relevant to the study population and setting, not sufficiently valid and reliable, or not applicable in most clinical settings. We grouped the remaining factors according to the availability of clinical evidence and expert consensus supporting their prognostic relevance; we gave preference to the selection of those factors for which there was reasonably consistent support for their prognostic relevance, either through clinical evidence from several studies, or from both clinical evidence and expert consensus. 
Notably, there was reasonably consistent evidence of prognostic value from several studies pertaining to clinical outcomes of conservative treatment in adults with rotator cuff disorders for only three factors: age, disability and symptom duration. We finally agreed on 10 factors: age, sex, physical demands, disability, pain, history of shoulder pain, symptom duration, diabetes, smoking and pain catastrophizing. We gave thorough attention to factor definitions and measurements (Table 2). All factors were assessed during the patients’ baseline appointment with AB. Since the study was prospective, the assessment of prognostic factor information was inherently blinded to knowledge about the outcome.

Fig. 1

Identification and selection of candidate factors – outline of process

Table 2 Candidate factors – definition and measurement

Sample size

The multivariable nature of prognostic model studies makes it difficult to estimate the required sample size [26]. Indeed, no formal methods (based on either power calculations or adequate precision of estimation of effects) are available to determine the effective sample size, and recommendations for the sample size vary across the literature. Following work by Vittinghoff & McCulloch [44], we based the minimum sample size of our study on a requirement of 5 to 9 outcome events (events equate to individuals for continuous outcomes) per candidate prognostic factor in relation to the full model (i.e. the model including all 10 factors). As per our protocol, we initially planned to analyze the WORC as a binary outcome variable, but subsequently (and prior to the analysis) decided to analyze it as a continuous variable to avoid the unnecessary loss of information that would have resulted from dichotomization [45, 46]. By analyzing the WORC on a continuous scale, and setting out to study overall 10 factors, which we considered feasible, we aimed to include (5 to 9)*10 = 50 to 90 participants. Increased by 20% to allow for losses to follow-up, the recruitment target was 60 to 108 patients.
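The arithmetic behind the recruitment target can be retraced with a minimal sketch. The function name and its parameters are illustrative assumptions and not part of the study protocol; only the numbers (10 factors, 5 to 9 events per factor, a 20% allowance) come from the text above.

```python
def recruitment_target(n_factors, events_per_factor, attrition_allowance=0.20):
    """Return (analysis_n, recruitment_n) for an events-per-factor rule.

    Hypothetical helper: analysis_n is the number of participants needed
    for the analysis, recruitment_n adds an allowance for losses to follow-up.
    """
    analysis_n = n_factors * events_per_factor
    recruitment_n = round(analysis_n * (1 + attrition_allowance))
    return analysis_n, recruitment_n


# Lower and upper bound for the full model with 10 candidate factors
print(recruitment_target(10, 5))  # (50, 60)
print(recruitment_target(10, 9))  # (90, 108)
```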

Missing data

Any missing prognostic factor and outcome data were documented. The decision about the method for dealing with missing data, including whether or not to impute any missing data, was made prior to the analysis. We considered the amount and also the potential reasons for missing values, i.e. whether the reasons for missingness appeared systematic or random. We decided to limit the replacement of missing values to those missing for the two multi-item measures, the WORC (baseline and follow-up) and the Pain Catastrophizing Scale (PCS). No standard missing rule was available for the WORC in the literature; therefore, we replaced missing WORC values by the mean of the respective domain. We replaced missing PCS values by the mean of the items that were completed, as suggested by the primary originator of the scale, Prof Michael Sullivan (personal communication 02/06/2014). We did not replace any missing values where the PCS was completely missing. As the information-theoretic analysis approach we used required identical datasets, the data were analyzed on a complete-case basis. We would have considered formal testing of the effects of missing data should the amount have been bigger and should the reasons for missingness have been of concern.
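The replacement rule for the multi-item measures can be illustrated with a short sketch. The function name and the example scores are hypothetical; the sketch only assumes that missing items are coded as None and that a scale (or, for the WORC, a domain) is passed in as a list of item scores.

```python
from statistics import mean


def fill_with_mean_of_completed(items):
    """Replace None entries with the mean of the completed items.

    Returns None when every item is missing, mirroring the decision
    not to replace values where a questionnaire was missing completely.
    """
    completed = [value for value in items if value is not None]
    if not completed:
        return None
    fill = mean(completed)
    return [fill if value is None else value for value in items]


# Hypothetical item scores with one missing response
print(fill_with_mean_of_completed([2, None, 4, 3]))  # [2, 3, 4, 3]
```

For the WORC, the same function would be applied separately to the items of each domain, so that a missing item is replaced by the mean of its own domain.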

Statistical analysis methods

We intended to include all 10 candidate factors in the prognostic modelling analysis. All continuous factors, WORC and PCS scores, were analyzed as continuous measurements. All non-continuous factors were binary.

We based our analysis on an information-theoretic approach, namely on a small-sample variant of Akaike’s Information Criterion (AIC) approach, AICC [47]. Information-theoretic approaches to model selection differ from other approaches, particularly from the widely used stepwise regression approaches, in several ways. Under the AIC approach, selection is based on the comparison of multiple candidate models, which are pre-specified based on “theory”, rather than on a single global set of factors [48]. Selection is further based on an information-theoretic criterion (e.g. AIC), which provides “numerical values that represent the scientific evidence” for a model, but no “test statistics” such as p values, thus avoiding the application of arbitrary cut-offs of “statistical significance” ([47] p. 64). Reflecting the perspective that models never reflect “full reality”, i.e. that they are approximations ([47], p. 27), the AIC value represents an estimator of the information that is inherently lost when a model is used to approximate full reality (Kullback-Leibler information) [48]. The AIC accounts for the number of candidate factors by ‘penalizing’ models with larger numbers of factors, thereby favouring parsimony ([47], p. 60–1). The model with the lowest AIC value (AICMIN) represents the closest approximation and is accordingly termed the “best model” within a set of models [47]. AIC differences (∆AIC = AIC – AICMIN) can then be calculated to rank the models by their distance to the best model [47, 48]. Burnham et al. ([48], p. 25) have proposed considering models with ∆AIC values < 4 to 7 as “plausible” alternatives to the best model, whereas models with higher ∆AIC values (> 9) have little to no support. AIC values are relative rather than absolute, and “on the scale of information” ([47], p. 84). Accordingly, their use is limited to comparing models within a defined set of models [49]. 
As the AIC approach will always select a best model among a set of models, it has been suggested that the worth of the best or the global (full) model be assessed, e.g. by a goodness-of-fit test, analysis of residuals or the adjusted R2 (the percentage of variance explained) [47].
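The ranking logic described above can be sketched in a few lines. The sketch assumes the least-squares form of the criterion given by Burnham and Anderson, in which K counts the regression coefficients, the intercept and the residual variance; statistical packages may report values that differ from this form by a constant, which does not affect the differences between models fitted to the same data. Function names and the input values are illustrative assumptions, not study data.

```python
import math


def aicc_least_squares(rss, n, n_factors):
    """Small-sample AIC for a linear regression fitted by least squares.

    rss: residual sum of squares; n: number of participants;
    n_factors: number of prognostic factors in the model.
    """
    k = n_factors + 2  # coefficients + intercept + residual variance
    aic = n * math.log(rss / n) + 2 * k
    return aic + (2 * k * (k + 1)) / (n - k - 1)


def delta_aicc(aicc_by_model):
    """Distance of each model to the best (lowest AICC) model in the set."""
    best = min(aicc_by_model.values())
    return {name: value - best for name, value in aicc_by_model.items()}


# Hypothetical residual sums of squares for two candidate models, n = 61
scores = {
    "two-factor model": aicc_least_squares(6.2e6, 61, 2),
    "nine-factor model": aicc_least_squares(5.9e6, 61, 9),
}
print(delta_aicc(scores))
```

Because the penalty grows with K, a model with more factors needs a clearly smaller residual sum of squares to be ranked ahead of a simpler one.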

Following recommendations from the literature that the number of candidate models should usually be limited to a few [47], we decided to analyze a selection of nine candidate models. The selection of models was based on clinical and theoretical considerations, with the first model (number 1 in Table 3) including all 10 candidate prognostic factors (thus representing the “full model”). The composition of the other eight models, which included between two and eight of these factors, was based on various characteristics, as shown in Table 3. Examples of characteristics were the potential for modification (model 2) or the effort required for the assessment of prognostic factors (models 5 and 7, inclusion or exclusion of questionnaires), which would be highly relevant to clinical practitioners. The primary analysis approach was a linear regression analysis [26, 49] which we conducted in IBM SPSS Statistics 22. All continuous factors were modelled as linear. Satisfaction of the assumptions of linear regression was assessed visually for each model based on the residual plot (scatterplot of standardized residuals against standardized predicted values) [50].

Table 3 Candidate prognostic models and key model statistics

We extracted the following statistics: the AICC value; the standard error of the estimate (SEE), as the primary measure of model precision; the adjusted coefficient of (multiple) determination (R2ADJ), as a complementary measure of model performance; the regression constant (Constant); and the unstandardized regression coefficients (B) of all factors with their 95% confidence intervals (CIs). For comparison of the different models, we extracted AICC, ∆AIC and SEE values.
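For readers less familiar with these statistics, the following sketch shows how the SEE and the adjusted R2 follow from the sums of squares of a fitted linear regression. It uses the standard textbook definitions rather than the SPSS output itself; the function names and example values are assumptions made for illustration.

```python
import math


def standard_error_of_estimate(rss, n, n_factors):
    """SEE: square root of the residual variance, in units of the outcome."""
    return math.sqrt(rss / (n - n_factors - 1))


def adjusted_r2(rss, tss, n, n_factors):
    """Adjusted R2: explained variance corrected for the number of factors.

    rss: residual sum of squares; tss: total sum of squares of the outcome.
    """
    return 1 - (rss / (n - n_factors - 1)) / (tss / (n - 1))


# Hypothetical sums of squares for a two-factor model with n = 61
print(standard_error_of_estimate(5800.0, 61, 2))  # 10.0
print(adjusted_r2(5800.0, 6000.0, 61, 2))
```

Note that the adjusted R2 becomes negative when a model explains less variance than would be expected from its number of factors alone, which is why negative values can appear among the model statistics.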

Model validation and further analyses

We intended to compare the SEE of the best model with the estimate of the Minimal Important Difference (MID) of the WORC, which we intended to derive from the sample data, and to internally validate any model with an SEE substantially lower than the MID. We intended to conduct the following exploratory subgroup analyses: amount of physiotherapy (number of sessions); medical treatment (specifically provision of injections); and length of follow-up.



Figure 2 illustrates the flow of participants. Of 82 eligible participants, 70 were included, of whom 65 (representing 65 shoulders) completed the study. The baseline characteristics and prognostic factor information of these 65 participants are presented in Table 4.

Fig. 2

Flow of participants

Table 4 Baseline characteristics and prognostic factor data

The amount of missing data was small: six values (0.4% of all values) were missing for the baseline WORC; 11 (1%) for the follow-up WORC; and six (1%) for the single-item prognostic factors. The PCS was missing completely for three participants; beyond this, only one PCS value (0.1%) was missing. The distribution appeared random, thus non-systematic. Four participants had missing prognostic factor data after replacement of missing WORC and PCS values, and were consequently, in keeping with the need for identical datasets for the AIC approach [47], excluded from the modelling. The data of 61 participants were analyzed. The mean (SD) interval between completion of the baseline and follow-up WORC (and GPC) was 97 (17) days (n = 65 for WORC, 64 for GPC). The mean (SD) interval between the baseline and follow-up US assessment was 100 (13) days (n = 52).


All participants received conservative treatment with physiotherapy. The mean (SD) number of physiotherapy sessions was 12 (6); and the mean (SD) duration of single sessions was 28 (13) minutes. A breakdown of the physiotherapy treatment content, documented by the physiotherapists, is provided in Table 5. Treatment usually included a combination of exercises and manual techniques. Consistent with physiotherapy practice in Germany, where this study took place, all physiotherapists routinely provided advice and patient education.

Table 5 Breakdown of physiotherapy treatment

Thirty-seven participants (57% of 65) received some supplementary medical treatment: i.e. subacromial steroid injection (27; of these, 24 received one injection and three received two injections), elastic tape (12) or prescription of oral medication (Metamizole, 1). No participant was put on sick leave.


The mean (SD) unadjusted WORCCHANGE score (n = 65) was − 363 (361); the range was − 1248 to 372. The mean (SD) RTM-adjusted WORCCHANGE score was − 363 (341); the range was − 1102 to 387. Tear progression to an FTT occurred in two participants (4%, n = 52). Adverse events were reported for six participants (9%, n = 65), and related exclusively to temporary exacerbations of the shoulder symptoms. Fifty-five participants (86%, n = 64) rated their shoulder problem as improved (positive GPC ratings), five (8%) as unchanged (GPC = 0), and four (6%) as deteriorated (negative GPC ratings). The MID estimate for the WORC, which we derived from the sample data using an anchor-based approach (n = 64), was − 300 (this analysis is reported in a separate article [51]).

Prognostic modelling

There were no complexities (e.g. unit of analysis issues) in the data. We excluded diabetes from the analysis because of its very low prevalence in the sample (Table 4), and consequently excluded one two-factor model, ‘diabetes & smoking’ (Table 3). The ratio of the number of outcome events (individuals with data available for analysis) to the overall number of analyzed candidate factors approximated to 7 (61/9); the range across all models was, depending on the number of factors included in each model, approximately 7 to 31. The residual plots showed no strong evidence of a violation of the assumptions for linear regression for any of the models.

The key model statistics are shown in Table 3. The coefficient statistics for each model and each prognostic factor are provided with the supplementary materials (Additional file 3). Two models with the same AICC value (models 2 and 5) were identified as the best models. The model with the third-lowest AICC value (model 9) had an ∆AICC within the range of plausible alternatives (∆AICC < 7) to the best models [48]. The remaining models had ∆AICC values outside this range. The SEE ranged from 313 to 344, and was, for all models, higher than the estimated MID of the WORC (300). The full model (model 1) had the highest R2ADJ (the range of all models was from − 0.06 to 0.12).

Model validation and further analyses

The performance and precision of the analyzed models justified neither internal validation nor the planned subgroup analyses.


Principal findings

Despite our rigorous approach and meeting our minimum sample size (relating to the full model), we did not achieve our primary aim of developing a prognostic model for the outcome of a phase of conservative treatment with physiotherapy in adults with symptomatic atraumatic rotator cuff PTTs. Of the eight models for which testing was appropriate, none had a satisfactory performance (R2ADJ) or precision (SEE).

Strengths and weaknesses of the study

The rigorous methodological design of our study helped to avoid various potential sources of bias. This included avoidance of statistical univariable selection techniques, which have been linked to biased predictions [52], and the analysis of continuous measurements on their continuous scale, thereby avoiding the various problems associated with the categorization of continuous measurements [45, 46]. The latter reflected our post-protocol decision to analyze the WORC on a continuous scale, instead of analyzing it as a binary outcome. By using an information-theoretic analysis approach, we purposely avoided the selection of factors within the multivariable analysis based on arbitrary cut-offs of “statistical significance”, as these, in particular stepwise regression techniques, have been linked to biased predictions [52,53,54]. Although the outcome assessment could not be blinded to the prognostic factor information, any influence of participants’ knowledge about prognostic factor information on the outcome is unlikely because the participants did not know which of the multiple baseline variables were modelled.

The ratio of outcome events to candidate factors was within the pre-specified range of 5 to 9 for the full model (and considerably higher, i.e. > 20, for some of the other models), and losses to follow-up and missing data were few. Additionally, as the reasons for missingness appeared non-systematic, we considered the data from the complete cases as representative of the whole sample. However, despite our meeting our sample size estimate, sample size is a key limitation of our study as indicated by the low precision and also by the rejection of the ‘diabetes & smoking’ model due to the low numbers of diabetic patients recruited. In the absence of any formal methods to determine the effective sample size, and without prior knowledge of the relationship between the candidate prognostic factors, it was difficult to estimate the sample size for our study (please see reviewer feedback on this aspect in Open Peer Review Reports). Considering the low precision of the analyzed models in our study, we conclude that a much larger sample size would have been needed to increase the chances of achieving satisfactory precision of the analyzed models.

Rigour was applied to the consideration of the clinical relevance, practicality of measurement and applicability of the study findings. All PTTs were diagnosed by US, which is highly specific (94%), but less sensitive (68%) for detecting PTTs [31]. This means that, while some PTTs might have been missed, those identified were almost certainly true positives; hence, the study population was homogeneous in this respect. We aimed to enroll patients at a fairly similar state of health. Similarity of several baseline characteristics such as pain intensity, symptom duration and disability could not be guaranteed, as their restriction would have threatened recruitment, but was accounted for by candidate prognostic factors.

The physiotherapy protocol accommodated clinical autonomy within an evidence-based framework. Some of the study participants received adjunctive medical treatment, such as a local steroid injection. Arguably, the different treatments may have had an impact on the overall improvement of the participants during the three-month treatment period and also on the predictive performance of the analyzed models. We are confident, though, that this was not a relevant issue in our study. Consistent with our study question, we selected prognostic factors that were present at baseline before starting conservative treatment. The primary treatment was exercise-based physiotherapy within an evidence-based framework. The adjunctive treatments, each of which was provided to a minority of participants, included subacromial corticosteroid injections, elastic tapes and oral pain medication. The evidence on the effectiveness of these treatments for rotator cuff related shoulder pain is limited. Notably, for corticosteroid injection, which was the most often delivered adjunctive treatment, there is evidence of no relevant difference compared with physiotherapy [55]. Considering this and that the majority of the participants in our study who received injections received only one injection, we consider that the likely impact of corticosteroid injections was minimal. Similar considerations apply to the other adjunctive treatments, which were received by smaller numbers of participants. In this context, we consider that our decision not to perform the planned exploratory subgroup analyses, which included “medical treatment (specifically provision of injections)”, was appropriate.

Although set within one country, Germany, with clinical care under one orthopaedic specialist, the study findings are broadly applicable to adults with symptomatic PTTs undergoing a three-month period of conservative treatment with exercise-based physiotherapy.

The eight analyzed models could explain only a very limited amount (up to 12%, see R2ADJ values) of the variability of the outcome, which means that most of the variability remains unexplained. This finding could be partly due to the fact that the evidence base for most of the factors identified was generally very limited. Although we cannot say what other factors may have contributed to this unexplained variability, we suggest these may be among the 36 factors listed in the supplementary table. As evidenced by their low precision (SEE), the predictions are affected by considerable uncertainty; they consequently do not provide reliable estimates of population parameters. The “natural” temptation to select out more “promising” factors, such as pain catastrophizing, which featured in the three best models, should be countered by the realization that our study was explicitly designed to explore multivariable models rather than individual factors. Thus, the presented coefficient statistics do not represent the factors’ independent contributions to the predictions.

Lastly, any prognostic model developed in a single population should be considered clinically usable only after it has been externally validated and, ideally, also evaluated for clinical impact [25].

Comparison with other studies

As already established, this is the first study aimed at predicting the outcome of conservative treatment with physiotherapy in adults with symptomatic PTTs. Comparison with studies of adults undergoing conservative treatment with physiotherapy for rotator cuff disorders in general would be uninformative because of heterogeneity, not least in methodological terms [28].


Conclusions

We could not determine a prognostic model with satisfactory performance and precision. Thus, the challenge remains to develop such a model for predicting the outcome of a phase of conservative treatment with physiotherapy in adults with symptomatic PTTs. Further high-quality prognostic studies are needed but should be underpinned, and thus preceded, by robust research aimed at improving knowledge of relevant factors. Consensus approaches (e.g. Delphi studies) may provide guidance about which factors to prioritize in future studies. Collaborative data collection and data sharing initiatives could facilitate larger studies and improve the applicability of their findings. Further methodological research is also needed to determine the optimal methods for developing prognostic models. Investigators of future prognostic model development studies should attend to the internal and external validation of any models with promising performance.



Abbreviations

  • AICC difference
  • Adjusted (in the context of this study: for regression to the mean, RTM)
  • Akaike’s Information Criterion
  • Akaike’s Information Criterion, small sample variant
  • Smallest AIC (AICc) value
  • Confidence interval
  • Deutsche Gesellschaft für Ultraschall in der Medizin [German Society for Ultrasound in Medicine]
  • Deutsche Vereinigung für Schulter- und Ellenbogenchirurgie [German Society of Shoulder and Elbow Surgery]
  • Full-thickness tear (rotator cuff)
  • Global Perceived Change
  • Long head of biceps
  • Minimal Important Difference
  • Magnetic resonance imaging
  • Pain Catastrophizing Scale
  • Patient-reported outcome measure
  • Partial-thickness tear (rotator cuff)
  • R2: Coefficient of (multiple) determination
  • Regression to the mean
  • Standard deviation
  • Standard error of the estimate
  • Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (checklist)
  • Visual analogue scale
  • Western Ontario Rotator Cuff Index
  • Baseline WORC
  • Follow-up WORC
  • Change of WORC score from baseline to follow-up


References

  1. Kooijman M, Swinkels I, van Dijk C, de Bakker D, Veenhof C. Patients with shoulder syndromes in general and physiotherapy practice: an observational study. BMC Musculoskelet Disord. 2013;14:128.

  2. Östör AJK, Richards CA, Prevost AT, Speed CA, Hazleman BL. Diagnosis and relation to general health of shoulder disorders presenting to primary care. Rheumatology. 2005;44:800–5.

  3. van der Windt DA, Koes BW, de Jong BA, Bouter LM. Shoulder disorders in general practice: incidence, patient characteristics, and management. Ann Rheum Dis. 1995;54:959–64.

  4. Cook JL, Purdam CR. Is tendon pathology a continuum? A pathology model to explain the clinical presentation of load-induced tendinopathy. Br J Sports Med. 2009;43:409–16.

  5. Reilly P, Macleod I, Macfarlane R, Windley J, Emery RJH. Dead men and radiologists don’t lie: a review of cadaveric and radiological studies of rotator cuff tear prevalence. Ann R Coll Surg Engl. 2006;88:116–21.

  6. Yamaguchi K, Ditsios K, Middleton WD, Hildebolt CF, Galatz LM. The demographic and morphological features of rotator cuff disease: a comparison of asymptomatic and symptomatic shoulders. J Bone Joint Surg Am. 2006;88:1699–704.

  7. Matava MJ, Purcell DB, Rudzki JR. Partial-thickness rotator cuff tears. Am J Sports Med. 2005;33:1405–17.

  8. Beaudreuil J, Bardin T, Orcel P, Goutallier D. Natural history or outcome with conservative treatment of degenerative rotator cuff tears. Joint Bone Spine. 2007;74:527–9.

  9. Hedtmann A. Weichteilerkrankungen der Schulter – Subakromialsyndrome. Orthopädie und Unfallchirurgie up2date. 2009;4:85–106.

  10. Finnan RP, Crosby LA. Partial-thickness rotator cuff tears. J Shoulder Elb Surg. 2010;19:609–16.

  11. Lenza M, Buchbinder R, Takwoingi Y, Johnston RV, Hanchard NC, Faloppa F. Magnetic resonance imaging, magnetic resonance arthrography and ultrasonography for assessing rotator cuff tears in people with shoulder pain for whom surgery is being considered. Cochrane Database Syst Rev. 2013;9:CD009020.

  12. Tashjian RZ. AAOS clinical practice guideline: optimizing the management of rotator cuff problems. J Am Acad Orthop Surg. 2011;19:380–3.

  13. Beaudreuil J, Dhénain M, Coudane H, Mlika-Cabanne N. Clinical practice guidelines for the surgical management of rotator cuff tears in adults. Orthop Traumatol Surg Res. 2010;96:175–9.

  14. Ryösä A, Laimi K, Äärimaa V, Lehtimäki K, Kukkonen J, Saltychev M. Surgery or conservative treatment for rotator cuff tear: a meta-analysis. Disabil Rehabil. 2017;39:1357–63.

  15. Colvin AC, Egorova N, Harrison AK, Moskowitz A, Flatow EL. National trends in rotator cuff repair. J Bone Joint Surg Am. 2012;94:227–33.

  16. Paloneva J, Lepola V, Äärimaa V, Joukainen A, Ylinen J, Mattila VM. Increasing incidence of rotator cuff repairs – a nationwide registry study in Finland. BMC Musculoskelet Disord. 2015;16:189.

  17. Svendsen SW, Frost P, Jensen LD. Time trends in surgery for non-traumatic shoulder disorders and postoperative risk of permanent work disability: a nationwide cohort study. Scand J Rheumatol. 2012;41:59–65.

  18. Ylinen J, Vuorenmaa M, Paloneva J, Kiviranta I, Kautiainen H, Oikari M, et al. Exercise therapy is evidence-based treatment of shoulder impingement syndrome. Current practice or recommendation only. Eur J Phys Rehabil Med. 2013;49:499–505.

  19. Butler M, Forte M, Braman J, Swiontkowski M, Kane RL. Nonoperative and operative treatments for rotator cuff tears: future research needs: identification of future research needs from comparative effectiveness review no. 22. Rockville: Agency for Healthcare Research and Quality (US). Report No.: 13-EHC050-EF. AHRQ Future Research Needs Papers. 2013.

  20. Croft P, Altman DG, Deeks JJ, Dunn KM, Hay AD, Hemingway H, et al. The science of clinical practice: disease diagnosis or patient prognosis? Evidence about “what is likely to happen” should shape clinical practice. BMC Med. 2015;13:20.

  21. Cochrane Prognosis Methods Group. 2018. Accessed 21 Apr 2018.

  22. PROGRESS. MRC Prognosis Research Strategy Partnership. 2018. Accessed 21 Apr 2018.

  23. Riley RD, Hayden JA, Steyerberg EW, Moons KGM, Abrams K, Kyzas PA, et al. Prognosis research strategy (PROGRESS) 2: prognostic factor research. PLoS Med. 2013;10:e1001380.

  24. Hemingway H, Croft P, Perel P, Hayden JA, Abrams K, Timmis A, et al. Prognosis research strategy (PROGRESS) 1: a framework for researching clinical outcomes. BMJ. 2013;346:e5595.

  25. Steyerberg EW, Moons KGM, van der Windt DA, Hayden JA, Perel P, Schroter S, et al. Prognosis research strategy (PROGRESS) 3: prognostic model research. PLoS Med. 2013;10:e1001381.

  26. Moons KGM, Royston P, Vergouwe Y, Grobbee DE, Altman DG. Prognosis and prognostic research: what, why, and how? BMJ. 2009;338:b375.

  27. Royston P, Moons KGM, Altman DG, Vergouwe Y. Prognosis and prognostic research: developing a prognostic model. BMJ. 2009;338:b604.

  28. Braun C, Hanchard NC, Batterham AM, Handoll HH, Betthäuser A. Prognostic models in adults undergoing physical therapy for rotator cuff disorders: systematic review. Phys Ther. 2016;96:961–71.

  29. Braun C. Predicting the outcome of physiotherapy in people with painful partial-thickness rotator cuff tears. A thesis submitted in partial fulfillment of the requirements of Teesside University for the award of the degree of Doctor of Philosophy (PhD) (2016). Accessed 21 Apr 2018.

  30. Collins GS, Reitsma JB, Altman DG, Moons KGM. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMJ. 2015;350:g7594.

  31. Roy J-S, Braën C, Leblond J, Desmeules F, Dionne CE, MacDermid JC, et al. Diagnostic accuracy of ultrasonography, MRI and MR arthrography in the characterisation of rotator cuff disorders: a systematic review and meta-analysis. Br J Sports Med. 2015;49:1316–28.

  32. Khan KM, Cook JL, Maffulli N, Kannus P. Where is the pain coming from in tendinopathy? It may be biochemical, not only structural, in origin. Br J Sports Med. 2000;34:81–3.

  33. DVSE. Untersuchungstechniken des Schultergelenks. Expertenevaluation auf der Basis einer Literaturanalyse. Obere Extremität. 2012;7(Suppl 1):3–68.

  34. Konermann W, Gruber G. Ultraschalldiagnostik der Bewegungsorgane. In: Kursbuch nach den Richtlinien der DEGUM und der DGOU. 2nd ed. Stuttgart: Thieme; 2007.

  35. Hedtmann A, Fett H. Schultersonographie bei Subakromialsyndromen mit Erkrankungen und Verletzungen der Rotatorenmanschette (Sonography of the shoulder in subacromial syndromes with diseases and injuries of the rotator cuff). Orthopade. 1995;24:498–508.

  36. Hedtmann A, Fett A. Sonographie der Rotatorenmanschette (Ultrasonographic diagnosis of the rotator cuff). Orthopade. 2002;31:236–46.

  37. Braun C, Hanchard NCA. Manual therapy and exercise for impingement-related shoulder pain. Phys Ther Rev. 2010;15:62–83.

  38. Braun C, Bularczyk M, Heintsch J, Hanchard NCA. Manual therapy and exercises for shoulder impingement revisited. Phys Ther Rev. 2013;18:263–84.

  39. Kirkley A, Alvarez C, Griffin S. The development and evaluation of a disease-specific quality-of-life questionnaire for disorders of the rotator cuff: the Western Ontario Rotator Cuff Index. Clin J Sport Med. 2003;13:84–92.

  40. Huber W, Hofstaetter JG, Hanslik-Schnabel B, Posch M, Wurnig C. Translation and psychometric testing of the Western Ontario Rotator Cuff Index (WORC) for use in Germany. Z Orthop Ihre Grenzgeb. 2005;143:453–60.

  41. Huang H, Grant JA, Miller BS, Mirza FM, Gagnier JJ. A systematic review of the psychometric properties of patient-reported outcome instruments for use in patients with rotator cuff disease. Am J Sports Med. 2015;43:2572–82.

  42. St-Pierre C, Desmeules F, Dionne CE, Frémont P, MacDermid JC, Roy J-S. Psychometric properties of self-reported questionnaires for the evaluation of symptoms and functional limitations in individuals with rotator cuff disorders: a systematic review. Disabil Rehabil. 2016;38:103–22.

  43. Linden A. Assessing regression to the mean effects in health care initiatives. BMC Med Res Methodol. 2013;13:119.

  44. Vittinghoff E, McCulloch CE. Relaxing the rule of ten events per variable in logistic and Cox regression. Am J Epidemiol. 2007;165:710–8.

  45. Altman DG, Royston P. The cost of dichotomising continuous variables. BMJ. 2006;332:1080.

  46. Royston P, Altman DG, Sauerbrei W. Dichotomizing continuous predictors in multiple regression: a bad idea. Stat Med. 2006;25:127–41.

  47. Anderson DR. Model based inference in the life sciences: a primer on evidence. New York: Springer Science + Business Media; 2008.

  48. Burnham KP, Anderson DR, Huyvaert KP. AIC model selection and multimodel inference in behavioral ecology: some background, observations, and comparisons. Behav Ecol Sociobiol. 2011;65:23–35.

  49. Burnham KP, Anderson DR. Model selection and multimodel inference. A practical information-theoretic approach. 2nd ed. New York: Springer; 2002.

  50. Miles J, Shevlin M. Applying regression & correlation. London: Sage Publications; 2001.

  51. Braun C, Handoll HH. Estimating the minimal important difference for the Western Ontario Rotator Cuff Index (WORC) in adults with shoulder pain associated with partial-thickness rotator cuff tears. Musculoskelet Sci Pract. 2018;35:30–3.

  52. Harrell FE, Lee KL, Mark DB. Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Stat Med. 1996;15:361–87.

  53. Flom PL, Cassell DL. Why stepwise and similar selection methods are bad, and what you should use. Statistics and Data Analysis, North East SAS Users Group (NESUG) Annual Conference 2007. 2007. Accessed 21 Apr 2018.

  54. Harrell FE. Regression modeling strategies. 1st ed. New York: Springer; 2001.

  55. Mohamadi A, Chan JJ, Claessen FMAP, Ring D, Chen NC. Corticosteroid injections give small and transient pain relief in rotator cuff tendinosis: a meta-analysis. Clin Orthop Relat Res. 2017;475:232–43.

  56. Sullivan MJL, Bishop SR, Pivik J. The Pain Catastrophizing Scale: development and validation. Psychol Assess. 1995;7:524–32.

  57. Meyer K, Sprott H, Mannion AF. Cross-cultural adaptation, reliability, and validity of the German version of the Pain Catastrophizing Scale. J Psychosom Res. 2008;64:469–78.



Acknowledgements

This study formed part of the work included in the PhD thesis (conferred December 2016) of CB at Teesside University, Middlesbrough, UK. We are grateful to all patients who participated in the study, and to all collaborating physiotherapists.

Availability of data and material

The datasets generated and/or analysed during the current study that involve patient data are not publicly available because this was not established, nor would it have been accepted, as part of the ethics application. Other datasets and study material that do not require participant consent for sharing are publicly available [29] and also available from the corresponding author on reasonable request.


Funding

This research did not receive any grant from funding agencies in the public, commercial or not-for-profit sectors.

Author information




Contributions

CB designed the study, managed the acquisition of data, analyzed and interpreted the data (with support from a statistician, who had access to the full dataset) and drafted and edited the manuscript. NCH and HHH made substantial contributions to the design of the study, the analysis and interpretation of the data, revised the manuscript for important intellectual content, and contributed to writing and editing. AB made substantial contributions to the design of the study and to the acquisition of data, contributed to the interpretation of the data, and revised the manuscript for important intellectual content. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Cordula Braun.

Ethics declarations

Ethics approval and consent to participate

The study was approved by the Teesside University School of Health and Social Care Research Governance & Ethics Committee and the Ethics Commission of the Hamburg Medical Council (Germany). Written informed consent, a prerequisite for study participation, was obtained from all participants.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1: Primary study reports and other articles used to identify prognostic factors (DOCX 21 kb)

Additional file 2: Factors considered for inclusion in the study (DOCX 20 kb)

Additional file 3: Model coefficient statistics (DOCX 21 kb)

Rights and permissions

Open Access. This article is distributed under the terms of the Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver applies to the data made available in this article, unless otherwise stated.


About this article


Cite this article

Braun, C., Hanchard, N.C., Handoll, H.H. et al. Predicting the outcome of conservative treatment with physiotherapy in adults with shoulder pain associated with partial-thickness rotator cuff tears – a prognostic model development study. BMC Musculoskelet Disord 19, 329 (2018).



Keywords

  • Shoulder pain
  • Rotator cuff
  • Conservative treatment
  • Physical therapy
  • Prognosis
  • Prognostic model development