Skip to main content
  • Research article
  • Open access
  • Published:

Prognostic factors of a favorable outcome following a supervised exercise program for soldiers with sub-acute and chronic low back pain



Low back pain (LBP) encompasses heterogeneous patients unlikely to respond to a unique treatment. Identifying sub-groups of LBP may help to improve treatment outcomes. This is a hypothesis-setting study designed to create a clinical prediction rule (CPR) that will predict favorable outcomes in soldiers with sub-acute and chronic LBP participating in a multi-station exercise program.


Military members with LBP participated in a supervised program comprising 7 stations each consisting of exercises of increasing difficulty. Demographic, impairment and disability data were collected at baseline. The modified Oswestry Disability Index (ODI) was administered at baseline and following the 6-week program. An improvement of 50% in the initial ODI score was considered the reference standard to determine a favorable outcome. Univariate associations with favorable outcome were tested using chi-square or paired t-tests. Variables that showed between-group (favorable/unfavorable) differences were entered into a logistic regression after determining the sampling adequacy. Finally, continuous variables were dichotomized and the sensitivity, specificity and positive and negative likelihood ratios were determined for the model and for each variable.


A sample of 85 participants was included in analyses. Five variables contributed to prediction of a favorable outcome: no pain in lying down (p = 0.017), no use of antidepressants (p = 0.061), FABQ work score < 22.5 (p = 0.061), fewer than 5 physiotherapy sessions before entering the program (p = 0.144) and less than 6 months’ work restriction (p = 0.161). This model yielded a sensitivity of 0.78, specificity of 0.80, LR+ of 3.88, and LR- of 0.28. A 77.5% probability of favorable outcome can be predicted by the presence of more than three of the five variables, while an 80% probability of unfavorable outcome can be expected if only three or fewer variables are present.


The use of prognostic factors may guide clinicians in identifying soldiers with LBP most likely to have a favorable outcome. Further validation studies are needed to determine if the variables identified in our study are treatment effect modifiers that can predict success following participation in the multi-station exercise program.

Trial registration Identifier: NCT03464877 registered retrospectively on 14 March 2018.

Peer Review reports


Attempts to treat low back pain (LBP) have typically had limited success [1,2,3,4]. These difficulties are believed to be due, in part, to the fact that the cause of the pain is rarely known, leaving approximately 85% of individuals with LBP to be included in a “non-specific LBP” group [4, 5]. However, LBP is a complex condition influenced by a range of psychosocial, physical and activity-related factors that differ from person to person [6,7,8,9], rendering the non-specific LBP group heterogeneous, and unlikely to respond to the same treatment approach. It has thus been suggested that classification of LBP subjects into subgroups, followed by specific treatment of each subgroup according to their needs, is necessary in order to efficiently treat this condition [9]. Different classification systems, in which LBP subjects are assigned according to personal and clinical characteristics, have therefore been created and modified over the past 20 years [10, 11]. However, more conclusions on how to classify LBP sufferers and how to treat each subgroup are needed [12, 13]. Recent literature reviews still examine studies implementing interventions in non-specific, unclassified LBP populations. The most recent reviews evaluating the efficiency of exercise programs in the treatment of chronic LBP, for example, did not discuss LBP subgroups but treated the subjects as a homogenous population [4, 14]. Searle et al.’s review [4] confirmed the previously-held belief that stabilization and strengthening exercise programs decrease chronic LBP, but with only small effects. It remains to be examined whether division of the LBP population into subgroups will allow researchers and clinicians to obtain larger treatment effects when using exercise program interventions [2, 4, 15].

In order to accurately classify LBP patients into subgroups to which a specific intervention is assigned, valid clinical prediction rules (CPR) need to be developed [16, 17]. These rules are sets of data obtained from the history and physical evaluation of each patient, as well as from auto-administered questionnaires, that indicate which subgroup of patients will likely respond to a specific intervention. In 2011, a multi-station full-body supervised exercise program was created for soldiers with LBP and implemented at a military base of the Canadian Armed Forces. Although the program has enjoyed some clinical success, as subjectively reported by a number of soldiers, it is of great importance to maximize the efficacy of the program by determining the LBP subgroup most susceptible to benefit from it.

The development of CPR typically follows a three-step process: derivation, validation and impact analysis [16, 18, 19]. Derivation is the early hypothesis generation step in which prognostic factors are identified from a set of clinical variables that are believed to have predictive value, based on clinical experience or previous research. Kent et al. [20] recommended subdividing the derivation step into two sub-steps: hypothesis-setting, which represents the exploratory phase in which data from cohort studies may generate hypotheses about potential treatment effect modifiers, and hypothesis-testing, which is the confirmation sub-step in which pre-specified (a priori) hypotheses are tested more rigorously on subgrouping effects in samples of people similar to those who participated in the hypothesis-setting study. Subsequently, a validation step is required to identify treatment effect modifiers by testing subgroups/treatment interaction effects in randomized controlled trials involving samples of people with characteristics that are similar to (narrow validation) or different from (broad validation) those in the hypothesis-testing study. Lastly, the impact analysis aims to verify the usefulness of the CPR in improving outcomes and patient satisfaction and in decreasing costs once it is implemented in clinical practice.

According to this development framework, the present study is at the early hypothesis-setting step. Our objective was therefore to identify variables associated with a favorable outcome in soldiers with sub-acute and chronic LBP participating in a multi-station full-body supervised exercise program. The results obtained may permit generation of potential treatment effect modifiers that will eventually have to be validated before being recommended for clinical practice.



Military members consulting at Valcartier Health Centre for LBP were consecutively recruited. To be included, potential participants had to be aged 18 and older and present with an episode of subacute or chronic LBP with or without radiation to the lower limbs. Potential participants were also required to have a minimal score of 17% on the Modified Oswestry Disability Index (ODI) at the initial evaluation (based on the clinically important difference of this test [21]). Patients were excluded in the following circumstances: previous surgery to the spinal column, lumber spine injection in the past two weeks, signs of upper motor neuron lesions (bilateral paresthesia, hyperreflexia or spasticity), serious medical conditions (e.g. tumor, fracture, rheumatoid arthritis, osteoporosis) and unavailability to participate in the 6-week exercise program. Patients admitted to the clinic with acute LBP (e.g. onset of constant and intense pain [> 5/10] < 7 days, severely limited lumbar range of motion [more than 50% in at least 2 directions], obvious lateral shift) were first treated by their physiotherapist and then referred to the project coordinator, once the risk of harm associated with participation in the program was deemed low (e.g. when the indicators of acute LBP were no longer present). Participation in the study was voluntary, and informed consent forms were signed by all subjects. This study was approved by the ethics committee of the CIUSSS de la Capitale-Nationale (Quebec Rehabilitation Institute).

Previous CPR developed for rehabilitation programs in LBP populations [15, 22, 23] included 5 predictors or fewer in their final model. According to the formula suggested by Green [24] for regression analysis (50 + [8 x number of prognostic factors]), and considering the expected 5 variables in the final model, as well as an estimated 20% dropout rate, the ideal sample size was 108.

Study design

All participants took part in the 6-week exercise program, as well as in the two evaluation sessions (pre- and post- exercise program). At the initial evaluation, subjects completed forms and questionnaires on sociodemographics, symptomatology, comorbidities, work restrictions, pain and functional limitations and fear-avoidance beliefs. A physiotherapist measured their lumbar and hip mobility, conducted diagnostic and pain provocation tests and assessed endurance of the trunk muscles. Following the initial evaluation, subjects took part in the 6-week multi-station full-body supervised exercise program (2 to 3 sessions per week). The ODI was completed at the initial and at the final evaluations. The change in ODI score following the program was considered the principal measure reflecting favorable or unfavorable outcome.

Evaluations and variables

The initial evaluation was carried out by four experienced physiotherapists who completed a three-hour training session in order to standardize the evaluation protocol. Selection of the variables included in the clinical examination was based on the results of previous studies that aimed to develop preliminary CPR in patients with LBP [15, 22, 23]. The following clinical variables were collected.

Standardized subjective clinical questionnaire

This questionnaire documented participants’ personal and occupational characteristics, personal medical history, current and past episodes of LBP, a detailed description of their symptoms (low back and lower limb pain mapping), pain behaviour (aggravating or relieving factors such as sitting, bending, standing, supine, walking, lifting) [25] and other characteristics such as work restrictions and number of treatments received before the initial evaluation.

Modified Oswestry disability index

The ODI is a self-administered questionnaire whose purpose is to evaluate the severity of the limitations and restrictions suffered by patients with LBP. It consists of 10 items that assess the interference of LBP with activities of daily living. Each item is scored on a categorical scale from 0 to 5, with higher values representing greater disability. The score out of 50 is multiplied by two and expressed as a percentage. The reliability (ICC = 0.86), construct validity and sensitivity to change of this questionnaire have previously been demonstrated [26]. The minimal detectable change in persons with non-specific LBP is 10 points [21, 27, 28], while the minimal clinically important difference ranges from 10 to 19 [21, 26, 29]. Patients who experienced an improvement of at least 50% on the ODI were categorized as having a favorable outcome [15, 22, 23].

Fear-avoidance beliefs questionnaire (FABQ)

The FABQ is a self-administered questionnaire that consists of 16 questions pertaining to patients’ beliefs regarding the effect of their physical activities and work on their LBP [30]. It comprises a physical activity subscale (5 questions) and a work subscale (11 questions). Each item is scored from 0 to 6, with higher scores indicating higher levels of fear-avoidance behavior. Moderate to high test-retest reliability was found respectively for the Physical Activity subscale (ICC = 0.72 to 0.90) and the Work subscale (ICC = 0.8 to 0.91) [31].

Numeric pain rating scale (NPRS)

This tool was used to document the worst and average LBP experienced by participants in the 48 h preceding the initial evaluation. Pain intensity was rated on a scale from 0 to 10, where 10 is the worst pain imaginable. Its reliability is moderate (ICC = 0.76) [32].

Lumbar and hip mobility

Lumbar range of motion (ROM) was evaluated by measuring the distance between the third fingertip and the floor in trunk forward and lateral flexions (ICC = 0.91–0.99) [33]. The single inclinometer method was used to measure trunk extension (ICC = 0.61–0.95) [34]. Passive hip ROM was measured bilaterally using a goniometer or an inclinometer (ICC = 0.56–0.99) [35] and included internal rotation, external rotation, flexion and extension.

Signs of instability

Aberrant lumbar movement patterns during forward flexion that are believed to be associated with instability were documented. These patterns include a painful arc, Gower’s sign (thigh climbing), brisk movement (instability catch) or an inverted lumbar pelvic rhythm [36].

The response to the following clinical screening, diagnostic or performance tests was also documented: 1) The Straight Leg Raise, used to screen for herniated discs and to verify neural sensitivity (ICC = 0.93–0.97) [37]. The amplitude of the leg raise was measured with an inclinometer placed on the subject’s tibia. 2) The Biering-Sorensen, McGill and lateral plank tests, designed to measure muscle endurance in trunk extension, flexion and lateral flexion (ICC = 0,89–0.98) [33], respectively. 3) The Prone Instability Test, used to identify patients with lumbar instability, and which has been shown to have predictive value for the stabilization group of the Treatment-Based Classification (TBC) [11] (ICC = 0.74–1.00) [36]. 4) Central posterior to anterior pressure techniques were executed to evaluate pain provocation and segmental mobility of the lumbar spine in prone position. Segment mobility was classified as hypomobile, normal or hypermobile by applying a posterior to anterior force on the spinous process of each lumbar vertebra with the hypothenar eminence. Pain was rated as present or absent. The interrater reliability of this technique is good for the identification of the least mobile segment (agreement = 82.8%, kappa = 0.71, 95% CI = 0.48 to 0.94), but poor for determining the most mobile segment (kappa = 0.29, 95% CI = − 0.13 to 0.71) [33]. The reliability of this procedure seems higher to assess pain provocation than segmental mobility [38].


Each participant took part in the 6-week multi-station full-body supervised exercise program, which consisted of two to three sessions per week (45 to 60 min each) and was supervised by a physiotherapist. The exercise program is composed of 7 stations, each consisting of numerous exercises of increasing difficulty. The exercises were grouped together as follows: Hip strengthening and control (Station 1); The squat and its variants (Station 2); Elastic bands and the Bodyblade (Station 3); Abdominal planks and their variants (Station 4); Abdominal strengthening (Station 5); Back extensor strengthening (Station 6); and Lifting techniques (Station 7). Each of the exercises in the program is shown in an Additional file 1. The basic principal applied for all exercises was the maintenance of natural lordosis, regardless of the weight or external forces imposed on the body. Exercise parameters were chosen to increase strength, endurance and neuromuscular control. In accordance with recognized motor learning principles [39], participants were encouraged to complete a large variety of exercises and to focus on the quality, rather than the quantity, of their movements. Progression in the program led to the execution of exercises that simulated functional and occupational tasks (task-oriented approach). The initial difficulty level and choice of exercises were determined by the supervising physiotherapist according to 3 principal criteria: severity of the condition (constant or non-constant pain, disturbed sleep, limitation and restriction level according to the ODI results), the most limited plane of motion (as the prescribed exercises were primarily carried out in the planes of motion that showed limited mobility or aberrant movements) and the quality of exercise execution. The physiotherapist adjusted the difficulty level of the exercises in order to prompt maximal effort without jeopardizing the quality of the movements. Exercises were paused or stopped when fatigue led to the deterioration of the quality of movements. See the summary table of principles on exercise selection and progression in the Additional file 1. Individual interventions such as manual therapy and other physiotherapy treatment modalities were not performed during the course of the study.

Data analysis

Based on their favorable (50% improvement in the ODI score) or unfavorable outcome following participation in the program, subjects were classified into two subgroups. A first screening of the potential prognostic factors of outcome was done using univariate analyses by looking at statistical differences between groups on characteristics, questionnaires and physical examination data at baseline (Tables 1, 2 and 3). Independent t-tests (continuous variables) and chi-square tests (categorical variables) were used. A potential prognostic factor was retained when the p-value was lower or equal to 0.1.

Table 1 Baseline participants’ characteristics
Table 2 Baseline participants’ questionnaire scores
Table 3 Performance in the clinical tests at baseline

Before entering the potential prognostic factors into a multiple logistic regression, the Kaiser-Meyer-Olkin (KMO) index was calculated on the set of variables that included all the potential prognostic factors and the group variable. The KMO index and measures of sampling adequacy (MSA) were used to determine collinearity. When the KMO index is greater or equal to 0.6, the logistic regression may be performed with all the potential prognostic factors. When the MSA are below 0.6, the associated potential prognostic factor must be removed.

The first multiple logistic regression included all valid prognostic factors. Considering that the useful information brought by one prognostic factor may be entirely offered by another, prognostic factors with very high p-values (p > .50) were removed from the model even though they had significant p-values with the univariate tests. The aim was to obtain the most efficient prognostic factors. A second multiple logistic regression was then calculated on a model that included only prognostic factors that had p-values below 0.50. From a clinical perspective, continuous factors are often impractical. Discrete criteria are better suited to clinical use. For this reason, the continuous variables retained by the second model were dichotomized based on thresholds determined by recursive partitioning analysis (SPSS, proc. TREE). Then, a third and final multiple logistic regression was calculated with the dichotomized version of the set of prognostic factors from the second model. The anti-image and inverse of the correlation matrix, drawn from factor analyses with and without the dependent variable, were used to test for collinearity between variables included in the model. Finally, the sensitivity, specificity and the positive and negative likelihood ratios (LR+ and LR-, respectively) were determined for each variable and for the overall regression model. All analyses were conducted with SPSS software (Version 23; IBM SPSS Statistics for Mac. Armonk, NY: IBM Corp) except for the sensitivity, specificity and likelihood ratios (with 95% confidence intervals) which were estimated with the package epiR (Tools for the Analysis of Epidemiological Data, a package available for the 3.3.1 version of the R statistical software).


Of the 104 individuals that were recruited, nineteen (18.3%) did not complete the 6-week program or did not take part in the final evaluation. Therefore, a constant sample of 85 participants (81.7%) was included in all analyses. The reasons given by the individuals for dropping out of the study were: difficulties in coming to the Valcartier Health Centre due to military duties (n = 2), medical reasons (n = 1), new employment outside of the military (n = 2), prolonged absence for a military exercise or mission (n = 3) and personal or unknown reasons (n = 11). The individuals who dropped out had higher scores on the numerical pain rating scale evaluating worst and mean LBP perceived in the last 48 h when compared to the participants included in the final analyses (p < 0.05). No other differences were found. Subjects retained for analyses participated in 14.4 ± 2.6 sessions on average during the 6-week program (a mean of 2.34 sessions/week).

The mean baseline ODI score for the whole sample was 32.1 ± 12.1, considered as a moderate disability level [26]. Forty participants (47%) were categorized as having favorable outcome based on the ODI criterion, and 45 participants (53%) as unfavorable. The mean ODI change was 23.9 ± 11.0 for the group with favorable outcome and 3.4 ± 10.3 for the group with unfavorable outcome.

Tables 1, 2 and 3 present the clinical variables at baseline for the whole sample, as well as for both groups with favorable/unfavorable outcome. Notably, 74 % of the participants had previous episodes of LBP, and the time elapsed since the last episode varied from 1 to 200 months (mean 16.3 ± 32.2 months). Thirty-two percent (n = 27) had referred pain to the lower limbs. Finally, 51.8% of the participants (n = 44) had work restrictions.

The univariate tests identified seven potential prognostic factors with a KMO index of .692. As all MSA were above .600, no potential prognostic factors needed to be removed. The first multiple logistic regression indicated that two prognostic factors (work restrictions because of LBP and worst LBP perceived in the last 48 h) brought no unique information, with p-values respectively of .843 and .813, and were thus removed from the model. The second set of five factors with the group variable had a KMO index of .635. Although two prognostic factors had a MSA value very slightly below .6, all were kept in the model as these MSA were above .6 when the seven prognostic factors were used. [40] Furthermore, when the prognostic factors were dichotomized (see below), these MSA rose respectively to .60 and .62, and all five MSA were above or equal to .6. The model of the second multiple logistic regression had a highly significant (p = .00011) moderate capacity (Nagelkerke R2 = .412) to predict the program outcome (favorable outcome: 86% correct; unfavorable outcome: 70% correct; global: 78% correct). This set of five variables included: (1) “no pain in lying down” [dichotomous], (2) “no use of antidepressants” [dichotomous], (3) “FABQ Work subscale” [continuous], (4) “number of treatments received before the first evaluation” [continuous], and (5) “no work restriction of 6 months or more” [dichotomous]. Individually, only variables 1 and 4 had p-values below .05 (respectively of .017 and .030). Since we were at the hypothesis-setting step of CPR development and since it is possible that beyond our sample these variables contain unique information that can predict the favorable or unfavorable outcomes, we decided to continue with the full set of five variables.

The FABQ Work subscale was dichotomized with the help of a recursive partitioning technique that indicated an optimal cutoff point of 22.5. Then, the potential dichotomized prognostic factor of a favorable outcome was set at a score below 22.5 on the FABQ Work subscale. The recursive partitioning technique did not successfully dichotomize the number of previous treatments. Therefore, this variable was dichotomized based on the threshold that led to the model yielding the best accuracy in predicting outcome. A criterion of 4 previous treatments or fewer was set as a potential dichotomized prognostic factor of favorable outcome.

The results of the third and final multiple logistic regression are shown in Tables 4 and 5. The model has a significant (p = .00004) moderate capacity (Nagelkerke R2 = .370) to predict the program outcome (favorable outcome: 78% correct; unfavorable outcome: 80% correct; global: 79% correct). Furthermore, as the five variables had an odds ratio above 2.0 in this final regression, we decided to retain all variables in the final model. Even though the Nagelkerke R2 may be lower, the outcome classification appears almost identical if not very slightly better. There was no sign of multicollinearity between variables included in the model. As seen in Table 4, only the first variable has a p-value below .05 (p = .017; Odds ratio = 3.7 [1.3–10.6]). It is also noteworthy that two variables have p-values of .061: (a) “no use of anti-depressants” (Odds ratio: 5.2 [0.9–29.4]), and (b) “FABQ Work below 22.5” (Odds ratio: 2.9 [0.9–8.6]).

Table 4 Predictive capacity of individual variables obtained in the multiple logistic regression
Table 5 Predictive capacity according to the number of criteria present

In sum, for a patient or client presenting with all five criteria, the prognostic factors have low sensitivity and high specificity. The sensitivity improves to 0.78 for individuals presenting at least four of the five criteria. However, this increase in sensitivity comes at the expense of a decrease in specificity. Table 6 reports the prediction rate of a favorable outcome according to the number of criteria fulfilled. The satisfaction of four out of five criteria appears to be the best compromise between sensitivity and specificity.

Table 6 Prediction rate of a favorable outcome according to the number of criteria fulfilled


This study identified five variables of a favorable outcome in patients with LBP who participated in a multi-station full-body supervised exercise program. Clinicians may anticipate a favorable outcome when their initial assessment identifies 4 or 5 variables included in our model. On the other hand, a less favorable outcome is expected if 3 or fewer variables are present, since the expected failure rate would be 80%. It is to be noted that these findings were established for a 50% improvement on the ODI and different variables would likely have been obtained had a different ODI threshold or other outcomes been used. Furthermore, the LR+ (3.9) and LR- (0.28) were relatively low [35], suggesting that the predictive capability of the model is limited. In comparison, Hicks et al. [15] found similar LR+ (LR+ 4.0), but higher LR- (LR- 0.18), with their preliminary CPR. Rabin et al. could not, however, validate this preliminary CPR in a subsequent study [13]. In contrast, Stolze et al. [23] reported a LR+ of 10.6 (95% confidence interval [CI]: 3.52, 32.14) in a CPR deviation study created to generate potential treatment effect modifiers of pilates exercises.

Three of the five variables included in our model had already been identified as prognostic factors of favorable outcomes for individuals with LBP in previous studies. We found that a score below 22.5 on the FABQ work subscale predicted a favorable outcome, which is in agreement with previous studies in which fears and beliefs about work were associated with poor recovery in patients with work-related LBP [41]. The FABQ work score was not identified as a predictor in CPRs developed by Hicks et al. [15] and Stolze et al.[23]. However, a score of less than 19 on the FABQ work subscale was considered a treatment effect modifier in the CPR developed by Flynn et al. [22], later validated by Childs et al., [42] whose aim was to identify patients likely to benefit from lumbar manipulation. As in the current study, participants in Flynn and Child’s studies were recruited from care facilities within the military, suggesting that fear-avoidance beliefs about military tasks may be of particular concern in soldiers with LBP. We also found that patients who do not use antidepressant medication were more likely to have a better outcome. This finding suggests that patients requiring antidepressants had less favorable outcomes, an observation which concords with the literature showing that patients with LBP in a depressive state are predisposed to longer recovery time and to the development of chronicity [43, 44]. A third prognostic factor of outcome in our model was “no work restriction of six months or more”. In the military context, a work restriction of fewer than six months should be interpreted as a temporary modification of regular duties for a non-chronic health condition, in contrast to a six-month medical category which is attributed to people for whom slow recovery or permanent health problems are anticipated. This finding is in agreement with previous studies that identified prolonged absence from work and the number of days of reduced activity as predictors of slower recovery in patients with LBP [41, 43]. “No pain when lying down” and “having had fewer than five physiotherapy treatments before the baseline evaluation”, that were both identified as prognostic factors in our model, have, to our knowledge, never been associated with the outcome of LBP. It is to be noted that the cutoff criteria for two of the prognostic factors (FABQ Work subscale and number of previous treatments) are actually tied to our sample and must be confirmed by future studies.

This single-arm study was the preliminary step of a process aiming to develop a CPR that can be used to identify patients most likely to benefit from the proposed exercise program. Three of the five variables identified in our final model are generally recognized as prognostic factors in people with LBP and only one of these has been validated as a treatment effect modifier within a CPR aiming to predict patients likely to benefit from lumbar manipulation. The methodological approach used in our study was deliberately liberal. To minimize the possibility of missing potential prognostic factors, we set the p value for retaining variables at 0.1 and, as suggested in hypothesis-setting studies, [20] we did not perform corrections for multiple comparisons. Investigating a high number of variables (more than 50) in a relatively small sample, as we did in the present study, increases the likelihood of finding significant associations by chance (type 1 error), which can be considered a limitation. However, only one prognostic factor was below the generally accepted level of significance of 0.05 and our final model has only a limited predictive capability. Thus, before continuing to the validation step, our results need to be confirmed in a hypothesis-testing study in which a limited number of a priori hypotheses will be tested and appropriate adjustments for multiple comparisons will be made. Any subsequent validation study should also be conducted with non-military groups (broad validation), as the present results may not be generalizable to the greater population. On the other hand, the well-standardized multivariate protocol and the acceptable dropout rate represent strengths of this study. Finally, the targeted sample size of 90 participants to be included in the statistical analyses was not met, as only 85 of the 104 participants took part in both evaluation sessions.


The present study established five variables to identify patients most likely to have a favorable outcome, regardless of their participation in the exercise program. Careful use of these variables is mandatory for clinical purposes as this study is at the early stage of CPR development. Future validation studies should be carried out with other populations to confirm this CPR and subsequently, to verify whether some of these factors may be considered treatment effect modifiers.



Confidence interval


Clinical prediction rule


Fear-avoidance beliefs questionnaire


Intraclass correlation coefficient




Low back pain


Likelihood ratio


Measure of sampling adequacy


Numeric pain rating scale


Modified Oswestry disability index


Lumbar range of motion


Treatment-based classification


  1. Bogduk N. Management of chronic low back pain. Med J Aust. 2004;180(2):79–83.

    PubMed  Google Scholar 

  2. Geneen LJ, Moore RA, Clarke C, Martin D, Colvin LA, Smith BH. Physical activity and exercise for chronic pain in adults: an overview of Cochrane reviews. Cochrane Database Syst Rev. 2017;4:CD011279.

    PubMed  Google Scholar 

  3. Rubinstein SM, van Middelkoop M, Assendelft WJ, de Boer MR, van Tulder MW. Spinal manipulative therapy for chronic low-back pain: an update of a Cochrane review. Spine (Phila Pa 1976). 2011;36(13):E825–46.

    Article  Google Scholar 

  4. Searle A, Spink M, Ho A, Chuter V. Exercise interventions for the treatment of chronic low back pain: a systematic review and meta-analysis of randomised controlled trials. Clin Rehabil. 2015;29(12):1155–67.

    Article  PubMed  Google Scholar 

  5. Deyo RA, Phillips WR. Low back pain. A primary care challenge. Spine (Phila Pa 1976). 1996;21(24):2826–32.

    Article  CAS  Google Scholar 

  6. Huijnen IP, Rusu AC, Scholich S, Meloto CB, Diatchenko L. Subgrouping of low back pain patients for targeting treatments: evidence from genetic, psychological, and activity-related behavioral approaches. Clin J Pain. 2015;31(2):123–32.

    Article  PubMed  Google Scholar 

  7. Pincus T, Burton AK, Vogel S, Field APA. Systematic review of psychological factors as predictors of chronicity/disability in prospective cohorts of low back pain. Spine (Phila Pa 1976). 2002;27(5):E109–20.

    Article  Google Scholar 

  8. Ramond A, Bouton C, Richard I, Roquelaure Y, Baufreton C, Legrand E, Huez JF. Psychosocial risk factors for chronic low back pain in primary care--a systematic review. Fam Pract. 2011;28(1):12–21.

    Article  PubMed  Google Scholar 

  9. Vassilaki M, Hurwitz EL. Insights in public health: perspectives on pain in the low back and neck: global burden, epidemiology, and management. Hawaii J Med Public Health. 2014;73(4):122–6.

    PubMed  PubMed Central  Google Scholar 

  10. Delitto A, Erhard RE, Bowling RW. A treatment-based classification approach to low back syndrome: identifying and staging patients for conservative treatment. Phys Ther. 1995;75(6):470–85. discussion 485-479

    Article  CAS  PubMed  Google Scholar 

  11. Fritz JM, Cleland JA, Childs JD. Subgrouping patients with low back pain: evolution of a classification approach to physical therapy. J Orthop Sports Phys Ther. 2007a;37(6):290–302.

    Article  PubMed  Google Scholar 

  12. Fritz JM, Lindsay W, Matheson JW, Brennan GP, Hunter SJ, Moffit SD, Swalberg A, Rodriquez B. Is There a subgroup of patients with low back pain likely to benefit from mechanical traction? Results of a randomized clinical trial and subgrouping analysis. Spine (Phila Pa 1976). 2007b;32(26):E793–800.

    Article  Google Scholar 

  13. Rabin A, Shashua A, Pizem K, Dickstein R, Dar G. A clinical prediction rule to identify patients with low back pain who are likely to experience short-term success following lumbar stabilization exercises: a randomized controlled validation study. J Orthop Sports Phys Ther. 2014;44(1):6–B13.

    Article  PubMed  Google Scholar 

  14. Wang XQ, Zheng JJ, Yu ZW, Bi X, Lou SJ, Liu J, Cai B, Hua YH, Wu M, Wei ML, et al. A meta-analysis of core stability exercise versus general exercise for chronic low back pain. PLoS One. 2012;7(12):e52082.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. Hicks GE, Fritz JM, Delitto A, McGill SM. Preliminary development of a clinical prediction rule for determining which patients with low back pain will respond to a stabilization exercise program. Arch Phys Med Rehabil. 2005;86(9):1753–62.

    Article  PubMed  Google Scholar 

  16. Childs JD, Cleland JA. Development and application of clinical prediction rules to improve decision making in physical therapist practice. Phys Ther. 2006;86(1):122–31.

    Article  PubMed  Google Scholar 

  17. Laupacis A, Sekar N, Stiell IG. Clinical prediction rules. A review and suggested modifications of methodological standards. JAMA. 1997;277(6):488–94.

    Article  CAS  PubMed  Google Scholar 

  18. Haskins R, Osmotherly PG, Rivett DA. Validation and impact analysis of prognostic clinical prediction rules for low back pain is needed: a systematic review. J Clin Epidemiol. 2015;68(7):821–32.

    Article  PubMed  Google Scholar 

  19. McGinn TG, Guyatt GH, Wyer PC, Naylor CD, Stiell IG, Users RWS. Guides to the medical literature: XXII: how to use articles about clinical decision rules. Evidence-based medicine working group. JAMA. 2000;284(1):79–84.

    Article  CAS  PubMed  Google Scholar 

  20. Kent P, Keating JL, Leboeuf-Yde C. Research methods for subgrouping low back pain. BMC Med Res Methodol. 2010;10:62.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Maughan EF, Lewis JS. Outcome measures in chronic low back pain. Eur Spine J. 2010;19(9):1484–94.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Flynn T, Fritz J, Whitman J, Wainner R, Magel J, Rendeiro D, Butler B, Garber M, Allison SA. Clinical prediction rule for classifying patients with low back pain who demonstrate short-term improvement with spinal manipulation. Spine (Phila Pa 1976). 2002;27(24):2835–43.

    Article  Google Scholar 

  23. Stolze LR, Allison SC, Childs JD. Derivation of a preliminary clinical prediction rule for identifying a subgroup of patients with low back pain likely to benefit from Pilates-based exercise. J Orthop Sports Phys Ther. 2012;42(5):425–36.

    Article  PubMed  Google Scholar 

  24. Green SB. How many subjects does it take to do a regression analysis. Multivariate Behav Res. 1991;26(3):499–510.

    Article  CAS  PubMed  Google Scholar 

  25. Goodman Cavallaro C, Snyder Kelly TE. Differential diagnosis for physical therapists: screening for referral. 5th ed. St-Louis: Elsevier-Saunders; 2013.

    Google Scholar 

  26. Smeets R, Koke A, Lin CW, Ferreira M, Demoulin C. Measures of function in low back pain/disorders: low back pain rating scale (LBPRS), Oswestry disability index (ODI), progressive Isoinertial lifting evaluation (PILE), Quebec back pain disability scale (QBPDS), and Roland-Morris disability questionnaire (RDQ). Arthritis Care Res (Hoboken). 2011;63(Suppl 11):S158–73.

    Article  Google Scholar 

  27. Fritz JM, Irrgang JJ. A comparison of a modified Oswestry low back pain disability questionnaire and the Quebec back pain disability scale. Phys Ther. 2001;81(2):776–88.

    Article  CAS  PubMed  Google Scholar 

  28. Ostelo RW, Deyo RA, Stratford P, Waddell G, Croft P, Von Korff M, Bouter LM, de Vet HC. Interpreting change scores for pain and functional status in low back pain: towards international consensus regarding minimal important change. Spine (Phila Pa 1976). 2008;33(1):90–4.

    Article  Google Scholar 

  29. Davidson M, Keating JL. A comparison of five low back disability questionnaires: reliability and responsiveness. Phys Ther. 2002;82(1):8–24.

    Article  PubMed  Google Scholar 

  30. Waddell G, Newton M, Henderson I, Somerville D, Main CJA. Fear-avoidance beliefs questionnaire (FABQ) and the role of fear-avoidance beliefs in chronic low back pain and disability. Pain. 1993;52(2):157–68.

    Article  CAS  PubMed  Google Scholar 

  31. Williamson E. Fear Avoidance Beliefs Questionnaire (FABQ). Aust J Physiother. 2006;52(2):149.

    Article  PubMed  Google Scholar 

  32. Cleland JA, Childs JD, Whitman JM. Psychometric properties of the neck disability index and numeric pain rating scale in patients with mechanical neck pain. Arch Phys Med Rehabil. 2008;89(1):69–74.

    Article  PubMed  Google Scholar 

  33. Lindell O, Eriksson L, Strender LE. The reliability of a 10-test package for patients with prolonged back and neck pain: could an examiner without formal medical education be used without loss of quality? A methodological study. BMC Musculoskelet Disord. 2007;8:31.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Fritz JM, Brennan GP, Clifford SN, Hunter SJ, Thackeray A. An examination of the reliability of a classification algorithm for subgrouping patients with low back pain. Spine (Phila Pa 1976). 2006;31(1):77–82.

    Article  Google Scholar 

  35. Cleland JA, Koppenhaver S. Netter's orthopaedic clinical examination: an evidence basec approach. 2nd ed. Philadelphia: Saunders-Elsevier; 2011.

    Google Scholar 

  36. Hicks GE, Fritz JM, Delitto A, Mishock J. Interrater reliability of clinical examination measures for identification of lumbar segmental instability. Arch Phys Med Rehabil. 2003;84(12):1858–64.

    Article  PubMed  Google Scholar 

  37. Neto T, Jacobsohn L, Carita AI, Oliveira R. Reliability of the Active-Knee-Extension and Straight-Leg-Raise Tests in Subjects With Flexibility Deficits. J Sport Rehabil. 2015;Technical Notes;17:2014–0220.

    Google Scholar 

  38. Stochkendahl MJ, Christensen HW, Hartvigsen J, Vach W, Haas M, Hestbaek L, Adams A, Bronfort G. Manual examination of the spine: a systematic critical literature review of reproducibility. J Manipulative Physiol Ther. 2006;29(6):475–85. 485 e471–410

    Article  PubMed  Google Scholar 

  39. Shumway-Cook A, Woollacott MH. Motor control: translating research into clinical practice. 4th ed. Philadelphia: Wilkins LW; 2012.

    Google Scholar 

  40. Dziuban CD, Shirkey EC. When is a correlation matrix appropriate for factor analysis? Some decision rules. Psychol Bull. 1974;81(6):358–61.

    Article  Google Scholar 

  41. Poitras S, Rossignol M, Dionne C, Tousignant M, Truchon M, Arsenault B, Allard P, Cote M, Neveu A. An interdisciplinary clinical practice model for the management of low-back pain in primary care: the CLIP project. BMC Musculoskelet Disord. 2008;9:54.

    Article  PubMed  PubMed Central  Google Scholar 

  42. Childs JD, Fritz JM, Flynn TW, Irrgang JJ, Johnson KK, Majkowski GR, Delitto A. A clinical prediction rule to identify patients with low back pain most likely to benefit from spinal manipulation: a validation study. Ann Intern Med. 2004;141(12):920–8.

    Article  PubMed  Google Scholar 

  43. Henschke N, Maher CG, Refshauge KM, Herbert RD, Cumming RG, Bleasel J, York J, Das A, McAuley JH. Prognosis in patients with recent onset low back pain in Australian primary care: inception cohort study. BMJ. 2008;337:a171.

    Article  PubMed  Google Scholar 

  44. Traeger AC, Henschke N, Hubscher M, Williams CM, Kamper SJ, Maher CG, Moseley GL, McAuley JH. Estimating the risk of chronic pain: development and validation of a prognostic model (PICKUP) for patients with acute low back pain. PLoS Med. 2016;13(5):e1002019.

    Article  PubMed  PubMed Central  Google Scholar 

Download references


The authors wish to thank Sophie Bernard, Nathalie Fortier, Pierre-Marc Vézina and Alex Volant, physiotherapists, and Hélène Simard, physiotherapist-assistant, for their contribution to the data collection, as well as Samuel Camiré-Bernier for his technical assistance. We also thank Dany Ducharme for having consented to the publication of photos illustrating the exercises included in our program.


This study was supported by a grant from the Ordre professionnel de la physiothérapie du Québec and the Quebec Rehabilitation Research Network partnership program.

Availability of data and materials

The dataset used and/or analysed during the current study is available from the corresponding author on request.

Author information

Authors and Affiliations



MP - Coordination of the study, study design and writing of the protocol, training of research assistants, data analysis and interpretation, writing of the article, final approval of the version to be published; CG - Study design and writing of the protocol, training of research assistants, data collection and interpretation, critical revision of the article, final approval of the version to be published; PL- Study design and writing of the protocol, training of research assistants, data interpretation, critical revision of the article, final approval of the version to be published; JL - Data analysis and interpretation, writing of the article, final approval of the version to be published; MR - Data collection and interpretation, writing and critical revision of the article, final approval of the version to be published; JSR - Design and writing of the protocol, training of research assistants, data analysis and interpretation, writing of the article, final approval of the version to be published.

Corresponding author

Correspondence to Marc Perron.

Ethics declarations

Ethics approval and consent to participate

This study was approved by the ethics comittee of the CIUSSS de la Capitale-Nationale (Quebec Rehabilitation Institute). Participation in the study was voluntary, and informed consent forms were signed by all subjects.

Consent for publication

Consent for publication was obtained from the person appearing in images included in the additional file.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional file

Additional file 1:

Illustrations of the exercises included in the multi-station program. (DOCX 923 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Perron, M., Gendron, C., Langevin, P. et al. Prognostic factors of a favorable outcome following a supervised exercise program for soldiers with sub-acute and chronic low back pain. BMC Musculoskelet Disord 19, 95 (2018).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: