- Study protocol
- Open Access
- Open Peer Review
Decompression alone versus decompression with instrumental fusion the NORDSTEN degenerative spondylolisthesis trial (NORDSTEN-DS); study protocol for a randomized controlled trial
BMC Musculoskeletal Disorders volume 20, Article number: 7 (2019)
Fusion in addition to decompression has become the standard treatment for lumbar spinal stenosis with degenerative spondylolisthesis (DS). The evidence for performing fusion among these patients is conflicting and there is a need for further investigation through studies of high quality. The present protocol describes an ongoing study with the primary aim of comparing the outcome between decompression alone and decompression with instrumented fusion. The secondary aim is to investigate whether predictors can be used to choose the best treatment for an individual. The trial, named the NORDSTEN-DS trial, is one of three studies in the Norwegian Degenerative Spinal Stenosis (NORDSTEN) study.
The NORDSTEN-DS trial is a block-randomized, controlled, multicenter, non-inferiority study with two parallel groups. The surgeons at the 15 participating hospitals decide whether a patient is eligible or not according to the inclusion and exclusion criteria. Participating patients are randomized to either a midline preserving decompression or a decompression followed by an instrumental fusion.
Primary endpoint is the percentage of patients with an improvement in Oswestry Disability Index version 2.0 of more than 30% from baseline to 2-year follow-up. Secondary outcome measurements are the Zürich Claudication Questionnaire, Numeric Rating Scale for back and leg pain, Euroqol 5 dimensions questionnaire, Global perceived effect scale, complications and several radiological parameters. Analysis and interpretation of results will also be conducted after 5 and 10 years.
The NORDSTEN/DS trial has the potential to provide Level 1 evidence of whether decompression alone should be advocated as the preferred method or not. Further on the study will investigate whether predictors exist and if they can be used to make the appropriate choice for surgical treatment for this patient group.
ClinicalTrials.gov Identifier: NCT02051374. First Posted: January 31, 2014. Last Update Posted: February 14, 2018.
Lumbar degenerative spondylolisthesis (LDS) is the forward slip of one vertebra over another caused by degeneration and instability of facet joints, and degeneration of ligaments and intervertebral discs . Most patients suffer from symptoms related to a concomitant spinal stenosis, such as back pain, radiating pain to the lower extremities, and, typically, increased pain when walking upright and decreased pain when bending forward [2, 3].
Several meta-analyses and systematic reviews have been published with the purpose of providing guidelines on how to surgically treat patients with degenerative spondylolisthesis. Based largely on a pseudorandomized study from 1991 , they conclude that there is moderate evidence for a tendency towards better outcome when decompression is combined with fusion [3, 5,6,7]. A recently published randomized controlled trial (RCT) has lent support to this evidence . However, several cohort studies [9,10,11] and another recently published RCT , have introduced evidence against additional fusion when operating for LDS.
The primary objective is to detect whether the intervention-related difference in outcome between decompression alone (DA) and decompression with an additional instrumented fusion (DF) 2 years after surgery, is large enough to justify the use of instrumentation. Our hypothesis is that DA is “as good as” DF for the treatment of spinal stenosis with degenerative spondylolisthesis.
Health economic analysis: To compare the cost- utility of the investigated treatments DA and DF .
Predictor analysis: To evaluate whether radiological parameters and patient characteristics in the future can be used by clinicians to choose between DA and DF.
Long-time follow-up studies: The analyses performed at 2-year follow-up will be repeated at 5- and 10-year follow-up.
The proposed trial is a 1:1 block-randomized, controlled, multicenter, non-inferiority trial, with two parallel groups.
The study is one of three trials in the NORDSTEN study, a Norwegian multicenter study on patients with lumbar spinal stenosis .
The SPIRIT checklist  has been used as a template for the present protocol. One exclusion criterion has been detached from the original study protocol (Version 1.0) received January 10, 2014 in Clinicaltrials.gov (Identifier: NCT02051374), see ‘Amendment’.
The report of the trial will be based on an adapted Consolidated Standards of Reporting Trials (CONSORT) checklist for reporting non-inferiority trials .
The surgeons at the 15 participating hospitals (Table 1) are following the inclusion and exclusion criteria to decide whether a patient is eligible or not.
The patients are given verbal and written information about the study and the alternative treatment options. If willing to participate, the patients sign an informed consent form. If a patient does not want to participate in the study, he/she will not be included in the study and will receive normal care and be treated following the hospital’s established procedures. Criteria for inclusion and exclusion are given in Table 2.
All eligible patients are being registered, and the reasons that some are not included are being documented and interpreted. A CONSORT flow chart is illustrated in Fig. 1.
Posterior approach with decompression after microsurgical principles will be performed, and the midline structures will be preserved. The surgeons will either use a microscope or magnifying glasses.
Decompression and instrumental fusion
Posterior approach with decompression will be performed, followed by posterolateral pedicle screw fixation with or without an additional cage. The surgeons will either use a microscope or magnifying glasses.
Both groups will receive perioperative intravenous antibiotic prophylaxis. Postoperative care and mobilization will follow each hospital’s normal practices and routines.
Patient reported outcome measures (PROMs) will be collected preoperatively and at 3 months, 12 months, 2 years, 5 years and 10 years postoperatively. Primary endpoint is at 2-year follow-up. To evaluate the long-term results (5- and 10-year follow-up) we will use the same primary and secondary outcome measurements as at 2-year follow-up. The time schedule for collection of data is shown in Table 3.
The primary outcome is the proportion of responders assessed by the Oswestry Disability Index (ODI) V.2.0 [21, 22]. ODI scores range from 0 to 100, where 100 represent the greatest impairment. Based on former studies [23, 24] and a presently not submitted study from The Norwegian Registry for Spine Surgery (NORSpine), an individual ODI improvement of 30% or more from baseline to follow-up has been chosen as the cut-off for being a responder. Mean scores at follow-up and mean change scores from baseline to follow-up for the ODI scores will be secondary outcomes.
Other secondary outcome measurements are the mean scores at follow-up, the mean changes from baseline to follow-up and the responder rates assessed by the Zürich Claudication Questionnaire  [ZCQ; which ranges from 1 to 4 (worst disability)], and by the Numeric Rating Scale for back and leg pain (NRS; which ranges from 0 to 10 (worst pain imaginable)]. Cut-off values for being a responder for ZCQ are defined by Tully et al. . Based on data from NORSpine, the individual thresholds for being a responder are defined as a 40% reduction in the NRS leg pain and a 33% reduction in NRS back pain.
Additional secondary PROMs are the mean scores on the Euroqol 5-D  (EQ-5D; which ranges from − 0.6 to 1, with higher scores indicating better quality of life) and the scores on the Global Perceived Effect scale  (GPE; a global assessment of patient-rated satisfaction with treatment outcome, with the answers ‘completely recovered’, ‘much improved’, ‘slightly improved’, ‘unchanged’, ‘slightly worse’, ‘much worse’ and ‘worse than ever’). For comparing the failure rate between the groups, the proportion of patients replying ‘much worse’ or ‘worse than ever’ on the GPE scale will be calculated and compared between the groups.
In addition, we will compare the rates of complications and adverse effects (Table 4), the volume of blood loss, the use of blood transfusion perioperatively and postoperatively, the duration of the surgeries from the skin being opened to when it is closed, and the length of hospital stays. Any new surgery in the lumbosacral column from the time of the index operation to follow-up will be recorded and the reoperation rates will be compared. We will distinguish between an operation at the same level as the primary operation and an operation in a new segment.
For descriptive interpretation, and for the predictor analyses, The Hopkins symptom check list (HSCL-25; a self-reported questionnaire for assessment of psychological variables) , data concerning age, gender, education, work, smoking habits, comorbidity, osteoporosis, the American Society of Anesthesiologists (ASA) grade and prior history of spinal surgery will be recorded preoperatively. For radiological evaluations we will assess the grade of spinal stenosis , the foraminal stenosis , the magnitude of the olisthesis , the segmental instability , the orientation of the facet joint , the amount of facet joint fluid , the degree of disc degeneration [34, 35], the disc height in the level of listhesis , the lumbar lordosis  and the pelvic parameters (the sacral slope, the pelvic tilt and the pelvic incidence) . A CT scan will be performed at the 2 year follow up for assessment of fusion for the DF group . The time schedule for radiological examinations is given in Table 3. The radiological evaluations will be performed by at least one spine surgeon and one radiologist.
The sample size calculation for efficacy is based on the hypothesis that the 2-year results for the decompression alone group will be at least as good as those from the fusion group when comparing the proportions of responders in each group. The sample size is computed by using the Blackwelder methodology . Based on data from the Norwegian Spine Register, the proportion of responders for the whole treatment group is expected to be 0.70. Choosing a type 1 error = 0.05, power = 0.80 and non-inferiority limit (δ) = 0.15 gives a sample size of 116. Considering these assumptions and adding 10% for possible dropouts, a total of 128 patients are required in each group.
To ensure a standardized system of enrollment, one or two research coordinators at each hospital manage the practical details regarding registration, collection and further submission of patient data to the central coordinator at the Section for musculoskeletal research (FORMI), Division of neuroscience, at Oslo University Hospital.
The computer generated 1:1 randomization is block-permuted and center-stratified. After the patient has signed the informed consent form, the randomization is performed within the 6 weeks before treatment. The computer generated randomization procedure is concealed and administered by the central coordinator at FORMI, and communicated by phone and by email to the local research coordinator. The coordinator documents the result of the randomization in the patient’s records and assigns the allocated surgical procedure to the surgeon in charge. The randomization process cannot be influenced by the patients, the investigators, the surgeons or any other persons involved in the study.
The treatment given is not blinded for the patients. For analysis and testing of the efficacy variables, the statistician will be blinded for treatment adherence.
The study coordinators are responsible for the collection and administration of data at baseline and at 3-month follow-up. Data from 12-month 2-year, 5-year and 10-year follow-up is collected by the central coordinator at FORMI. All data will be stored at the Faculty of Research support, University of Oslo. The data will be inaccessible to the research group until the first analysis at 2-year follow-up.
The first analyses will be performed 2 years after surgery. Long-term follow-up analyses will be performed at 5 and 10 years after surgery.
For the primary objective, the proportion of patients with a reduction in ODI of 30% or more from baseline to 2-year follow-up (responder-rate) is defined as the primary outcome [23, 40]. The null hypothesis (H0) is that the responder rate in the decompression alone group is inferior the responder rate in the decompression and fusion group with an amount of 0.15. H0 will be tested by forming a 95% confidence interval (CI) for the difference of proportions, and H0 will be rejected if the upper limit of the confidence interval (CI) is less than 0.15.
The alternative hypothesis is that the responder rate in the DA group is non- inferior the responder rate in the DF group (Fig. 2).
We have predefined the non- inferior margin to be 0.15 of 1.0, i.e., a 15 percentage difference in the responder rate . With this margin it will be necessary to treat 7 patients or more with fusion in addition to decompression in order to prevent one responder. (Number needed to treat = 1.0:0.15 = 6.67) .
The statistical analysis will be done according to intention-to-treat principles (ITT). A sensitivity analysis will be conducted where patient’s crossing over from one treatment to another will receive the last score before crossover. To recommend DA, both the ITT and the sensitivity analysis are required to show non-inferiority.
Descriptive statistics, including measures of centrality and variability, will be used to describe the baseline characteristics of the two treatment groups.
The difference in the proportions of responders (the primary outcome) will be estimated with the Newcombe hybrid score CI . Categorical secondary outcomes will be analyzed with Fisher mid-P tests and Newcombe hybrid score intervals. The GPE responses will be analyzed with a proportional odds logistic regression model. We will use linear mixed models to estimate the difference between the treatment groups for the continuous secondary outcomes (all follow-up measurements from inclusion to 2-year follow-up will be included). Because most change from baseline is expected to occur the first three months, the time development in the linear mixed models will be modelled as piecewise linear, with a knot at 3 months. The models will include fixed effects for treatment group, time, and treatment group x time interaction. A random intercept will be used, and – if possible – a random effect for treatment group.
For the primary outcome, the primary analysis will be a complete case analysis. If there are patients with missing data in the primary outcome, sensitivity analyses with different imputation scenarios will be performed. The scenarios include all DA patients (with missing data) are responders and none of the DF patients (with missing data) are responders, and vice versa; all DA and all DF patients are responders; all DA and all DF patients are non-responders. Missing data for the continuous secondary outcomes will be handled by the linear mixed models, which include all patients with a measurement at at least one time point.
Complete case analyses will be performed on the categorical secondary outcomes. A significance level of 5% will be used throughout.
Analyses of secondary objectives
The predictor analysis will be performed by use of a pragmatic model-building approach of Hosmer et.al . This method is advocated when risk factor modelling is of interests and not just prediction . Patients treated with decompression alone and decompression with fusion will be analyzed in separate cohorts. For each cohort the following purposefully selected baseline variables will be tested for their association to the primary outcome variable ‘responder’: 1) Patient age; 2) Gender; 3) Comorbidity (ASA group); 4) Body Mass Index; 5) Smoking; 6) ODI score; 7) NRS back pain score; 8) NRS leg pain score; 9) Hopkins symptom check list (HSCL-25); 10) The magnitude of olisthesis; 11) Segmental instability; 12) Presence of foraminal stenosis; 13) Orientation of the facet joint; 14) Amount of facet joint fluid; 15) Disc degeneration; 16) Disc height in the level of olisthesis; 17) Lumbal lordosis; 18) Pelvic incidence.
From a univariate screening, variables with P < 0.25 will be included in the multivariate analyses. Since age and gender will be of interests for clinicians when searching for the best choice of treatment, these variables will be included throughout the multivariate analysis. In the second step, the iterative process, covariates are removed if they are non- significant predictors at the 0.1 alpha level and not a confounder. Confounding is defined as a change in any remaining covariate more than 15% when removing a covariate from the model. The covariates will be deleted in descending rang according to the p-value. After deleting and refitting, the model will contain only significant covariates and confounders. In the next step, the covariates not selected from the univariate analysis one by one will be tested for their contributions in the presence of variables from the retained model. If significant at alpha level 0.15 they are included for further fitting of the multivariate model. Finally the model is iteratively reduced as before, but only variables additionally added will be excluded. From the final best fitted model for each treatment group, predicted probabilities of being a responder will be estimated for each combination of the covariates. The risk estimates will be used for building matrixes for an individual’s overall risk for being a responder following surgery. Previously, risk matrix models for predicting probability given a set of established predictors has been constructed for other conditions [46, 47].
Cost-utility will be analyzed as the difference in costs between the two treatment groups divided by their difference in Quality adjusted life years (QALYs) gained [48, 49]. QALYs will be estimated by combining EQ-5D index and time, calculating the area under the curve using the trapezoidal method. The results will be presented as an incremental cost-effectiveness ratio (ICER), meaning the cost for each unit of effect (QALY) gained from decompression alone instead of decompression with instrumented fusion. The presentation will be done from a health provider perspective based on data from two-year follow-up.
Clinical monitoring of the trial
The trial is monitored following the Helsinki Declaration, The International Conference on Harmonisation Guideline for Good Clinical Practice (ICH GCP) . An independent monitor affiliated with Møre and Romsdal Health Trust, without influence on the scientific work, will be responsible for the monitoring. Due to the non-regulated ICH GCP guideline for this trial (not including drug intervention) the risk and safety will be safeguarded at the same level as data quality. All informed consent forms will be checked and all registrations of serious events will be monitored. According to the monitoring plan selected variables will be checked. All hospitals will be visited regularly. Adapted versions of the ‘Investigator’s Site File (ISF)’ and the ‘Trial Master File (TMF)’ will be checked for essential documents during the trial. Queries and deviations will be recorded and reported, and the coordinator at the responsible hospital will have two months to send a written report with the required corrections to the monitor.
Interim analysis and stopping rules
Due to ethical considerations in agreement with the Norwegian Committee for Medical and Health Research Ethics Midt, an interim analysis for safety will be performed when 75 patients in each group have completed the 12-month follow- up. If one of the proposed stop criteria is fulfilled the study will be terminated:
The proportion of patients needing reoperation due to any condition in the operated level(s) is statistically significantly higher in one of the groups.
The proportion of responders in the DF group, assessed by the primary outcome measure, is higher than in the DA group by an amount of 0.20.
The interim analysis will be conducted by an independent statistician blinded for treatment adherence. Only data on reoperations and on the primary outcome measure (ODI) will be available to the statistician. The statistician will inform the steering committee, via the central coordinator, whether the study can be continued or not. Further information about the analysis will not be disclosed and will not be available to anyone until the main analysis at 2-year follow-up.
Ethics and dissemination
The protocol has been approved by the Norwegian Committee for Medical and Health Research Ethics Midt (2013/366).
Storage of data is approved by the Norwegian Data Inspectorate. Written informed consent is obtained from the patients. The project is in accordance with the Helsinki Declaration.
None of the principal investigators have any financial or other competing conflicts of interest.
Trial results will be communicated at national and international conferences and published in well-recognized journals.
The rationale, design and method for this prospective randomized clinical multi-center trial on patients with LDS are presented in the current protocol.
We have chosen a non-inferiority design in order to investigate whether clinical outcomes for decompression alone are not worse than decompression with fusion by more than an acceptable amount. Superiority for decompression alone is not considered to be necessary; it would be an additional benefit .
The present study will be the largest powered study comparing decompression alone and decompression with instrumented fusion in a randomized setting. It is designed and powered to provide Level 1 evidence for whether decompression alone can be advocated as the preferred method for surgical treatment of DS or not. We also aim to investigate whether patients can be assigned to the most appropriate surgical method. Finally, results at 5- and 10-year follow-up will provide high level evidence for long-time results for the two methods.
We anticipate enclosing the inclusion by the end of 2017.
Decompression with an additional fusion
EuroQol 5-dimensional questionnaire utility index
Global perceived effect
Hopkins symptom check list
Lumbar spinal stenosis
Magnetic resonance imaging
Number needed to treat
Numeric rating scale
Oswestry disability index
Patient reported outcome measures
Statistical Analysis Plan
Zurich claudication questionnaire –score
Farfan HF (2012) - The pathological anatomy of degenerative spondylolisthesis. A cadaver study. - spine (Phila pa 1976)1980;5(5):412-8:Oct.
Fitzgerald JA, Newman PH. Degenerative spondylolisthesis. J Bone Joint Surg Br. 1976;58:184–92.
Watters WC 3rd, Bono CM, Gilbert TJ, Kreiner DS, Mazanec DJ, Shaffer WO, Baisden J, Easa JE, Fernand R, Ghiselli G, Heggeness MH, Mendel RC, O'Neill C, Reitman CA, Resnick DK, Summers JT, Timmons RB, Toton JF, North American Spine S. An evidence-based clinical guideline for the diagnosis and treatment of degenerative lumbar spondylolisthesis. Spine J. 2009;9:609–14. https://doi.org/10.1016/j.spinee.2009.03.016.
Herkowitz HN, Kurz LT. Degenerative lumbar spondylolisthesis with spinal stenosis. A prospective study comparing decompression with decompression and intertransverse process arthrodesis. J Bone Joint Surg Am. 1991;73:802–8.
Martin CR, Gruszczynski AT, Braunsfurth HA, Fallatah SM, O'Neil J, Wai EK. The surgical management of degenerative lumbar spondylolisthesis: a systematic review. Spine. 2007;32:1791–8. https://doi.org/10.1097/BRS.0b013e3180bc219e.
Resnick DK, Watters WC 3rd, Sharan A, Mummaneni PV, Dailey AT, Wang JC, Choudhri TF, Eck J, Ghogawala Z, Groff MW, Dhall SS, Kaiser MG. Guideline update for the performance of fusion procedures for degenerative disease of the lumbar spine. Part 9: lumbar fusion for stenosis with spondylolisthesis. J Neurosurg Spine. 2014;21:54–61. https://doi.org/10.3171/2014.4.spine14274.
Steiger F, Becker HJ, Standaert CJ, Balague F, Vader JP, Porchet F, Mannion AF. Surgery in lumbar degenerative spondylolisthesis: indications, outcomes and complications. A systematic review. Eur Spine J. 2014;23:945–73. https://doi.org/10.1007/s00586-013-3144-3.
Ghogawala Z, Dziura J, Butler WE, Dai F, Terrin N, Magge SN, Coumans JV, Harrington JF, Amin-Hanjani S, Schwartz JS, Sonntag VK, Barker FG 2nd, Benzel EC. Laminectomy plus fusion versus laminectomy alone for lumbar spondylolisthesis. N Engl J Med. 2016;374:1424–34. https://doi.org/10.1056/NEJMoa1508788.
Chang HS, Fujisawa N, Tsuchiya T, Oya S, Matsui T. Degenerative spondylolisthesis does not affect the outcome of unilateral laminotomy with bilateral decompression in patients with lumbar stenosis. Spine. 2014;39:400–8. https://doi.org/10.1097/BRS.0000000000000161.
Forsth P, Michaelsson K, Sanden B (2013) Does fusion improve the outcome after decompressive surgery for lumbar spinal stenosis?: a two-year follow-up study involving 5390 patients. Bone joint J 95-B:960-965. 95-B/7/960;https://doi.org/10.1302/0301-620X.95B7.30776.
Austevoll IM, Gjestad R, Brox JI, Solberg TK, Storheim K, Rekeland F, Hermansen E, Indrekvam K, Hellum C. The effectiveness of decompression alone compared with additional fusion for lumbar spinal stenosis with degenerative spondylolisthesis: a pragmatic comparative non-inferiority observational study from the Norwegian registry for spine surgery. Eur Spine J. 2016. https://doi.org/10.1007/s00586-016-4683-1.
Forsth P, Olafsson G, Carlsson T, Frost A, Borgstrom F, Fritzell P, Ohagen P, Michaelsson K, Sanden B. A randomized, controlled trial of fusion surgery for lumbar spinal stenosis. N Engl J Med. 2016;374:1413–23. https://doi.org/10.1056/NEJMoa1513721.
Weinstein J, Pearson A. Fusion in degenerative spondylolisthesis becomes controversial … again. Evid Based Med. 2016;21:148–9. https://doi.org/10.1136/ebmed-2016-110474.
Peul WC, Moojen WA. Fusion for lumbar spinal stenosis--safeguard or superfluous surgical implant? N Engl J Med. 2016;374:1478–9. https://doi.org/10.1056/NEJMe1600955.
Pearson AM. Fusion in degenerative spondylolisthesis: how to reconcile conflicting evidence. Journ Spine Surg. 2016;2:143–5.
Joaquim AF, Milano JB, Ghizoni E, Patel AA. Is there a role for decompression alone for treating symptomatic degenerative lumbar spondylolisthesis?: a systematic review. Clin Spine Surg. 2016;29:191–202. https://doi.org/10.1097/bsd.0000000000000357.
Angevine PD, Berven S. Health economic studies: an introduction to cost-benefit, cost-effectiveness, and cost-utility analyses. Spine. 2014;39:S9–15. https://doi.org/10.1097/brs.0000000000000576.
Hermansen E, Austevoll IM, Romild UK, Rekeland F, Solberg T, Storheim K, Grundnes O, Aaen J, Brox JI, Hellum C, Indrekvam K. Study-protocol for a randomized controlled trial comparing clinical and radiological results after three different posterior decompression techniques for lumbar spinal stenosis: the spinal stenosis trial (SST) (part of the NORDSTEN study). BMC Musculoskelet Disord. 2017;18:121. https://doi.org/10.1186/s12891-017-1491-7.
Agha RA, Altman DG, Rosin D. The SPIRIT 2013 statement--defining standard protocol items for trials. Int J Surg. 2015;13:288–91. https://doi.org/10.1016/j.ijsu.2014.12.007.
Piaggio GF, − Elbourne DR FAU - Altman D, − Altman DG FAU - Pocock S, − Pocock SJ FAU - Evans S, Evans SJ (2010) - Reporting of noninferiority and equivalence randomized trials: an extension of the CONSORT statement. - Jama 2006;295(10):1152–60:60.
Fairbank JC, Couper J, Davies JB, O'Brien JP. The Oswestry low back pain disability questionnaire. Physiotherapy. 1980;66:271–3.
Baker DJ, Pynsent PB, J F (1989) The Oswestry disability index revisited: its reliability, repeatability and validity, and a comparison with the St Thomas’s disability index. Roland MO, Jenner JR, eds New approaches to rehabilitation and education Manchester: Manchester University Press:174–186.
Dworkin RH, Turk DC, Wyrwich KW, Beaton D, Cleeland CS, Farrar JT, Haythornthwaite JA, Jensen MP, Kerns RD, Ader DN, Brandenburg N, Burke LB, Cella D, Chandler J, Cowan P, Dimitrova R, Dionne R, Hertz S, Jadad AR, Katz NP, Kehlet H, Kramer LD, Manning DC, McCormick C, McDermott MP, McQuay HJ, Patel S, Porter L, Quessy S, Rappaport BA, Rauschkolb C, Revicki DA, Rothman M, Schmader KE, Stacey BR, Stauffer JW, von Stein T, White RE, Witter J, Zavisic S. Interpreting the clinical importance of treatment outcomes in chronic pain clinical trials: IMMPACT recommendations. J Pain. 2008;9:105–21. https://doi.org/10.1016/j.jpain.2007.09.005.
Ostelo RW, Deyo RA, Stratford P, Waddell G, Croft P, Von Korff M, Bouter LM, de Vet HC. Interpreting change scores for pain and functional status in low back pain: towards international consensus regarding minimal important change. Spine. 2008;33:90–4. https://doi.org/10.1097/BRS.0b013e31815e3a10.
Tuli SK, Yerby SA, Katz JN. Methodological approaches to developing criteria for improvement in lumbar spinal stenosis surgery. Spine. 2006;31:1276–80. https://doi.org/10.1097/01.brs.0000217615.20018.6c.
(1990) EuroQol--a new facility for the measurement of health-related quality of life. Health policy (Amsterdam, Netherlands) 16:199–208.
Kamper SJ, Ostelo RW, Knol DL, Maher CG, de Vet HC, Hancock MJ (2010) Global perceived effect scales provided reliable assessments of health transition in people with musculoskeletal disorders, but ratings are strongly influenced by current status. J Clin Epidemiol 63:760-766. Doi: S0895-4356(09)00304-7.
Derogatis LR FAU, Lipman RS FAU, Rickels KF, Uhlenhuth EH FAU, Covi L, Derogatis LR FAU, Lipman RS FAU, Rickels KF, Uhlenhuth EH FAU, Covi L (2000) - The Hopkins symptom checklist (HSCL). A measure of primary symptom dimensions - the Hopkins symptom checklist (HSCL): a self-report symptom inventory Mod Probl Pharmacopsychiatry 1974;7(0):79–110:110.
Schizas C, Theumann N, Burn A, Tansey R, Wardlaw D, Smith FW, Kulik G. Qualitative grading of severity of lumbar spinal stenosis based on the morphology of the dural sac on magnetic resonance images. Spine. 2010;35:1919–24. https://doi.org/10.1097/BRS.0b013e3181d359bd.
Lee S, Lee JW, Yeom JS, Kim KJ, Kim HJ, Chung SK, Kang HS (2010) A practical MRI grading system for lumbar foraminal stenosis. AJR Am J Roentgenol 194:1095-1098. doi: 194/4/1095;https://doi.org/10.2214/AJR.09.2772 [doi].
Dupuis PR, Yong-Hing K, Cassidy JD, Kirkaldy-Willis WH Radiologic diagnosis of degenerative lumbar spinal instability. Spine. 1985; 10:262–276.
Berlemann UF, Jeszenszky DJ FAU, Buhler DW FAU, Harms J (2005) - Facet joint remodeling in degenerative spondylolisthesis: an investigation of joint orientation and tropism. - Eur Spine J 1998;7(5):376-80:80.
Cho IY, Park SY, Park JH, Suh SW, Lee SH. MRI findings of lumbar spine instability in degenerative spondylolisthesis. J Orthop Surg (Hong Kong). 2017;25:2309499017718907. https://doi.org/10.1177/2309499017718907.
Pfirrmann CW, Metzdorf A, Zanetti M, Hodler J, Boos N. Magnetic resonance classification of lumbar intervertebral disc degeneration. Spine (Phila Pa 1976). 2001;26:1873–8.
Modic MT, Steinberg PM, Ross JS, Masaryk TJ, Carter JR Degenerative disk disease: assessment of changes in vertebral body marrow with MR imaging. Radiology. 1988 166:193–199. https://doi.org/10.1148/radiology.166.1.3336678.
Masharawi Y, Kjaer P, Bendix T, Manniche C, Wedderkopp N, Sorensen JS, Peled N, Jensen TS. The reproducibility of quantitative measurements in lumbar magnetic resonance imaging of children from the general population. Spine. 2008;33:2094–100. https://doi.org/10.1097/BRS.0b013e31817f19f7.
Schwab F, Lafage V, Patel A, Farcy JP. Sagittal plane considerations and the pelvis in the adult patient. Spine. 2009;34:1828–33. https://doi.org/10.1097/BRS.0b013e3181a13c08.
Bridwell KH, Lenke LG, McEnery KW, Baldus C, Blanke K. Anterior fresh frozen structural allografts in the thoracic and lumbar spine. Do they work if combined with posterior fusion and instrumentation in adult patients with kyphosis or anterior column defects? Spine. 1995;20:1410–8.
Blackwelder WC, Chang MA. Sample size graphs for "proving the null hypothesis". Control Clin Trials. 1984;5:97–105.
Katz NP, Paillard FC, Ekman E. Determining the clinical importance of treatment benefits for interventions for painful orthopedic conditions. J Orthop Surgery Res. 2015;10:24. https://doi.org/10.1186/s13018-014-0144-x.
Blumenthal S, McAfee PC, Guyer RD, Hochschuler SH, Geisler FH, Holt RT, Garcia R Jr, Regan JJ, Ohnmeiss DD. A prospective, randomized, multicenter Food and Drug Administration investigational device exemptions study of lumbar total disc replacement with the CHARITE artificial disc versus lumbar fusion: part I: evaluation of clinical outcomes. Spine. 2005;30:1565–75 discussion E1387-1591.
Katz N, Paillard FC, Van Inwegen R. A review of the use of the number needed to treat to evaluate the efficacy of analgesics. J Pain. 2015;16:116–23. https://doi.org/10.1016/j.jpain.2014.08.005.
Fagerland MW, Lydersen S, Laake P (2011) Recommended confidence intervals for two independent binomial proportions. Stat Methods Med Res. Doi: 0962280211415469 ;https://doi.org/10.1177/0962280211415469 [doi].
Hosmer DW, Lemeshow S. Applied logistic regression. 2nd ed. New York: John Wiley & Sons, Inc, Hoboken; 2000.
Bursac Z, Gauss CH, Williams DK, Hosmer DW. Purposeful selection of variables in logistic regression. Source Code Biol Med. 2008;3:17. https://doi.org/10.1186/1751-0473-3-17.
Conroy RM, Pyorala K, Fitzgerald AP, Sans S, Menotti A, De Backer G, De Bacquer D, Ducimetiere P, Jousilahti P, Keil U, Njolstad I, Oganov RG, Thomsen T, Tunstall-Pedoe H, Tverdal A, Wedel H, Whincup P, Wilhelmsen L, Graham IM. Estimation of ten-year risk of fatal cardiovascular disease in Europe: the SCORE project. Eur Heart J. 2003;24:987–1003.
Solberg IC, Hoivik ML, Cvancarova M, Moum B. Risk matrix model for prediction of colectomy in a population-based study of ulcerative colitis patients (the IBSEN study). Scand J Gastroenterol. 2015;50:1456–62. https://doi.org/10.3109/00365521.2015.1064991.
Drummond M (2005) Methods for the economic evaluation of health care Programmes. 3rd ed Oxford, NY : Oxford medical publications, Oxford University press;
Lonne G, Johnsen LG, Aas E, Lydersen S, Andresen H, Ronning R, Nygaard OP. Comparing cost-effectiveness of X-stop with minimally invasive decompression in lumbar spinal stenosis: a randomized controlled trial. Spine. 2015;40:514–20. https://doi.org/10.1097/brs.0000000000000798.
ICH, Harmonised, Tripartite, Guideline (2014) Guideline for good clinical practice E6(R1). 1996. http://www.ich.org/products/guidelines/efficacy/efficacy-single/article/good-clinical-practice.html.
Thanks to Eira Kathleen Ebbs for linguistic assistance in writing the manuscript. Thanks to study coordinators at the participating hospitals and the central coordinators at FORMI, for an extensive contribution in keeping track of the patients in the study.
From the start of inclusion (April 15, 2014) patients with ODI scores of less than 25 were excluded. Due to the experiences of participating surgeons, a considerable number of the patients suffering from leg and back pain, but with ODI of less than 25, were found eligible for operation but not for inclusion in the study. To enhance the external validity of the study, the steering committee decided that from date 29th August 2015, the patients should not be excluded due to ODS- score lower than 25.
Helse Vest RHF (the Western Regional Health Authority) has provided funds for the present study. The funder has no influence on study design, management and interpretation of data or the decision to submit data.
Availability of data and materials
All data will be stored at the Faculty of Research support, University of Oslo. The data will be inaccessible to the research group until the first analysis at 2-year follow-up. Storage of data is approved by the Norwegian Data Inspectorate.
Ethics approval and consent to participate
The patients are given verbal and written information about the study. If willing to participate, the patients sign an informed consent form. If a patient does not want to participate, he/she will receive normal care and be treated following the hospital’s established procedures.
The protocol has been approved by the Norwegian Committee for Medical and Health Research Ethics Midt (2013/366).
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.