Participants
In this multi-center randomized clinical trial, consecutive patients with CH presenting to 1 of 8 outpatient physical therapy clinics from a variety of geographical locations (Arizona, Georgia, New York, Ohio, Pennsylvania, South Carolina) were recruited over a 29-month period (from April 2012 to August 2014). For patients to be eligible, they had to present with a diagnosis of CH according to the revised diagnostic criteria [5] developed by the Cervicogenic Headache International Study Group (CHISG) [5, 18, 19]. CH was classified according to the “major criteria” (not including confirmatory evidence by diagnostic anesthetic blockades) and “head pain characteristics” of the CHISG. Therefore, in order to be included in the study, patients had to exhibit all of the following criteria: (1) unilaterality of the head pain without sideshift, starting in the upper posterior neck or occipital region, eventually spreading to the oculofrontotemporal area on the symptomatic side, (2) pain triggered by neck movement and/or sustained awkward positions, (3) reduced range of motion in the cervical spine [20] (i.e., less than or equal to 32 ° of right or left passive rotation on the Flexion-Rotation Test [21–23], (4) pain elicited by external pressure over at least one of the upper cervical joints (C0-3), and (5) moderate to severe, non-throbbing and non-lancinating pain. In addition, participants had to have a headache frequency of at least 1 per week for a minimum of 3 months, a minimum headache intensity pain score of two points (0–10 on the NPRS scale), a minimum disability score of 20 % or greater (i.e., 10 points or greater on the 0–50 NDI scale), and be between 18 and 65 years of age.
Patients were excluded if they exhibited other primary headaches (i.e., migraine, TTH), suffered from bilateral headaches, or exhibited any red flags (i.e., tumor, fracture, metabolic diseases, rheumatoid arthritis, osteoporosis, resting blood pressure greater than 140/90 mmHg, prolonged history of steroid use, etc.), presented with two or more positive neurologic signs consistent with nerve root compression (muscle weakness involving a major muscle group of the upper extremity, diminished upper extremity deep tendon reflex, or diminished or absent sensation to pinprick in any upper extremity dermatome), presented with a diagnosis of cervical spinal stenosis, exhibited bilateral upper extremity symptoms, had evidence of central nervous system involvement (hyperreflexia, sensory disturbances in the hand, intrinsic muscle wasting of the hands, unsteadiness during walking, nystagmus, loss of visual acuity, impaired sensation of the face, altered taste, the presence of pathological reflexes), had a history of whiplash injury within the previous 6 weeks, had prior surgery to the head or neck, had received treatment for head or neck pain from any practitioner within the previous month, had received physical therapy or chiropractic treatment for head or neck pain within the previous 3 months, or had pending legal action regarding their head or neck pain.
The most recent literature suggests that pre-manipulative cervical artery testing is unable to identify those individuals at risk of vascular complications from cervical manipulation [24, 25], and any symptoms detected during pre-manipulative testing may be unrelated to changes in blood flow in the vertebral artery [26, 27]. Hence, pre-manipulative cervical artery testing was not performed in this study; however, screening questions for cervical artery disease had to be negative [24, 28, 29]. This study was approved by the Institutional Review Board at Long Island University, Brooklyn, NY. The study was registered at www.clinicaltrials.gov with trial identifier NCT01580280. All patients were informed that they would receive either manipulation or mobilization and exercise and then provided informed consent before their enrollment in the study.
Treating therapists
Twelve physical therapists (mean age 36.6 years, SD 5.62) participated in the delivery of treatment for patients in this study. They had an average of 10.3 (SD 5.66, range 3–20 years) years of clinical experience, and all had completed a 60 h post-graduate certification program that included practical training in manual techniques including the use of cervical and thoracic manipulation. To ensure all examination, outcome assessments, and treatment procedures were standardized, all participating physical therapists were required to study a manual of standard operating procedures and participate in a 4 h training session with the principal investigator.
Examination procedures
All patients provided demographic information, completed the Neck Pain Medical Screening Questionnaire, and completed a number of self-report measures, followed by a standardized history and physical examination at baseline. Self-report measures included headache intensity as measured by the NPRS (0–10), the NDI (0–50), headache frequency (number of days with headache in the last week), headache duration (total hours of headache in the last week), and medication intake (number of times the patient had taken narcotic or over-the-counter pain medication in the past week).
The standardized physical examination was not limited to, but included measurements of C1-2 (atlanto-axial joint) passive right and left rotation ROM using the Flexion-Rotation Test (FRT). The inter-rater reliability for the FRT has been found to be excellent (ICC: 0.93; 95 % CI: 0.87, 0.96) [30].
Outcome measures
The primary outcome measure used in this study was the patient’s headache intensity as measured by the NPRS. Patients were asked to indicate the average intensity of headache pain over the past week using an 11-point scale ranging from 0 (“no pain”) to 10 (“worst pain imaginable”) at baseline, 1-week, 1-month, and 3-months following the initial treatment session [31]. The NPRS is a reliable and valid instrument to assess pain intensity [32–34]. Although no data exists in patients with CH, the MCID for the NPRS has been shown to be 1.3 in patients with mechanical neck pain [32] and 1.74 in patients with a variety of chronic pain conditions [34]. Therefore, we chose to only include patients with an NPRS score of 2 points (20 %) or greater.
Secondary outcome measures included the NDI, the Global Rating of Change (GRC), headache frequency, headache duration, and medication intake. The NDI is the most widely used instrument for assessing self-rated disability in patients with neck pain [35–37]. The NDI is a self-report questionnaire with 10-items rated from 0 (no disability) to five (complete disability) [38]. The numeric responses for each item are summed for a total score ranging between 0 and 50; however, some evaluators have chosen to multiply the raw score by two, and then report the NDI on a 0–100 % scale [36, 39]. Higher scores represent increased levels of disability. The NDI has been found to possess excellent test-retest reliability, strong construct validity, strong internal consistency and good responsiveness in assessing disability in patients with mechanical neck pain [36], cervical radiculopathy [33, 40], whiplash associated disorder [38, 41, 42], and mixed non-specific neck pain [43, 44]. Although no studies have examined the psychometric properties of the NDI in patients with CH, we chose to only include patients with an NDI score of ten points (20 %) or greater, because this cut-off score captures the MCID for the NDI, which has been reported to approximate four, eight, and nine points (0–50) in patients with mixed non-specific neck pain [44], mechanical neck pain [45], and cervical radiculopathy [33], respectively. Headache frequency was measured as the number of days with headache in the last week, ranging from 0 to 7 days. Headache duration was measured as the total hours of headache in the last week, with six possible ranges: (1) 0–5 h, (2) 6–10 h, (3) 11–15 h, (4) 16–20 h, (5) 21–25 h, or (6) 26 or more hours. Medication intake was measured as the number of times the patient had taken prescription or over-the-counter analgesic or anti-inflammatory medication in the past week for their headaches, with five options: (1) not at all, (2) once a week, (3) once every couple of days, (4) once or twice a day, or (5) three or more times a day.
Patients returned for 1-week, 4-weeks, and 3-months follow-ups where the aforementioned outcome measures were again collected. In addition, at the 1-week, 4-weeks and 3-months follow-ups, patients completed a 15-point GRC question based on a scale described by Jaeschke et al. [46] to rate their own perception of improved function. The scale ranges from -7 (a very great deal worse) to zero (about the same) to +7 (a very great deal better). Intermittent descriptors of worsening or improving are assigned values from -1 to -6 and +1 to +6, respectively. The MCID for the GRC has not been specifically reported but scores of +4 and +5 have typically been indicative of moderate changes in patient status [46]. However, it should be noted that recently Schmitt and Abbott reported that the GRC might not correlate with changes in function in a population with hip and ankle injuries [47]. All outcome measures were collected by an assessor blind to group assignment.
On the initial visit patients completed all outcome measures then received the first treatment session. Patients completed 6–8 treatment sessions of either manipulation or mobilization combined with exercise over 4 weeks. Additionally, subjects were asked if they had experienced any “major” adverse events [48, 49] (stroke or permanent neurological deficits) at each follow-up period.
Randomization
Following the baseline examination, patients were randomly assigned to receive either manipulation or mobilization and exercise. Concealed allocation was performed by using a computer-generated randomized table of numbers created by an individual not involved with recruiting patients prior to the beginning of the study. Individual, sequentially numbered index cards with the random assignment were prepared for each of 8 data collection sites. The index cards were folded and placed in sealed opaque envelopes. Blinded to the baseline examination, the treating therapist opened the envelope and proceeded with treatment according to the group assignment. Patients were instructed not to discuss the particular treatment procedure received with the examining therapist. The examining therapist remained blind to the patient’s treatment group assignment at all times; however, based on the nature of the interventions it was not possible to blind patients or treating therapists.
Manipulation group
Manipulations targeting the right and left C1-2 articulations and bilateral T1-2 articulations were performed on at least one of the 6–8 treatment sessions (Figs. 1 and 2). On other treatment sessions, therapists either repeated the C1-2 and/or T1-2 manipulations or targeted other spinal articulations (i.e., C0-1, C2-3, C3-7, T2-9, ribs 1–9) using manipulation. The selection of the spinal segments to target was left to the discretion of the treating therapist and it was based on the combination of patient reports and manual examination. For both the upper cervical and upper thoracic manipulations, if no popping or cracking sound was heard on the first attempt, the therapist repositioned the patient and performed a second manipulation. A maximum of 2 attempts were performed on each patient similar to other studies [14, 50–53]. The clinicians were instructed that the manipulations are likely to be accompanied by multiple audible popping sounds [54–58]. Patients were encouraged to maintain usual activity within the limits of pain; however, mobilization and the prescription of exercises, or any use of other modalities, were not provided to this group.
The manipulation targeting C1-2 was performed with the patient in supine. For this technique, the patient’s left posterior arch of the atlas was contacted with the lateral aspect of the proximal phalanx of the therapist’s left second finger using a “cradle hold”. To localize the forces to the left C1-2 articulation, the patient was positioned using extension, a posterior-anterior (PA) shift, ipsilateral side-bend and contralateral side-shift. While maintaining this position, the therapist performed a single high-velocity, low-amplitude thrust manipulation to the left atlanto-axial joint using right rotation in an arc toward the underside eye and translation toward the table (Fig. 1). This was repeated using the same procedure but directed to the right C1-2 articulation.
The manipulation targeting T1-2 was performed with the patient in supine. For this technique, the patient held her/his arms and forearms across the chest with the elbows aligned in a superoinferior direction. The therapist contacted the transverse processes of the lower vertebrae of the target motion segment with the thenar eminence and middle phalanx of the third digit. The upper lever was localized to the target motion segment by adding rotation away and side-bend towards the therapist while the underside hand used pronation and radial deviation to achieve rotation toward and side-bend away moments, respectively. The space inferior to the xiphoid process and costochondral margin of the therapist was used as the contact point against the patient’s elbows to deliver a manipulation in an anterior to posterior direction targeting T1-2 bilaterally (Fig. 2).
Mobilization and exercise group
Mobilizations targeting the right and left C1-2 articulations and bilateral T1-2 articulations were performed on at least one of the 6–8 treatment sessions. On other treatment sessions, therapists either repeated the C1-2 and/or T1-2 mobilizations or targeted other spinal articulations (i.e., C0-1, C2/3, C3-7, T2-9, ribs 1–9) using mobilization. The selection of the spinal segments to target was left to the discretion of the treating therapist and it was based on the combination of patient reports and manual examination. However, in order to avoid a “contact” or “attention effect” when compared with the manipulation group, therapists were instructed to mobilize one cervical segment (i.e., right and left) and one thoracic segment or rib articulation on each treatment session.
The mobilization targeting the C1-2 articulation was performed in prone. For this technique, the therapist performed one 30 s bout of left-sided unilateral grade IV PA mobilizations to the C1-2 motion segment as described by Maitland [7]. This same procedure was repeated for one 30 s bout to the right atlanto-axial joint. In addition, and on at least one session, mobilization directed to the upper thoracic (T1-2) spine with the patient prone was performed. For this technique, the therapist performed one 30 s bout of central grade IV PA mobilizations to the T1-2 motion segment as described by Maitland [7]. Therefore, we used 180 (i.e., three 30 s bouts at approximately 2 Hz) end-range oscillations in total on each subject for the mobilization treatment. Notably, there is no high quality evidence to date to suggest that longer durations of mobilization result in greater pain reduction than shorter durations or dosages of mobilization [59, 60].
Cranio-cervical flexion exercises [11, 61–63] were performed with the patient in supine, with the knees bent and the position of the head standardized by placing the craniocervical and cervical spines in a mid-position, such that a line between the subject’s forehead and chin was horizontal, and a horizontal line from the tragus of the ear bisected the neck longitudinally. An air-filled pressure biofeedback unit (Chattanooga Group, Inc., Hixson, TN) was placed suboccipitally behind the patient’s neck and preinflated to a baseline of 20 mmHg [63]. For the staged exercises, patients were required to perform the craniocervical flexion action (“a nod of the head, similar to indicating yes”) [63] and attempt to visually target pressures of 22, 24, 26, 28, and 30 mmHg from a resting baseline of 20 mmHg and to hold the position steady for 10 s [61, 62]. The action of nodding was performed in a gentle and slow manner. A 10 s rest was allowed between trials. If the pressure deviated below the target pressure, the pressure was not held steady, substitution with the superficial flexors (sternocleidomastoid or anterior scalene) occurred, or neck retraction was noticed before the completion of the 10 s isometric hold, it was regarded as a failure [63]. The last successful target pressure was used to determine each patient’s exercise level wherein 3 sets of 10 repetitions with a 10 s isometric hold were performed. In addition to mobilizations and cranio-cervical flexion exercises, patients were required to perform 10 min of progressive resistance exercises (i.e., using Therabands® or free weights) to the muscles of the shoulder girdle during each treatment session, within their own tolerance, and specifically focusing on the lower trapezius and serratus anterior [11].
Sample size
The sample size and power calculations were performed using online software from the MGH Biostatistics Center (Boston, MA). The calculations were based on detecting a 2-point (or 20 %) difference in the NPRS (headache intensity) at the 3 months follow-up, assuming a standard deviation of three points, a 2-tailed test, and an alpha level equal to 0.05. This generated a sample size of 49 patients per group. Allowing for a conservative dropout rate of 10 %, we planned to recruit at least 108 patients into the study. This sample size yielded greater than 90 % power to detect a statistically significant change in the NPRS scores.
Data analysis
Descriptive statistics, including frequency counts for categorical variables and measures of central tendency and dispersion for continuous variables were calculated to summarize the data. The effects of treatment on headache intensity and disability were each examined with a 2-by-4 mixed-model analysis of variance (ANOVA), with treatment group (manipulation versus mobilization and exercise) as the between-subjects variable and time (baseline, 1 week, 4 weeks, and 3 months follow-up) as the within-subjects variable. Separate ANOVAs were performed with the NPRS (headache intensity) and NDI (disability) as the dependent variable. For each ANOVA, the hypothesis of interest was the 2-way interaction (group by time).
An independent t-test was used to determine the between group differences for the percentage change from baseline to 3-month follow-up in both headache intensity and disability. Separate Mann–Whitney U tests were performed with the headache frequency, GRC, headache duration and medication intake as the dependent variable. We performed Little’s Missing Completely at Random (MCAR) test [64] to determine if missing data points associated with dropouts were missing at random or missing for systematic reasons. Intention-to-treat analysis was performed by using Expectation-Maximization whereby missing data are computed using regression equations. Planned pairwise comparisons were performed examining the difference between baseline and follow-up periods between-groups using the Bonferroni correction at an alpha level of .05.
We dichotomized patients as responders at the 3-month follow-up using a cut score of 2 points improvement for headache intensity as measured by the NPRS. Numbers needed to treat (NNT) and 95 % confidence intervals (CI) were also calculated at the 3 months follow-up period using each of these definitions for a successful outcome. Data analysis was performed using SPSS 21.0.