- Research article
- Open Access
- Open Peer Review
Urdu version of the neck disability index: a reliability and validity study
BMC Musculoskeletal Disorders volume 18, Article number: 149 (2017)
Despite the wide use of the neck disability index (NDI) for assessing disability in patients with neck pain, the NDI has not yet been translated and validated in Urdu. The first purpose of the present study was to translate and cross-culturally adapt the NDI into the Urdu language (NDI-U). The second purpose was to investigate the reliability, validity and responsiveness of the NDI-U in Urdu-speaking patients experiencing chronic mechanical neck pain (CMNP).
Translation and cross-cultural adaptation of the original version of the NDI were carried out using previously described procedures. Seventy-six patients with CMNP and thirty healthy participants were recruited for the study. NDI-U and visual analogue scales for pain intensity (VASpain) and disability (VASdisability) were administered to all the participants at baseline and to the patients 3 weeks after receiving physiotherapy intervention. The global rating of change scale (GROC) was also administered at this time. Test-retest reliability and internal consistency were carried out on forty-six randomly selected patients two days after they completed the NDI-U. The NDI-U was evaluated for factor analysis, content validity, construct validity (discriminative and convergent validity) and responsiveness.
An intra-class correlation coefficient (ICC2,1) revealed excellent test-retest reliability for all items (ICC2,1 = 0.86–0.98) and total scores (ICC2,1 = 0.99) of the NDI-U. The NDI-U was found internally consistent with a Cronbach’s alpha of 0.90 and a fair to good correlation between single items and the NDI-U total scores (r = 0.34 to 0.89). Factor analysis of the NDI-U produced two factors explaining 66.71% of the variance. Content validity was good, as no floor or ceiling effects were detected for the NDI-U total score. To determine discriminative validity, an independent t-test revealed a significant difference in the NDI-U total scores between the patients and healthy controls (P < 0.001). For convergent validity, Pearson’s correlation coefficient showed a strong correlation between NDI-U and VASdisability (r = 0.83, P < 0.001) and a moderate correlation between NDI-U and VASpain (r = 0.62, P < 0.001). To measure responsiveness, an independent t-test showed a significant difference in the NDI-U change scores between the stable and the improved groups (P < 0.001). Furthermore, moderate correlations were found between the NDI-U change scores and the GROC (r = 0.50, P < 0.001), VASdisability change scores (r = 0.58, P < 0.001) and VASpain change scores (r = 0.55, P < 0.001).
The results showed that the NDI-U is a reliable, valid and responsive questionnaire to measure disability in Urdu-speaking patients with CMNP.
Neck pain is a major health problem with an annual prevalence ranging from 4.8 to 79.5% in the general population [1, 2]. In 50%–80% of patients with neck pain, the symptoms do not resolve completely . Neck pain may result in disability that significantly affects an individual’s activities and reduces their ability to perform activities of daily living . Therefore, it is essential to use a reliable and valid measurement tool to determine a patient’s perception of disability and to assess treatment outcomes in patients with neck pain .
Self-reported generic and region-specific questionnaires are frequently used to measure disability in patients with neck pain [6, 7]. The neck disability index (NDI) is one of the most commonly used questionnaires to measure neck pain and disability . One study reported that the NDI is a multidimensional construct that measures a broader concept than disability . Nonetheless, the original NDI developed by Vernon and Mior  is a much more reliable and validated measure of neck pain and disability, compared to other questionnaires . The NDI has stable psychometric properties confirmed by different studies [11–15]. The NDI has been translated and validated in several languages [11–29], providing a standard measure to be used in clinical practices and research studies while allowing clinicians and researchers to share knowledge, study results of interventions, and compare results across different populations [6, 16].
Many studies adapt previously recognized and frequently used assessment tools instead of developing a new questionnaire [30, 31]. The reliability and validity of the Urdu version of the neck disability index (NDI-U) has not been studied. The aim of the present study was to translate and culturally adapt the NDI to the Urdu language according to established procedures and to test the psychometric properties of the translated version in Urdu-speaking patients with chronic mechanical neck pain (CMNP).
Translation and cultural adaptation
The translation and cultural adaptation processes were started after obtaining approval from the developer of the original NDI. These processes were performed according to the guidelines previously described  and to the COSMIN (COnsensus-based Standards for the selection of health status Measurement INstruments) criteria . The entire process consisted of five steps.
Two native Urdu-speaking translators who were also fluent in English independently translated the NDI from English into Urdu. One of the translators was an English linguistic teacher, and the second was a physiotherapist. Both translators were instructed to aim for conceptual rather than literal translation. They both provided written reports.
The original translators and one of the authors produced a consensus version by synthesizing the results of both translated versions and discussing disagreements.
The agreed upon Urdu version was translated back into English by two professional translators who were blinded to the original version. Both translators were not aware of the questionnaire concept.
An expert committee including translators, researchers, a healthcare professional and a methodologist developed a pre-final version by reviewing all the translations, the consensus version, and the original questionnaire. The entire procedure was recorded.
The pre-final version of the NDI-U was tested on 30 patients with neck pain to test for face validity. The patients were requested to complete the questionnaire. Afterwards, all the items of the questionnaire were discussed with the patients one by one. We asked patients to describe what they understand about each question and to provide their impressions of the relevance of the items to their situation and their ability to complete the questionnaire on their own. Patients were also encouraged to note any problems with the wording, instructions or layout of the questionnaire. All findings from this phase of the adaptation process were evaluated by the expert committee, and the final NDI-U was then developed following consensus (Additional file 1).
Neck Disability Index (NDI)
The NDI was derived from the Oswestry Disability Index , and it consists of ten items related to pain intensity, headache, concentration and different physical activities (lifting, personal care, recreation, work, driving, reading and sleeping) with six possible responses per item . The score of each item ranges from 0 to 5 . The highest total possible score is 50, and this score is converted to a percentage. Higher scores represent higher levels of disability . The NDI has been shown to be a valid and reliable questionnaire for patients with neck pain [10, 34, 35].
Visual analogue scale for pain (VASpain)
The VASpain consists of a 100 mm horizontal line with the words “no pain” and “worst possible pain” at the line’s ends [36, 37]. Patients were asked to quantify their neck pain by drawing a vertical mark on the area of the horizontal line that best represented their pain level during the preceding 24 h. The VASpain has been shown to be a reliable and valid tool to measure pain intensity [36–39].
Visual analogue scale for disability (VASdisability)
The VASdisability also consists of a 100 mm horizontal line with the descriptors “no restriction (0)” and “worst possible restriction (100)” at the line’s ends. Patients were asked to quantify how much their neck pain restricts their daily activities by drawing a vertical mark on the area of the horizontal line that best represented their degree of restriction. The VASdisability has been shown to have good reliability in patients with chronic musculoskeletal pain .
Global Rating of Change (GROC)
The GROC is a 15 point scale that is used to assess a patient’s self-perception of pain deterioration or improvement over time . Patients were requested to rate the overall condition of their neck from −7 (“a very great deal worse”) to +7 (“a very great deal better”) since the start of treatment. The GROC has been shown to be a validated measure and is widely used as a reference standard to test other instruments [21, 29, 41–43]. Unlike other questionnaires used to assess health status, the GROC scale is simple, quick, easy to use and requires no special training or skills to administer .
Psychometric testing of the NDI-U was performed according to COSMIN guidelines .
Patients with CMNP were recruited from two hospitals located in Rawalpindi and Islamabad, Pakistan, over a period of 12 months. Neck pain was defined as chronic if the duration of the symptoms was more than three months . Both male and female patients between 18 and 65 years of age who were able to read Urdu were included in the study. Patients were excluded if they had any of the following co-morbid diagnoses: inflammatory diseases, current infection, tumours, history of fracture and surgery on the cervical spine, severe cervical myelopathy or radiculopathy, pregnancy or extensive psychiatric disorders. Moreover, 30 healthy volunteers who had no history of pain or neck pathology who were between 19 and 26 years of age were also recruited from the staff and students of the Margalla Institute of Health Sciences Rawalpindi.
The study was approved by the Institutional Review Board of the University of Lahore, Lahore, Pakistan. All the participants provided informed written consent. The screening of the participants was carried out by physiotherapists with more than ten years of clinical experience.
During the first visit, self-report measures for the NDI-U, VASpain and VASdisability were completed by the healthy participants and patients with neck pain. Weight, height and other demographic details were also recorded. After 48 h, 46 randomly selected patients completed the NDI-U again. These patients received 9 sessions (3/week) of physiotherapy treatment with each session lasting for 30 min. These were provided by physiotherapists with clinical experience of more than twelve years. After 3 weeks of physiotherapy, patients again completed the NDI-U, VASpain and VASdisability. Additionally, patients also filled out the GROC scale at this time.
Strategies for missing items on the NDI
One fundamental problem with the NDI is that a few items (especially driving and reading) are frequently omitted by some patients . Different strategies can be used to handle these missing values . Questionnaires with 1–2 missing items were included in the present study. The patient’s total score was divided by 9 or 8 (for 1 or 2 missing items, respectively), and this average score value was used as a score for the missing item . Any questionnaire with more than two unanswered items was not accepted and removed from the study .
Similar to previous studies, all patients were asked to explain why a question was not answered in a space provided at the end of the NDI-U [14, 21]. Furthermore, for all measurements, the same instructions that were printed on the questionnaires were also given verbally to all patients by the research assistant.
All analyses were carried out using IBM SPSS 21 (IBM Corp., Armonk, NY) statistical software. The significance level was set at 0.05. Participants’ characteristics were compared using descriptive statistics.
Reliability is defined as “the extent to which the measurement of a variable is free from measurement error” . In the present study, the reliability of the NDI-U was determined by assessing test-retest reliability across repeated measures, internal consistency and measurement errors . We expected that the test-retest coefficient would be > 0.80, and we set the value of Cronbach’s alpha of the NDI-U to be ≥ 0.70 [10, 12, 14, 16, 19, 25, 29, 35, 46, 47]. A fair to moderate correlation (0.25 ≤ r < 0.75) between single items and the total score was expected [10, 15, 25, 35]. Reliability was tested in 46 randomly selected patients from the total sample who completed the NDI-U. These individuals were re-tested after two days in the same way that they were tested the first time. During this period, patients were not provided with any treatment. The sample size was set based on previously developed methods  using a power calculation to determine the required sample size for a reliable study.
Test-retest reliability was determined using an intra-class correlation coefficient (ICC2,1) and 95% confidence intervals (CIs) [14, 15, 32]. ICC values of ≥ 0.75 are considered to represent studies with excellent reliability [49, 50]. Cronbach’s alpha was calculated to determine the internal consistency of the NDI-U [32, 51]. Alpha values between 0.70 and 0.95 are considered to be acceptable . The strength of the relationship between single items and total scores of the NDI-U was assessed by computing Spearman’s correlation coefficients between each item and the total score minus the score of the item being investigated . Measurement error was determined by calculating the standard error of measurement (SEM) and the smallest detectable change (SDC) . The SEM represents the standard deviation (SD) of repeated measures in the same patient. It was computed using the formula SD × √ (1 – ICC) . The SDC is the smallest change that showed the change observed is real and not due to measurement error. The SDC was calculated as 1.96 × √2 × SEM [52, 53].
Factor analysis is frequently used to determine if items of an instrument form one or more than one dimension [54, 55]. Factor analysis was performed using the principal component factor analysis with varimax rotation. Clusters of items were identified using eigenvalues > 1 . Factor loadings ≥ 0.4 was considered adequate [29, 54]. Keiser-Meyer-Olkin (KMO) test and Bartlett’s test of sphericity were used to determine if correlations were sufficiently large to perform a factor analysis . Given earlier studies showing a one-factor or two-factor structure of the NDI in other translations, an a priori assumption about the underlying factor structure of the NDI-U was not made.
Content validity is the degree to which the content of an instrument has an adequate reflection of the construct being measured . Content validity was assessed by determining the completeness of item responses and the size of floor and ceiling effects [25, 57]. We expected that there would be less than 5% missing items for the cumulative responses of all the patients and that there would be no floor and ceiling effects [11, 14, 16, 25, 29, 46, 47]. Floor and ceiling effects were considered to be present if > 15% of the respondents achieved the lowest or highest possible total score .
Construct validity was assessed by determining the differences in the NDI-U total scores between patients and healthy controls (discriminative validity) with an independent t-test. We predicted that there would be a significant difference in total scores between these two groups . Construct validity was also assessed by measuring the correlation between NDI-U and VASdisability and VASpain (convergent validity) using Pearson’s correlation coefficients . A moderate correlation between NDI-U and VASdisability [11, 25, 46, 47] and a fair to moderate correlation between NDI-U and VASpain [11, 17, 19, 25, 35, 46, 47] were expected. The validity was considered good when at least 75% of the results matched the hypotheses .
Responsiveness is defined as “the ability of an instrument to detect change over time in the construct to be measured” . After three weeks of treatment, patients were divided into an improved group (GROC ≥ 3 (somewhat better)) and a stable group (GROC < 3 to > −3) . The change in GROC scores between −3 and 3 has been described as minimal to no change . Responsiveness was analysed by comparing the NDI-U change scores between these two groups with an independent t-test [29, 60]. We predicted that there would be a significant difference in the NDI-U change scores between the improved and stable groups [29, 60]. We also assessed responsiveness by correlating the NDI-U change scores to the GROC [21, 29] and by correlating the change scores of the NDI-U with the change scores of the VASpain and VASdisability . Pearson’s correlation coefficients were used to quantify these relationships. Moderate correlations were expected between the NDI-U change scores and the GROC, VASdisability and VASpain change scores.
Portney and Watkins  criteria were used to interpret the correlations as follows: r < 0.25 indicates no or little correlation, 0.25 ≤ r < 0.50 indicates fair correlation, 0.50 ≤ r < 0.75 indicates moderate correlation, and 0.75 ≤ r ≤ 1 indicates good correlation.
Translation and cultural adaptation
There were 13 patients who did not know how to drive a car, so they did not respond to item 8, which was related to driving. One patient did not answer item 4 related to reading, stating that he did not want to give an answer based on an assumption, as the item was not related to his life. It was decided not to change these sections since these problems could be overcome by any type of modification.
After thoroughly discussing replacing the word “pain” with “neck pain” for the items related to lifting, personal care and pain intensity and adding the option “never done” for the item related to driving, modifications performed in other translations [21, 29, 46], we decided to avoid such changes so as to be as close to the original version as possible. The patients’ general impression of the NDI-U was that both the instructions and items of the questionnaire were easy to understand and easy to complete. Furthermore, patients stated that all the included items were relevant to their underlying pain condition. Therefore, no major change was made to NDI-U after performing the pre-test.
Ninety-two patients with chronic neck pain were assessed for eligibility. Twelve patients did not meet the inclusion criteria and were excluded from the study (Fig. 1). Four patients declined to participate. The eligible patients included 30 males and 46 females. Two patients dropped out during the treatment and therefore did not complete the NDI-U, VASpain, VASdisability and GROC scale upon the completion of treatment. The data of these patients were not included in the follow-up analysis. The healthy participants were sex-matched to the patients. The demographic and clinical characteristics of the participants are shown in Table 1.
Test-retest reliability and internal consistency
The mean and standard deviation for scores of all the items, the total scores, and the reliability results of the NDI-U are shown in Table 2. The results demonstrated excellent test-retest reliability for all the items (ICC2,1 = 0.86–0.98) and total scores (ICC2,1 = 0.99) of the NDI-U. An excellent internal consistency was demonstrated with Cronbach’s alpha of 0.90. A fair to good correlation was found between single items and total scores of the NDI-U with Spearman’s correlation coefficients of 0.34 to 0.89, confirming that the NDI-U is internally consistent. The SEM and SDC for NDI-U total scores were 0.84 and 2.33, respectively.
The results of a KMO measure of sampling adequacy and Bartlett’s test of sphericity found that the KMO value was satisfactorily high (0.90) and that the Bartlett’s test was significant (P < 0.001). Based on eigenvalues > 1, a two-factor structure was demonstrated. The eigenvalue of the first factor was 5.59, which explained 36.16% of the variance. The second factor had an eigenvalue of 1.08, which explained an additional 30.55% of the variance. The total variance explained by the two factors was 66.71%. A Scree Plot (Fig. 2) also supported the presence of a two-factor structure because the plotted line straightens out after the first two factors. Factor loading for all items is shown in Table 3.
Mean scores of individual items ranged from 1.21 to 2.16 (Table 4). Descriptive statistics showed 27 patients with 1 missing item (item 8) and 5 patients with 2 missing items (item 4 & 8). Missing responses to items represented less than 5% of the total 760 NDI-U items. No floor and ceiling effects were detected for the NDI-U total score, as no patient achieved the lowest or highest possible total scores. However, the items related to personal care, headache, concentration, work, and sleeping had floor effects with 31.5, 30.3, 25, 17.1, and 35.5% of the patients scoring the lowest possible value, respectively. There were no ceiling effects for the individual items.
Results showed a significant difference in the NDI-U total scores between patients and healthy controls (P < 0.001). Subgroup analyses between patients (n = 23) and healthy controls (n = 30) of similar age groups also showed significant differences in the total scores (P < 0.001). A good correlation was found between NDI-U and VASdisability (Pearson’s correlation coefficient = 0.83, P < 0.001), and a moderate correlation was observed between NDI-U and VASpain (Pearson’s correlation coefficient = 0.62, P < 0.001). The results are shown in Table 5.
An independent t-test found a statistically significant difference in the NDI-U change scores between the two groups (9.02 ± 6.78 in the improved group, n = 49; 2.67 ± 4.26 in the stable group, n = 25; P < 0.001). A moderate correlation was found between the NDI-U change scores and GROC values (Pearson’s correlation coefficient = 0.50, P < 0.001). A moderate correlation was also found between NDI-U and VASdisability change scores (Pearson’s correlation coefficient = 0.58, P < 0.001) and between NDI-U and VASpain change scores (Pearson’s correlation coefficient = 0.55, P < 0.001).
As far as we know, this is the first study that translated and cross culturally adapted the NDI into Urdu and tested the reliability, validity and responsiveness of the NDI-U. The psychometric properties of the NDI-U were tested using pre-defined hypotheses. The results indicated that NDI-U has good reliability, validity and responsiveness.
Studying the adaptation process showed that the NDI-U was successfully developed according to established guidelines. The difficulties encountered during the adaptation process were handled by consensus decisions and the use of careful wording. The NDI-U was found to be simple and easy to use in clinical settings.
In the present study, there were more females (60.5%) than males (39.5%). This is comparable to earlier studies that have also recruited more females (52–78%) [14–16, 20, 21, 25, 35, 62–65] but in contrast to the Arabic version of the NDI that included more males (69.2%) than females (30.8%) . In current study, the patients had mean age of 43 years, which is comparable to previous studies (35–47 years) [14, 15, 20, 25, 29, 63, 65]. However, in some other studies, the mean age of the patients was higher (50–62 year) [21, 35, 64].
An excellent internal consistency was demonstrated by a Cronbach’s alpha value of 0.90, which is well in the range of the findings of earlier studies (0.74–0.96) [10, 12, 14, 19, 21–23, 25, 28, 29, 35, 60, 64]. The variations in the correlations between single items and total scores (0.34 to 0.89) in the present study were comparable to the results of other studies (0.40 to 0.84) [10, 25, 35]. The present study found excellent test-retest reliability, comparable to the original study and other translations [10–12, 14, 17, 19, 21–23, 28, 29, 35, 60]. However, the test-retest reliability is higher compared to the German (0.81), Dutch (0.84), Italian (0.84) and Thai (0.85) versions of the NDI [26, 27, 64, 65]. Cleland et al.  found a very low ICC (0.50). Similarly, Cook et al.  found an ICC value of 0.48 upon retest. In another study conducted by Vos et al. , a very low ICC (0.53) was measured in the personal care item. These variations in the test-retest results may be due to the use of different intervals to determine test-retest reliability. To avoid major changes in the patients’ conditions, an interval of 2–3 days was recommended by Dawson et al. . On the other hand, Deyo et al.  and Terwee et al.  recommended using a 1–2 week gap between testing and retesting to avoid memory effects. In the present study, a two-day interval was used to ensure that minimal changes in the patients’ conditions took place; the results obtained were similar to those of other studies that also used short test-retest intervals [11, 19, 46, 62, 69].
Based on the results of the current study, a change of at least 3 points on the NDI-U (0–50 scale) is required to label the change as a “real change”. This result is well within the range of findings observed in other studies (2–8 points on a 0–50 scale) [14, 21, 27, 65]. Young et al.  reported a SDC score of 13.4 points, but this study was performed on patients with cervical radiculopathy. In a systematic review performed by MacDermid et al. , the SDC was reported to be approximately 5 points (0–50 scale) for uncomplicated neck pain and approximately 10 points (0–50 scale) for cervical radiculopathy.
Many studies have analysed the factor analysis of the NDI and other translations. Some studies found a one-factor structure [12, 15, 21–23, 27, 34, 64, 72], and others found a two-factor structure [11, 14, 20, 26, 29, 60]. A two-factor structure was found in the present study, explaining 66.71% of the variance. This result is comparable to what was observed with the Japanese , Arabic , and German  versions, where a two-factor structure explained 61.8%, 67.58%, and 67% of the variances, respectively. However, our results of 66.71% of the variance being explained by a two-factor structure is higher than values of other versions (54–56%) [11, 20, 26]. The structure of the NDI-U is similar to those of other adaptations, with one factor related to “cognitive functioning” (items 2, 5, 6, 9, 10) and the other factor related to “pain and functional disability” (items 1, 3, 4, 7, 8). The association of the pain item with function agrees with results of the German version  but disagrees with results of the Arabic and original versions [20, 29]. Furthermore, the association of the driving item to “functional disability” agrees with the findings of the Catalan version  but disagrees with the Arabic and German versions [14, 29]. Although items 4, 7 and 8 are loaded with both factors, they are loaded more heavily with the factor labelled as “pain and functional disability”. There are some discrepancies in the factor structure of the current study compared with other studies. However, the assessment of factorial structure can be influenced by cultural differences .
The present study had 32 patients (42.10%) who did not complete item 8 (driving). These results are comparable with the Japanese and Greek versions where 38.2% and 44.6% of the patients, respectively, did not answer this item [21, 60]. In contrast, other studies reported less patients (2.2% to 30.76%) who did not answer item 8 [11, 15, 17, 27, 29]. One explanation to these differences may be the reason provided by our patients in that that they do not know how to drive. Thus, we assumed that the patients’ lack of response to this item was not secondary to a problem in translation; as such, we did not feel it was necessary to make any changes to this section.
There were also 5 patients (6.58%) who did not complete item 4 (reading). This result was slightly lower than that reported by Trouli et al. (9.2%) . The patients who missed this item stated that they did not want to answer, as reading was not relevant to their lives.
The present study did not find any floor or ceiling effects for the NDI-U total scores. However, floor effects were observed for individual items (items 2, 5, 6, 7, 9). These results were comparable to those of the Finish (2 items) , Korean (3 items) , and Dutch (2 items) versions  of the NDI that have reported floor effects for individual items.
Criterion validity of the NDI-U was not analysed due to the unavailability of a gold standard for health-related questionnaires . The NDI-U was found to have good construct validity. Indeed, the translated version detected significant differences in the NDI-U total scores between the patients and the healthy controls, consistent with the German version of the NDI . Furthermore, the NDI-U showed positive correlations between total scores and either VASpain or VASdisability, consistent with previous studies [22, 25, 64]. The effect size of the correlation between NDI-U and VASdisability was good (r = 0.83) in the present study but only moderate (r = 0.52) in the Dutch version of the NDI . The correlation between NDI-U and VASpain (r = 0.58) was similar to the findings of the Iranian, Spanish, Turkish and German versions (r = 0.51–0.71) [14, 17, 19, 22] but higher than other versions (r = 0.22–0.43) [25, 64].
Regarding responsiveness, the NDI is considered to be a suitable test to detect changes over time. The NDI is frequently used in patients with neck pain to evaluate the effectiveness of treatment strategies . The present study found significant differences between the stable and improved groups in their NDI-U scores, similar to previous studies [29, 60, 64]. Furthermore, a significant correlation was observed between NDI-U change scores and GROC values, which agrees with the results of the earlier studies [21, 29]. The strength of the correlation was moderate in the present study, poor in the Geek version , and good in the Arabic version . The instrument showed positive moderate correlations between NDI-U change scores and VASpain and VASdisability change scores.
First, a short interval was used to ensure patient conditions remained as stable as possible to determine test-retest reliability. Therefore, memory effects on our results cannot be completely ruled out. Second, our sample mainly included patients with mild to moderate disability from CMNP. Therefore, it may not be appropriate to extrapolate our results to patients with severe or (sub)acute disability or to patients having neck pain secondary to non-mechanical causes. Third, data were mainly collected from patients attending outpatient physiotherapy clinics. Therefore, the sample may not be a true representation of the general population experiencing neck pain. Consequently, the results cannot be generalized to inpatients. Finally, healthy controls were not age-matched to the patients. The authors believe that the generalizability of the results to the general population should not be affected, as the subgroup analysis between patients and healthy controls of a similar age group also found a significant difference in total scores between the two groups.
The strength of this study is that the psychometric properties of the NDI-U were tested using pre-defined hypotheses. Another strength of the study is that, to the best of the authors’ knowledge, it was the first study to measure the responsiveness on the index by determining the correlation of change scores between the NDI-U and the VASdisability.
The NDI-U is a reliable, valid and responsive questionnaire that has a 2-factor structure. It consists of simple words that can be easily understood by the patients. Therefore, the NDI-U can be used to evaluate neck disability in Urdu-speaking patients with CMNP in clinical and research settings.
Chronic mechanical neck pain
COnsensus-based Standards for the selection of health status Measurement INstruments
Global rating of change scale
Intra-class correlation coefficient
Neck disability index
Urdu version of the neck disability index
Smallest detectable change
Standard error of measurement
- VASdisability :
Visual analogue scale for disability
- VASpain :
Visual analogue scale for pain.
Hoy DG, Protani M, De R, Buchbinder R. The epidemiology of neck pain. Best Pract Res Clin Rheumatol. 2010;24(6):783–92.
Cohen SP. Epidemiology, diagnosis, and treatment of neck pain. Mayo Clin Proc. 2015;90(2):284–99.
Carroll LJ, Hogg-Johnson S, van der Velde G, Haldeman S, Holm LW, Carragee EJ, Hurwitz EL, Côté P, Nordin M, Peloso PM. Course and prognostic factors for neck pain in the general population: results of the Bone and Joint Decade 2000–2010 Task Force on Neck Pain and Its Associated Disorders. J Manipulative Physiol Ther. 2009;32(2):S87–96.
Manchikanti L, Singh V, Datta S, Cohen SP, Hirsch JA, American Society of Interventional Pain P. Comprehensive review of epidemiology, scope, and impact of spinal pain. Pain Physician. 2009;12(4):E35–70.
Chapman JR, Norvell DC, Hermsmeyer JT, Bransford RJ, DeVine J, McGirt MJ, Lee MJ. Evaluating common outcomes for measuring treatment success for chronic low back pain. Spine. 2011;36:S54–68.
Pietrobon R, Coeytaux RR, Carey TS, Richardson WJ, DeVellis RF. Standard scales for measurement of functional outcome for cervical pain or dysfunction: a systematic review. Spine (Phila Pa 1976). 2002;27(5):515–22.
Schellingerhout JM, Verhagen AP, Heymans MW, Koes BW, de Vet HC, Terwee CB. Measurement properties of disease-specific questionnaires in patients with neck pain: a systematic review. Qual Life Res. 2012;21(4):659–70.
Vernon H. The Neck Disability Index: state-of-the-art, 1991–2008. J Manipulative Physiol Ther. 2008;31(7):491–502.
van Randeraad-van der Zee CH, Beurskens AJ, Swinkels RA, Pool JJ, Batterham RW, Osborne RH, de Vet HC. The burden of neck pain: its meaning for persons with neck pain and healthcare providers, explored by concept mapping. Qual Life Res. 2016;25(5):1219–25.
Vernon H, Mior S. The Neck Disability Index: a study of reliability and validity. J Manipulative Physiol Ther. 1991;14(7):409–15.
Wlodyka-Demaille S, Poiraudeau S, Catanzariti JF, Rannou F, Fermanian J, Revel M. French translation and validation of 3 functional disability scales for neck pain. Arch Phys Med Rehabil. 2002;83(3):376–82.
Cook C, Richardson JK, Braga L, Menezes A, Soler X, Kume P, Zaninelli M, Socolows F, Pietrobon R. Cross-cultural adaptation and validation of the Brazilian Portuguese version of the Neck Disability Index and Neck Pain and Disability Scale. Spine (Phila Pa 1976). 2006;31(14):1621–7.
Vos CJ, Verhagen AP, Koes BW. Reliability and responsiveness of the Dutch version of the Neck Disability Index in patients with acute neck pain in general practice. Eur Spine J. 2006;15(11):1729–36.
Swanenburg J, Humphreys K, Langenfeld A, Brunner F, Wirth B. Validity and reliability of a German version of the Neck Disability Index (NDI-G). Man Ther. 2014;19(1):52–8.
Misterska E, Jankowski R, Glowacki M. Cross-cultural adaptation of the Neck Disability Index and Copenhagen Neck Functional Disability Scale for patients with neck pain due to degenerative and discopathic disorders. Psychometric properties of the Polish versions. BMC Musculoskelet Disord. 2011;12(1):84.
Lee H, Nicholson LL, Adams RD, Maher CG, Halaki M, Bae SS. Development and psychometric testing of Korean language versions of 4 neck pain and disability questionnaires. Spine (Phila Pa 1976). 2006;31(16):1841–5.
Aslan E, Karaduman A, Yakut Y, Aras B, Simsek IE, Yagly N. The cultural adaptation, reliability and validity of neck disability index in patients with neck pain: a Turkish version study. Spine (Phila Pa 1976). 2008;33(11):E362–5.
Telci EA, Karaduman A, Yakut Y, Aras B, Simsek IE, Yagli N. The cultural adaptation, reliability, and validity of neck disability index in patients with neck pain: a Turkish version study. Spine (Phila Pa 1976). 2009;34(16):1732–5.
Mousavi SJ, Parnianpour M, Montazeri A, Mehdian H, Karimi A, Abedi M, Ashtiani AA, Mobini B, Hadian MR. Translation and validation study of the Iranian versions of the Neck Disability Index and the Neck Pain and Disability Scale. Spine (Phila Pa 1976). 2007;32(26):E825–31.
Nieto R, Miro J, Huguet A. Disability in subacute whiplash patients: usefulness of the neck disability index. Spine (Phila Pa 1976). 2008;33(18):E630–5.
Trouli MN, Vernon HT, Kakavelakis KN, Antonopoulou MD, Paganas AN, Lionis CD. Translation of the Neck Disability Index and validation of the Greek version in a sample of neck pain patients. BMC Musculoskelet Disord. 2008;9:106.
Andrade Ortega JA, Delgado Martinez AD, Almecija Ruiz R. Validation of the Spanish version of the Neck Disability Index. Spine (Phila Pa 1976). 2010;35(4):E114–8.
Salo P, Ylinen J, Kautiainen H, Arkela-Kautiainen M, Hakkinen A. Reliability and validity of the finnish version of the neck disability index and the modified neck pain and disability scale. Spine (Phila Pa 1976). 2010;35(5):552–6.
Wu S, Ma C, Mai M, Li G. Translation and validation study of Chinese versions of the neck disability index and the neck pain and disability scale. Spine (Phila Pa 1976). 2010;35(16):1575–9.
Jorritsma W, de Vries GE, Dijkstra PU, Geertzen JH, Reneman MF. Neck Pain and Disability Scale and Neck Disability Index: validity of Dutch language versions. Eur Spine J. 2012;21(1):93–100.
Monticone M, Ferrante S, Vernon H, Rocca B, Dal Farra F, Foti C. Development of the Italian Version of the Neck Disability Index: cross-cultural adaptation, factor analysis, reliability, validity, and sensitivity to change. Spine (Phila Pa 1976). 2012;37(17):E1038–44.
Uthaikhup S, Paungmali A, Pirunsan U. Validation of Thai versions of the Neck Disability Index and Neck Pain and Disability Scale in patients with neck pain. Spine (Phila Pa 1976). 2011;36(21):E1415–21.
Song K-J, Choi B-W, Choi B-R, Seo G-B. Cross-cultural adaptation and validation of the Korean version of the neck disability index. Spine. 2010;35(20):E1045–9.
Shaheen AAM, Omar MTA, Vernon H. Cross-cultural adaptation, reliability, and validity of the Arabic version of Neck Disability Index in patients with neck pain. Spine. 2013;38(10):E609–15.
Guillemin F, Bombardier C, Beaton D. Cross-cultural adaptation of health-related quality of life measures: literature review and proposed guidelines. J Clin Epidemiol. 1993;46(12):1417–32.
Beaton DE, Bombardier C, Guillemin F, Ferraz MB. Guidelines for the process of cross-cultural adaptation of self-report measures. Spine (Phila Pa 1976). 2000;25(24):3186–91.
Mokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL, Bouter LM, de Vet HC. The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: an international Delphi study. Qual Life Res. 2010;19(4):539–49.
Fairbank JC, Couper J, Davies JB, O’Brien JP. The Oswestry low back pain disability questionnaire. Physiotherapy. 1980;66(8):271–3.
Hains F, Waalen J, Mior S. Psychometric properties of the neck disability index. J Manipulative Physiol Ther. 1998;21(2):75–80.
McCarthy MJ, Grevitt MP, Silcocks P, Hobbs G. The reliability of the Vernon and Mior neck disability index, and its validity compared with the short form-36 health survey questionnaire. Eur Spine J. 2007;16(12):2111–7.
Huskisson EC. Measurement of pain. Lancet. 1974;2(7889):1127–31.
Ferreira-Valente MA, Pais-Ribeiro JL, Jensen MP. Validity of four pain intensity rating scales. Pain. 2011;152(10):2399–404.
Bijur PE, Silver W, Gallagher EJ. Reliability of the visual analog scale for measurement of acute pain. Acad Emerg Med. 2001;8(12):1153–7.
Williamson A, Hoggart B. Pain: a review of three commonly used pain rating scales. J Clin Nurs. 2005;14(7):798–804.
Boonstra AM, Schiphorst Preuper HR, Reneman MF, Posthumus JB, Stewart RE. Reliability and validity of the visual analogue scale for disability in patients with chronic musculoskeletal pain. Int J Rehabil Res. 2008;31(2):165–9.
Kamper SJ, Maher CG, Mackay G. Global rating of change scales: a review of strengths and weaknesses and considerations for design. J Man Manip Ther. 2009;17(3):163–70.
Pengel LH, Refshauge KM, Maher CG. Responsiveness of pain, disability, and physical impairment outcomes in patients with low back pain. Spine (Phila Pa 1976). 2004;29(8):879–83.
Rebbeck TJ, Refshauge KM, Maher CG, Stewart M. Evaluation of the core outcome measure in whiplash. Spine (Phila Pa 1976). 2007;32(6):696–702.
Gross A, Miller J, D’Sylva J, Burnie SJ, Goldsmith CH, Graham N, Haines T, Bronfort G, Hoving JL, COG. Manipulation or mobilisation for neck pain: a Cochrane Review. Man Ther. 2010;15(4):315–33.
Mokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL, Bouter LM, de Vet HC. The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes. J Clin Epidemiol. 2010;63(7):737–45.
Kesiktas N, Ozcan E, Vernon H. Clinimetric properties of the Turkish translation of a modified neck disability index. BMC Musculoskelet Disord. 2012;13:25.
Kose G, Hepguler S, Atamaz F, Oder G. A comparison of four disability scales for Turkish patients with neck pain. J Rehabil Med. 2007;39(5):358–62.
Walter S, Eliasziw M, Donner A. Sample size and optimal designs for reliability studies. Stat Med. 1998;17(1):101–10.
Fleiss JL. The Design and Analysis of Clinical Experiments. New York: Wiley; 1999.
Rosner B. Fundamentals of Biostatistics. 7th ed. Boston: Cengage Learning; 2010.
Schellingerhout JM, Heymans MW, Verhagen AP, de Vet HC, Koes BW, Terwee CB. Measurement properties of translated versions of neck-specific questionnaires: a systematic review. BMC Med Res Methodol. 2011;11(1):87.
Terwee CB, Bot SD, de Boer MR, van der Windt DA, Knol DL, Dekker J, Bouter LM, de Vet HC. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007;60(1):34–42.
de Vet HC, Terwee CB, Knol DL, Bouter LM. When to use agreement versus reliability measures. J Clin Epidemiol. 2006;59(10):1033–9.
Norman GR, Streiner DL. Biostatistics: The Bare Essentials. Hamilton: B.D. Decker; 1994.
Sim J, Wright C. Research in health care: concepts, designs and methods. Cheltenham: Nelson Thornes; 2000.
Ntoumanis N. A step-by-step guide to SPSS for sport and exercise studies. London: Routledge London; 2001.
Mokkink LB, Terwee CB, Knol DL, Stratford PW, Alonso J, Patrick DL, Bouter LM, de Vet HC. The COSMIN checklist for evaluating the methodological quality of studies on measurement properties: a clarification of its content. BMC Med Res Methodol. 2010;10:22.
Young BA, Walker MJ, Strunce JB, Boyles RE, Whitman JM, Childs JD. Responsiveness of the Neck Disability Index in patients with mechanical neck disorders. Spine J. 2009;9(10):802–8.
Jaeschke R, Singer J, Guyatt GH. Measurement of health status. Ascertaining the minimal clinically important difference. Control Clin Trials. 1989;10(4):407–15.
Nakamaru K, Vernon H, Aizawa J, Koyama T, Nitta O. Crosscultural adaptation, reliability, and validity of the Japanese version of the neck disability index. Spine. 2012;37(21):E1343–7.
Portney LG, Watkins MP. Foundations of clinical research: applications to practice. 2nd ed. New Jersey: Prentice Hall Health, Upper Saddle River; 2000.
Ackelman BH, Lindgren U. Validity and reliability of a modified version of the neck disability index. J Rehabil Med. 2002;34(6):284–7.
Johansen JB, Andelic N, Bakke E, Holter EB, Mengshoel AM, Roe C. Measurement properties of the norwegian version of the neck disability index in chronic neck pain. Spine (Phila Pa 1976). 2013;38(10):851–6.
Cramer H, Lauche R, Langhorst J, Dobos GJ, Michalsen A. Validation of the German version of the Neck Disability Index (NDI). BMC Musculoskelet Disord. 2014;15:91.
Jorritsma W, de Vries GE, Geertzen JH, Dijkstra PU, Reneman MF. Neck Pain and Disability Scale and the Neck Disability Index: reproducibility of the Dutch Language Versions. Eur Spine J. 2010;19(10):1695–701.
Cleland JA, Childs JD, Whitman JM. Psychometric properties of the Neck Disability Index and Numeric Pain Rating Scale in patients with mechanical neck pain. Arch Phys Med Rehabil. 2008;89(1):69–74.
Dawson J, Fitzpatrick R, Carr A. Questionnaire on the perceptions of patients about shoulder surgery. J Bone Joint Surg (Br). 1996;78(4):593–600.
Deyo RA, Diehr P, Patrick DL. Reproducibility and responsiveness of health status measures. Statistics and strategies for evaluation. Control Clin Trials. 1991;12(4 Suppl):142S–58S.
Odole AC, Adegoke BO, Akomas NC. Validity and test re-test reliability of the neck disability index in the Nigerian clinical setting. Afr J Med Med Sci. 2011;40(2):135–8.
Young IA, Cleland JA, Michener LA, Brown C. Reliability, construct validity, and responsiveness of the neck disability index, patient-specific functional scale, and numeric pain rating scale in patients with cervical radiculopathy. Am J Phys Med Rehabil. 2010;89(10):831–9.
MacDermid JC, Walton DM, Avery S, Blanchard A, Etruw E, McAlpine C, Goldsmith CH. Measurement properties of the neck disability index: a systematic review. J Orthop Sports Phys Ther. 2009;39(5):400–17.
Pickering PM, Osmotherly PG, Attia JR, McElduff P. An examination of outcome measures for pain and dysfunction in the cervical spine: a factor analysis. Spine (Phila Pa 1976). 2011;36(7):581–8.
White P, Lewith G, Prescott P, Conway J. Acupuncture versus placebo for the treatment of chronic mechanical neck pain: a randomized, controlled trial. Ann Intern Med. 2004;141(12):911–9.
Availability of data and materials
All data generated or analysed during this study are included in this published article [and its supplementary information files].
MNF, MAMB and SAG: Substantial contribution to conception and design. MNF, MAMB and AH: Acquisition of data. MNF: Analysis and interpretation of data. MNF, MAMB and AH: Drafting of the manuscript. MAMB and SAG: Critical revision of the manuscript for important intellectual content. MNF and AH: Statistical analysis. All authors: Final approval of the manuscript.
The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
The study was approved by the Institutional Review Board of the University of Lahore, Lahore, Pakistan. (IRB No: 1036).
All the participants provided informed written consent.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.