- Research article
- Open Access
- Open Peer Review
Reliability of three landmarking methods for dual inclinometry measurements of lumbar flexion and extension
BMC Musculoskeletal Disordersvolume 16, Article number: 121 (2015)
To examine the intra and inter-rater reliability of lumbar flexion and extension measurements attained using three landmarking methods for dual inclinometry.
This was a repeated measures reliability study. Convenience sampling was used to obtain forty volunteer subjects. Two assessors measured a series of lumbar flexion and extension movements using the J-Tech™ dual inclinometer. Three different landmarking methods were used: 1) straight palpation of PSIS and L1, 2) palpation of PSIS and the site of the nearest 5 cm interval point closest to L1 and 3) location of PSIS and 15 cm cephalad. Upon landmarking, adhesive tape was used to mark landmarks and the inclinometer was placed on sites for three trials of flexion and extension. Tape was removed and landmarks were relocated by the same assessor (intra-rater) for an additional three trials; and this process was repeated by a second assessor (inter-rater). Reliability was determined using intra-class correlation coefficients.
Reliability within a set of three repetitions was very high (ICCs > 0.90); intra-rater reliability after relocating landmarks was high (ICCs > 0.80); reliability between therapists was moderate to high (0.60 > ICCs < 0.76). Assessment of flexion and extension movements by straight palpation of bony landmarks as in the Straight palpation of PSIS and L1 method (ICC: Flexion 0.60; Extension 0.74) was found to be marginally less reliable than the other two landmarking measurement strategies (ICC: Flexion 0.66; Extension 0.76).
All three methods of land marking are reliable. We recommend the use of the PSIS to 15 cm cephalad method as used in the modified-modified Schobers test as it is the simplest to perform and aligns with current clinical practice.
Low back pain (LBP) is one of the most common musculoskeletal problems across the globe with an estimated lifetime incidence of 60-80 % . The high costs of low back pain are borne by the government, insurance companies, and the general public in the form of medical treatments and impairment compensations [2, 3]. Range of motion (ROM) amongst other variables has been identified to be an indicator of impairment due to back pain .
A great deal of research has gone into examining different tools for measuring spinal mobility [5–8]. Tools available for clinical use include: plumb line,  goniometry,  fingertip to floor,  flexicurve,  tape measures,  visual-photographs,  and dual inclinometry.  At present, there is no conclusive evidence to advocate one method over another . Radiographs are considered the gold standard for measuring absolute joint motion. [4, 7] However, performing a radiograph on every patient with LBP is costly and time-consuming in a clinical setting. Furthermore, range of motion measurements often reflect physical impairment or functional mobility in which case external measures of the ROM are more directly applicable.
One of the methods that has been used to measure joint mobility in research and clinical practice is computerized dual inclinometry (CDI). Previous studies have addressed the validity and reliability of the CDI [3, 7, 8, 17–19]. An examination by Nitschke et al.  showed dual inclinometry had acceptable intra-rater reliability results (flexion ICC = 0.90, extension ICC = 0.71), but fair to poor inter-rater values (flexion ICC = 0.67, extension ICC = 0.35). In a systematic review on the validity of instruments used to measure lumbar ROM, three out of the four studies included were on dual inclinometry . The authors of that review concluded that there was little evidence to support current measures of lumbar ROM. The poor results were mostly attributed to differences between the raters, reflected by the accuracy and consistency of manual palpation of bony landmarks, and the handling of the inclinometer heads along the subject’s spine . Nonetheless, dual inclinometry is considered to be more valid tool than modified-modified Schobers (MMS)  because the measurement results are expressed in degrees, which correlate with the angular motion of lumbar flexion and extension .
According to guidelines set out by the American Medical Association (AMA) in its 4th and 5th edition, compensation entitlements for patients with low back pain were based in part, on the impairment of back movement [23, 24]. It recommended the use of multiple measures including the CDI and the MMS. In the 6th edition however, ROM was removed from the list because of variability in results and lack of strong evidence supporting the validity and reliability of the ROM measures that are currently used . When making judgments on a patient’s level of impairment, particularly when his/her future financial compensations hang in the balance, reliable methods are essential for determining impairment in spinal motion.
When making routine clinical decisions, a variety of measurement characteristics are important to consider. Ideally tools and the associated procedures to use them are reliable, valid, cost-effective & easy-to-use . While a number of factors including rater skill, equipment limitations, patient selection and pain status may affect reliability of results, certainly landmarking is one of the major issues that affects the reliability and validity of joint motion measurements. Findings from previous reliability studies of the lumbar ROM measures suggest that palpation inaccuracies were the main source of error [3, 7, 8, 18, 19, 26, 27]. On the other hand, studies on palpation skills of clinicians in palpating and identifying spinal levels indicate great variability in terms of reproducibility and repeatability between clinicians [28–32]. Hence, one strategy for improving reliability is to study the impact of different landmarking techniques. The utility of a landmarking method will be based on producing accurate measurements with highest reliability with the least difficulty, time and cost. The purpose of this study is to examine the intra-rater and inter-rater reliability of lumbar flexion and extension measurements with a J-Tech dual inclinometer,Footnote 1 using three different landmarking methods.
Repeated measures reliability study.
Forty subjects (26 male, 14 female), ranging in age from 19 to 71 (mean = 34.2, SD = 14.5) were recruited by convenience sampling to participate in the study. (see Table 1) They were recruited from the community by posters and word of mouth. Any one above the age of 18 was selected. LBP was not a factor in the inclusion criteria for subject selection. They were excluded if they had experienced any trauma or surgery to their back or if they have any difficulties following instruction. The purpose of the study was explained to each subject and written informed consent was obtained before testing began. Subjects were also informed that participation in the study was completely voluntary, and they could withdraw at any time. This study was approved by the Health Sciences Research Ethics Board (HSREB) of the University of Western Ontario in London, Ontario, Canada.
The three landmarking techniques that were used. Landmarks were chosen based on our knowledge of anatomy of the lumbar spine and biomechanical measurement strategies used in the MMS .
Method 1- Straight palpation of PSIS and L1
To determine the lower landmark, subject’s PSISs were palpated and a line connecting both PSIS represents the level of S2. The upper landmark was determined by counting up spinous processes from S2 to L1.
Method 2 – PSIS to cephalad 10 or 15 cm
To determine lower landmark, the subject’s PSISs were palpated and a line connecting both PSIS represents the level of S2. The upper landmark was determined by counting up spinous processes from S2 to L1. The distance between S2 and L1 was measured in centimeters and rounded to either 10 or 15 cm based on proximity. An adhesive mark was then placed at the 10 or 15 cm level above the S2 level located.
Method 3 – PSIS to 15 cm cephalad
To determine lower landmark, the subject’s PSISs were palpated and a line connecting both PSIS represents the level of S2. The upper landmark was determined by measuring exactly 15 cm above the located S2 level.
Each subject was asked to complete a set of three lumbar flexion and extension movements prior to testing as a warm-up procedure. Three different land marking methods (see below) were used to identify the start & end of the lumbar spine to be measured. The upper and lower spinal landmarks were marked by a horizontal line on a piece of adhesive tape.Footnote 2 Adhesive marks were removed and re-landmarked for each set of data. The two heads of the dual inclinometer were placed at the low marked levels along the spine; the MASTER head at the upper landmark and the SLAVE head placed at the lower landmark. During each set, subjects performed a series of three alternating flexion and extension movements. All subjects were instructed to remove their shoes and to stand upright with feet shoulder-width apart, and both knees straight throughout the process. Subjects’ shirts were lifted up and clipped, in order to expose the lumbar spine. The instructions given to each subject were, “bend forward towards your toes starting with tucking your chin to your chest and slowly leaning down towards the floor” and “bend backwards as far as you can, with your hands on your waist and knees straight.” The J-Tech™ equipment was calibrated before each set of repeated movements. Two sets of data were obtained for lumbar flexion and extension for each rater and each landmarking method for a total of 160 sets. One set constituted three lumbar flexion movements and three lumbar extension movements that were alternating in nature. The adhesive marks were removed for each set and the order of rater was randomized.
Inter and intra rater reliability was calculated using intra class correlations (ICC) [33, 34]. An ICC value can range between 0 and 1 with zero indicating no reliability and 1 indicating perfect reliability.
All three methods exhibited high to very high intra-rater reliability (ICC 0.78 to 0.93) for both lumbar flexion and extension values. (see Table 2) Extension measurements demonstrated high intra-rater reliability than flexion measurements. None of the three methods was superior to the other.
All three methods demonstrated high inter-rater reliability (ICC > 0.74) for extension measurements, while only moderate inter-rater reliability (ICC > 0.60) was observed for the three methods of landmarking for flexion measurements (see Table 3).
We used the following benchmarks in interpreting our results: 1 indicates perfect reliability, 0.90 to 0.99 = very high reliability; 0.70 to 0.89 = high reliability; 0.50 to 0.69 = moderate reliability; 0.26 to 0.49 = low reliability and 0.00 to 0.25 = little, if any, reliability .
The results of the current study indicate that regardless of the landmarking method utilized (methods 1, 2 or 3), dual inclinometry had good intra-rater reliability and fair to good inter-rater reliability for lumbar flexion and extension measurements. This adds on to the credibility of using dual inclinometer because reliable objective measures are fundamental to making data-driven clinical decisions. This study demonstrates reliable methods of measuring lumbar ROM are available. Since, lack of such evidence resulted in ROM being excluded from the measures that are required to calculate impairment due to back pain . Since previous studies gave mixed messages on reliability of these measures, [4, 8, 18, 26] we do not suggest these findings are sufficient to change that decision. This study proposed a method to improve reliability of spinal motion measurements and there is value in adopting consistent methods to allow for greater comparability.
Currently, there is no clear evidence about the relevance of lumbar ROM in low back pain. This is commonly seen in routine clinical practice. Symptomatic patients might not show impaired range of motion while symptomatic patients might present substantial impairment of lumbar movements. The following is a classic example of this observation. Quack and colleagues (2007) compared MRI findings to ROM tests of the lumbar spine and found no clear relationship between the changes observed on a MRI to the ROM tests of the lumbar spine . This is due to our very limited understanding of the relationship between impairment and function low back pain. Hence, studies on the relationship of spinal motion to function are needed to determine if this impairment should be considered important when evaluating impairment.
Nitschke et al.  demonstrated poor inter and intra-rater reliability for measurements of thoracolumbar flexion, extension, lateral flexion and rotation using a J-Tech CDI with respect to reliability. On the contrary, the current study has shown that with appropriate land marking techniques the reliability was fair to good. The differences in reliability reported across studies could be due to positioning. Investigators in the Nitschke et al. study used directions provided in the AMA guide (2nd edition) which does not clearly define the location for the master unit; it defines it as “mid-sacrum” which could bring about substantial variation in identifying this reference point between assessors. On the other hand in the current study we used an easy-to- identify landmark, PSIS which even a novice assessor can identify easily.
An important and common source of error while measuring lumbar range of motion could be to do with the preparation of subjects. Two important aspects of subject preparation are: exposing the lower spine to improve landmarking and considering the flexibility of hamstrings which could reduce the movement in pelvis and consequently in the lumbar spine . In this study we exposed the lumbar spine by lifting and clipping the clothing. To improve flexibility of the hamstrings we made our subjects perform some practice repetitions which can increase the flexibility of the hamstrings considerably through ‘warm-up’ . It is also believed that hamstring stretching could have a considerable effect on the lumbar spine because of its pelvic attachment; this might increase pain in patients with LBP and limit the actual lumbar ROM during the actual test. To this end, we did not incorporate a complete hamstring stretching protocol which may have affected spinal motion and also we ensured that the stretches were performed within the limits of pain.
The technique of applying the dual inclinometer also has potential sources of error. Positioning of the MASTER and SLAVE heads of the dual inclinometer may be problematic, including the ability to maintain a constant pressure with the heads against the skin. Measuring lumbar extension can be particularly awkward due to the tendency for the heaping up and folding of skin. Also, with highly flexible subjects, it can be difficult to position the heads to prevent them from colliding. Improvement can be made by moving either one of the measurements heads slightly lateral to prevent collision. This would still provide an accurate measurement without the perfect alignment of the heads because the unit measures the angle between the intersecting planes created at the heads.
Despite the mentioned sources of error, this study has shown dual inclinometry to be a reliable tool for measuring lumbar flexion and extension. Although validity was not tested, dual inclinometry provides its measurements in degrees, which better represents the true spinal angular movement than the linear skin distraction method of MMS. Moreover the criterion validity of dual inclinometer has been tested against the gold standard of radiographic measurements in previous studies [39, 40]. However, additional research is needed to confirm the validity of the dual inclinometry techniques mentioned in this study for measuring lumbar spinal movement.
There are many different landmarking techniques currently used by clinicians to locate a specific spinal level. However, the reliability between therapists to find the same spinal level tends to be problematic . Sources of error include the level of training of the therapist, how well the lumbar spine is exposed away from clothing, and the varying distances for individuals of the bony landmarks to the surface of the skin. However, studies have shown that even highly experienced clinicians have low reliability in accurate ascertainment of spinal level, suggesting that measurement strategies based on palpation have inherent limitations . For this reason and knowing that, most patients in this study fell into the 15 cm distance that is currently used in the MMS. A measurement strategy that utilizes this method may produce more reliable results when one considers the wide spectrum of potential testers that might use inclinometry for spinal evaluation. A limitation of this method is that it does not represent the same portion of the spine for people of different heights. Whether this means the measurement is less valid or less related to function in different populations or across people of different heights, needs further study. A limitation of this current study is its inability to calculate values for absolute reliability which would have made the study even more clinically relevant. We recommend future studies to determine Standard Error of Measurement (SEM) values to determine absolute reliability.
All three landmarking techniques have almost the same reliability. However, from a clinical standpoint, the PSIS to 15 cm cephalad method as used in the modified-modified Schobers test, is recommended, as it is the simplest to perform and aligns with routine clinical practice. Dual inclinometry can produce clinically reliable measurements provided a reliable landmarking strategy is employed.
Tracker M.E. J-Tech Medical Industries, Utah, USA
3 M™ Micropore™ Medical Tape, 3 M in Canada, London ON.
Long DM, BenDebba M, Torgerson WS, Boyd RJ, Dawson EG, Hardy RW, et al. Persistent back pain and sciatica in the United States: patient characteristics. J Spinal Disord. 1996;9(1):40–58.
Thomas E, Silman AJ, Papageorgiou AC, Macfarlane GJ, Croft PR. Association between measures of spinal mobility and low back pain. An analysis of new attenders in primary care. Spine. 1998;23(3):343–7.
Nitschke JE, Nattrass CL, Disler PB, Chou MJ, Ooi KT. Reliability of the American Medical Association guides’ model for measuring spinal range of motion. Its implication for whole-person impairment rating. Spine. 1999;24(3):262–8.
Rondinelli R, Murphy J, Esler A, Marciano T, Cholmakjian C. Estimation of normal lumbar flexion with surface inclinometry. A comparison of three methods. Am J Phys Med Rehabil. 1992;71(4):219–24.
Dopf CA, Mandel SS, Geiger DF, Mayer PJ. Analysis of spine motion variability using a computerized goniometer compared to physical examination. A prospective clinical study. Spine. 1994;19(5):586–95.
MacDermid JCCK, Gandhi R. The reliability and validity of double inclinometer in measuring lumbar mobility. Physiother Can. 2000;53(1):1–s29.
Saur PM, Ensink FB, Frese K, Seeger D, Hildebrandt J. Lumbar range of motion: reliability and validity of the inclinometer technique in the clinical measurement of trunk flexibility. Spine. 1996;21(11):1332–8.
Williams R, Binkley J, Bloch R, Goldsmith CH, Minuk T. Reliability of the modified-modified Schober and double inclinometer methods for measuring lumbar flexion and extension. Phys Ther. 1993;73(1):33–44.
Gardocki RJ, Watkins RG, Williams LA. Measurements of lumbopelvic lordosis using the pelvic radius technique as it correlates with sagittal spinal balance and sacral translation. Spine J. 2002;2(6):421–9.
Fitzgerald GK, Wynveen KJ, Rheault W, Rothschild B. Objective assessment with establishment of normal values for lumbar spinal range of motion. Phys Ther. 1983;63(11):1776–81.
Kippers V, Parker AW. Toe-touch test. A measure of its validity. Phys Ther. 1987;67(11):1680–4.
Hart DL, Rose SJ. Reliability of a noninvasive method for measuring the lumbar curve*. J Orthop Sports Phys Ther. 1986;8(4):180–4.
Beattie P, Rothstein JM, Lamb RL. Reliability of the attraction method for measuring lumbar spine backward bending. Phys Ther. 1987;67(3):364–9.
Bryan JM, Mosner E, Shippee R, Stull MA. Investigation of the validity of postural evaluation skills in assessing lumbar lordosis using photographs of clothed subjects. J Orthop Sports Phys Ther. 1990;12(1):24–9.
Mayer TG, Tencer AF, Kristoferson S, Mooney V. Use of noninvasive techniques for quantification of spinal range-of-motion in normal subjects and chronic low-back dysfunction patients. Spine (Phila Pa 1976). 1984;9(6):588–95.
Littlewood C, May S. Measurement of range of movement in the lumbar spine—what methods are valid? A systematic review. Physiotherapy. 2007;93(3):201–11.
Mayer TG, Kondraske G, Beals SB, Gatchel RJ. Spinal range of motion. Accuracy and sources of error with inclinometric measurement. Spine. 1997;22(17):1976–84.
Chen SP, Samo DG, Chen EH, Crampton AR, Conrad KM, Egan L, et al. Reliability of three lumbar sagittal motion measurement methods: surface inclinometers. J Occup Environ Med. 1997;39(3):217–23.
Rainville J, Sobel JB, Hartigan C. Comparison of total lumbosacral flexion and true lumbar flexion measured by a dual inclinometer technique. Spine. 1994;19(23):2698–701.
Cupon LN, Jahn WT. Current standards for measuring spinal range of motion for impairment. J Chiropractic Med. 2003;2(1):8–12.
Van A, J.A.M, der Korst V. Assessment of the flexibility of the LumbarÂ. Scand J Rheumatol. 1973;2(2):87–91.
Miller SA, Mayer T, Cox R, Gatchel RJ. Reliability problems associated with the modified Schober technique for true lumbar flexion measurement. Spine. 1992;17(3):345–8.
Association AM. Guides to the Evaluation of Permanent Impairment 4 th edition. 4th ed. 1993. 1(2):112–130.
Cocchiarela L. American Medical Association, Anderson GBJ: Guides to the Evaluation of Permanent Impairment. 5th edition: Amer Medical Assn. 5th ed. 2001.
Rondinelli RD, Genovese E, Brigham CR. American Medical Association: Guides to the evaluation of permanent impairment. 6th edition: 6th ed. Chicago, Ill.: American Medical Association. 6th ed. 2008.
Gill K, Krag MH, Johnson GB, Haugh LD, Pope MH. Repeatability of four clinical methods for assessment of lumbar spinal motion. Spine. 1988;13(1):50–3.
Mayer RS, Chen IH, Lavender SA, Trafimow JH, Andersson GB. Variance in the measurement of sagittal lumbar spine range of motion among examiners, subjects, and instruments. Spine. 1995;20(13):1489–93.
Billis EV, Foster NE, Wright CC. Reproducibility and repeatability: errors of three groups of physiotherapists in locating spinal levels by palpation. Man Ther. 2003;8(4):223–32.
Binkley J, Stratford PW, Gill C. Interrater reliability of lumbar accessory motion mobility testing. Phys Ther. 1995;75(9):786–92. discussion 793–5.
Downey B, Taylor N, Niere K. Can manipulative physiotherapists agree on which lumbar level to treat based on palpation? Physiotherapy. 2003;89(2):74–81.
Harlick JC, Milosavljevic S, Milburn PD. Palpation identification of spinous processes in the lumbar spine. Man Ther. 2007;12(1):56–62.
McKenzie AM, Taylor NF. Can Physiotherapists locate Lumbar spinal levels by palpation? Physiotherapy. 1997;83(5):235–9.
Fleiss JL. Design and analysis of clinical experiments: Wiley. 2011.
Shrout PE, Fleiss JL. Intraclass correlations: uses in assessing rater reliability. Psychol Bull. 1979;86(2):420–8.
Portney L, Watkins M. Foundation of Clinical Research: Applications to practice. Upper Saddle River, NJ: Prentice-Hall; 2000.
Quack C, Schenk P, Laeubli T, Spillmann S, Hodler J, Michel BA, et al. Do MRI findings correlate with mobility tests? An explorative analysis of the test validity with regard to structure. Eur Spine J. 2007;16(6):803–12.
Stokes IA, Abery JM. Influence of the hamstring muscles on lumbar spine curvature in sitting. Spine (Phila Pa 1976). 1980;5(6):525–8.
Woods K, Bishop P, Jones E. Warm-up and stretching in the prevention of muscular injury. Sports Med. 2007;37(12):1089–99.
Perret C, Poiraudeau S, Fermanian J, Colau MM, Benhamou MA, Revel M. Validity, reliability, and responsiveness of the fingertip-to-floor test. Arch Phys Med Rehabil. 2001;82(11):1566–70.
Sarah T. A systematic review of methods to measure posture. Physical Ther Rev. 2003;8(1):45–50.
The publishing cost of this manuscript is covered by Western libraries Open Access Fund.
None of the authors have any competing interests to declare.
JM, VA, and JV were responsible for manuscript thought, writing and editing. KP and AK helped in the data collection. All authors read and approved the final manuscript.