Functional movement analysis in patients with chronic nonspecific low back pain: a reliability and validity study
BMC Musculoskeletal Disorders volume 20, Article number: 395 (2019)
Individuals afflicted with nonspecific chronic low back pain (CLBP) exhibit altered fundamental movement patterns. However, there is a lack of validated analysis tools. The present study aimed to elucidate the measurement properties of a functional movement analysis (FMA) in patients with CLBP.
In this validation (cross-sectional) study, patients with CLBP completed the FMA. The FMA consists of 11 standardised motor tasks mimicking activities of daily living. Four investigators (two experts and two novices) evaluated each item using an ordinal scale (0–5 points; one live and three video ratings). Interrater reliability was computed for the total score (maximum 55 points) using intraclass correlation coefficients and for the individual items using Cohen’s weighted Kappa and free-marginal Kappa. Validity was estimated by calculating Spearman’s rho correlations between the results of the movement analysis and the participants’ self-reported disability and fear of movement.
Twenty-one participants (12 females, 9 males; 42.7 ± 14.3 years) were included. The reliability analysis for the sum score yielded ICC values between .92 and .94 (p < .05). Agreement for the individual item scores ranged from ‘slight’ to ‘almost perfect’ (.10–.91). No significant associations of disability or fear of movement with the overall score were found (p > .05). The study population showed comparably low levels of pain, kinesiophobia and disability.
The functional movement analysis displays excellent reliability for both live and video rating. Due to the low levels of disability and pain in the present sample, further research is necessary to conclusively judge validity.
Chronic low back pain (CLBP) is a major health burden with a lifetime prevalence of up to 84%. The pathogenesis of CLBP is multifactorial. The symptoms can originate from several anatomical structures, including nerve roots, intervertebral disks, muscles, fasciae and bones [2, 3], as well as from psychological factors such as stress, depression or anxiety. More specifically, neuromuscular factors (i.e. deficits or impairments) have been named as risk factors for and contributors to CLBP [5, 6]. Unlike pain symptoms, which are self-reported by the patient, the neuromuscular contribution to CLBP may be assessed objectively.
While it is unclear whether they represent another potential (neuromuscular) cause or a consequence of the disorder, aberrations of fundamental movement patterns have been observed in patients with CLBP [7, 8]. Yet, these reports [7, 8] each focused on one particular joint or movement only. Systematic functional movement analyses, capturing fundamental movement patterns that represent activities of daily living, might thus be a valuable addition to instrumental diagnostics such as radiography and magnetic resonance imaging. In a previous trial, Wilke and Buhmann showed that a functional movement analysis (FMA) could discriminate between the movement patterns of healthy individuals and those of patients with CLBP. The latter achieved considerably lower scores, reflecting worse movement quality, and showed increased side-to-side asymmetry when compared to a control group. Despite these intriguing findings, the authors presented only a pilot evaluation of reliability. Therefore, our trial aimed to evaluate the measurement properties of the tool more thoroughly. This was done in two ways: by assessing reliability with regard to rater experience and assessment mode (live vs. video), and by cross-validating the FMA against established self-reported measures.
Ethical standard and study design
The study was approved by a local ethics committee and conducted in accordance with the ethical standards set by the Declaration of Helsinki, including its recent Fortaleza revision [9, 10]. Each participant signed informed consent prior to study enrolment.
Adults with CLBP were recruited. Recruitment strategies included the posting of flyers (public) and a direct approach by the investigators (outpatient rehabilitation centre). Participants were considered eligible if they fulfilled the following criteria: (1) chronic (> 13 weeks/3 months) nonspecific low back pain and (2) age from 18 to 65 years. Exclusion criteria comprised (1) severe psychiatric, neurological or cardiovascular diseases; (2) orthopaedic disorders except for low back pain; (3) pregnancy; (4) acute infectious disease and (5) intake of analgesics (painkilling drugs) or muscle relaxants within the previous 48 h.
All participants performed the 30-min functional movement analysis on two separate days. The wash-out period between the two sessions was 1 week. The test consists of eleven movement tasks mimicking activities of daily living (Table 1). For each individual item, three repetitions were performed and each was rated. The best of the three was used for analysis. Unilateral, non-symmetrical tasks (e.g. lunge) were performed on both body sides. To ensure uniform testing conditions, all analyses were instructed by the same investigator. Standardised verbal commands as well as photo illustrations were used.
For each of the 11 test items, performance was rated on an ordinal 6-point Likert scale (0 to 5 points). Scoring was based on the identification of predefined error patterns indicating a lack of joint mobility or stability within the respective tasks (Fig. 1). If no compensatory movements were observed and the task was completed with high precision, the maximum value of five points per item was awarded. In contrast, each observed error pattern led to the deduction of one point. Thus, one error led to four, two errors to three and three errors to two points. A task was scored with one point if more than three errors became manifest or the participant was unable to execute the requested movement. If the participant reported pain, zero points were documented, regardless of the error count. In eight of the 11 items, a predefined, simplified version was to be completed if a score of 5 was not achieved. In this case, a maximum of 4 points was obtainable. Again, but this time starting from the score of 4, each observed error pattern led to the deduction of one point.
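The scoring rules above can be written down as a short Python sketch. It is an illustration only, not the authors' scoring software: the function names and parameters are hypothetical, and the floor of one point for the simplified task version is an assumption, since the text states only that each error deducts one point from the maximum of 4.

```python
def score_item(errors, pain=False, unable=False, simplified=False):
    """Return the item score (0-5) for one rated repetition.

    errors     -- number of predefined error patterns observed
    pain       -- participant reported pain (score 0, regardless of errors)
    unable     -- participant could not execute the movement (score 1)
    simplified -- the simplified task version was performed (maximum 4)
    """
    if errors < 0:
        raise ValueError("errors must be non-negative")
    if pain:
        return 0
    if unable:
        return 1
    ceiling = 4 if simplified else 5
    # One point is deducted per error; more than three errors give one point.
    # ASSUMPTION: the simplified version is floored at one point as well.
    return max(1, ceiling - errors)


def best_of_repetitions(repetition_scores):
    """Return the best of the rated repetitions, which is used for analysis."""
    return max(repetition_scores)


# Hypothetical example: three repetitions with 2, 1 and 3 observed errors.
scores = [score_item(e) for e in (2, 1, 3)]
item_score = best_of_repetitions(scores)
```

Keeping the pain and inability checks ahead of the error count mirrors the order of precedence described in the text.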
Functional movement quality
Live and video ratings were conducted. The former were performed by the investigator instructing the participants. For video rating, all analyses were recorded in the frontal and sagittal planes using two high-resolution cameras (HDR-CX240, Sony, Minato, Tokyo), according to the procedures recommended in previous investigations. Three raters independently evaluated the videos and one novice rater scored live. One of the video raters was a novice in the scoring of non-instrumented movement analyses, while the other two, classified as experts, had long-standing experience with the assessment and observation of functional movement patterns. Prior to study initiation, the two novice raters received detailed training on the use of the tool, including demonstration scoring by the expert raters.
The total score was calculated by summing the individual scores of all 11 items. The highest achievable result was 55 points. In addition, the number of side-to-side asymmetries was documented. An asymmetry was defined as an unequal item score between left and right in the 7 non-symmetrical items. The maximum number of asymmetries, hence, was 7.
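A minimal sketch of the two summary measures follows. The item names are taken from the results tables; the example scores are invented for illustration. How the left and right scores of a bilateral item enter the total is not stated in the text, so taking the lower side (the convention of the Functional Movement Screen) is an assumption, exposed here as the hypothetical `combine` parameter.

```python
def count_asymmetries(items):
    """Count bilateral items whose left and right scores differ."""
    return sum(
        1 for score in items.values()
        if isinstance(score, tuple) and score[0] != score[1]
    )


def total_score(items, combine=min):
    """Sum the item scores of one participant.

    Bilateral items are given as (left, right) tuples.
    ASSUMPTION: the lower side counts towards the total (combine=min).
    """
    return sum(
        combine(score) if isinstance(score, tuple) else score
        for score in items.values()
    )


# Hypothetical participant: 4 symmetrical and 7 bilateral (left, right) items.
participant = {
    "squat": 3,
    "forward bending": 4,
    "push-up": 2,
    "pull-up": 3,
    "inline lunge": (3, 4),
    "hurdle step": (4, 4),
    "pelvic stability": (2, 3),
    "rotary stability": (3, 3),
    "side plank": (2, 2),
    "thoracic mobility": (4, 5),
    "shoulder mobility": (5, 5),
}

fma_total = total_score(participant)
asymmetries = count_asymmetries(participant)
```

With eleven items scored 0 to 5, the total cannot exceed 55 points, and with seven bilateral items the asymmetry count cannot exceed 7.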
Self-reported function and disability
In addition to movement quality, psychometric data on self-reported function and disability were collected. To capture the levels of physical activity during the 7 days prior to study initiation, the participants completed the short form of the International Physical Activity Questionnaire (IPAQ-SF). With its seven questions, the IPAQ-SF assesses the number of days spent with intensive, moderate, and light activity, as well as the days characterized by sedentary behaviour. The outcome variable used for analysis was the overall level of activity in MET-minutes per week. The instrument has been demonstrated to display sufficient measurement properties [12, 13].
Pain intensity and subjective disability were measured with the Chronic Pain Grade Questionnaire (CPGQ), developed by von Korff et al. (1992). The questionnaire comprises six items, each scored on an 11-point Likert scale (ranging from 0 to 10). Three questions address pain intensity and three address subjective disability. Additionally, the number of days with disability during the past 3 months was recorded. The pain intensity sum score and the disability sum score are built by means of z-transformations of the original Likert scale questions on disability (3 questions) and pain (3 questions). Sum scores can range from 0 to 100 points. Based on these two scores, the participants were stratified according to the following severities of chronic pain: 0 = no pain in the last 3 months; I = low disability and low pain intensity; II = low disability and high pain intensity; III = high disability and moderately limiting; IV = high disability and severely limiting. An evaluation of the German version of the CPGQ shows that the questionnaire is a reliable and valid instrument for the rating of chronic pain severity.
To evaluate functional disability in performing activities of daily living, the Quebec Back Pain Disability Scale (QBPDS) was applied. Twenty questions addressing the capacity to perform activities of daily living were to be answered and scored on 6-point scales (0 = not difficult at all; 1 = marginally difficult; 2 = somewhat difficult; 3 = fairly difficult; 4 = very difficult; 5 = impossible to perform). The questionnaire is based on six sum categories: sleeping/rest (questions 1–3); sitting/standing (questions 4–6); locomotion (questions 7–9); movement (questions 10–12); bending forward (questions 13–16) and carrying heavy materials (questions 17–20). The internal consistency of this questionnaire is good for the sum score (Cronbach’s alpha = .94), and its test-retest reliability (ICC = .81) has been shown to be high.
Pain-related fear of movement and injury was measured by means of the Tampa Scale of Kinesiophobia. A validated German version (TSK-GV) with eleven items to be scored on a 4-point Likert scale (strongly disagree to strongly agree) was used. The TSK-GV has been demonstrated to exhibit high reliability and validity.
Reliability of the total score was estimated by means of intraclass correlation coefficients (ICC 2,1). Reliability was analyzed twice: once for interrater agreement (expert vs. novice; live vs. video rating) and once for intrarater agreement (test-retest design). According to Fleiss (1999), the resulting values were interpreted as ‘poor’ (ICC < .4), ‘fair to good’ (ICC .4–.75) and ‘excellent’ (ICC > .75).
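For readers who wish to reproduce this kind of analysis outside SPSS, the following pure-Python sketch computes ICC(2,1), the two-way random-effects, absolute-agreement, single-rater coefficient, from the mean squares of a participants-by-raters table. The function names and the example data are hypothetical, and no confidence interval or p value is computed.

```python
def icc_2_1(ratings):
    """ICC(2,1) for a table with one row per participant, one column per rater."""
    n = len(ratings)       # participants
    k = len(ratings[0])    # raters
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)                  # between participants
    ms_cols = ss_cols / (k - 1)                  # between raters
    ms_error = ss_error / ((n - 1) * (k - 1))    # residual

    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


def interpret_icc(icc):
    """Verbal label following the Fleiss thresholds quoted in the text."""
    if icc < 0.4:
        return "poor"
    if icc <= 0.75:
        return "fair to good"
    return "excellent"


# Hypothetical total scores of five participants rated by two raters.
example = [[31, 30], [25, 27], [40, 39], [35, 35], [28, 30]]
label = interpret_icc(icc_2_1(example))
```

Because rater variance enters the denominator, a systematic offset between two raters lowers ICC(2,1) even when their rank orders agree.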
To judge the concordance of the individual item ratings, Cohen’s weighted kappa statistics were used. Reliability of the side-to-side asymmetry rating was calculated with Free Marginal Kappa statistics. The interpretation of all Kappa values was based on the recommendations of Landis and Koch (1977): k < 0 (‘poor agreement’); k = 0–0.20 (‘slight agreement’); k = 0.21–0.40 (‘fair agreement’); k = 0.41–0.60 (‘moderate agreement’); k = 0.61–0.80 (‘substantial agreement’); k = 0.81–1.00 (‘almost perfect agreement’). All agreement statistics were calculated relative to the most experienced expert rater.
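The agreement statistics named above can be sketched as follows. The weighting scheme for Cohen's weighted kappa is not specified in the text, so linear weights are assumed here (quadratic weights are obtained with `power=2`); the free-marginal kappa follows the Brennan and Prediger form, in which chance agreement is fixed at 1/q for q categories. All function and parameter names are hypothetical.

```python
def weighted_kappa(rater_a, rater_b, categories, power=1):
    """Cohen's weighted kappa with |i - j| ** power disagreement weights.

    ASSUMPTION: linear weights (power=1); the source does not name the scheme.
    """
    n = len(rater_a)
    q = len(categories)
    index = {c: i for i, c in enumerate(categories)}
    observed = [[0.0] * q for _ in range(q)]
    for a, b in zip(rater_a, rater_b):
        observed[index[a]][index[b]] += 1 / n
    row = [sum(observed[i]) for i in range(q)]
    col = [sum(observed[i][j] for i in range(q)) for j in range(q)]
    cells = [(i, j) for i in range(q) for j in range(q)]
    disagreement = sum(abs(i - j) ** power * observed[i][j] for i, j in cells)
    by_chance = sum(abs(i - j) ** power * row[i] * col[j] for i, j in cells)
    if by_chance == 0:
        raise ValueError("kappa is undefined when both raters use one category")
    return 1 - disagreement / by_chance


def free_marginal_kappa(rater_a, rater_b, n_categories):
    """Free-marginal kappa: chance agreement is fixed at 1 / n_categories."""
    agree = sum(a == b for a, b in zip(rater_a, rater_b)) / len(rater_a)
    chance = 1 / n_categories
    return (agree - chance) / (1 - chance)


def landis_koch(kappa):
    """Verbal label following the Landis and Koch bands quoted in the text."""
    if kappa < 0:
        return "poor"
    for upper, label in ((0.20, "slight"), (0.40, "fair"),
                         (0.60, "moderate"), (0.80, "substantial")):
        if kappa <= upper:
            return label
    return "almost perfect"


# Hypothetical item scores (0-5) and asymmetry judgements (1 = asymmetric).
expert = [5, 4, 3, 2, 4, 1]
novice = [5, 3, 3, 2, 4, 2]
item_agreement = landis_koch(weighted_kappa(expert, novice, range(6)))
asymmetry_agreement = landis_koch(free_marginal_kappa([1, 0, 1, 1], [1, 0, 0, 1], 2))
```

Weighted kappa gives partial credit for near misses on the ordinal 0 to 5 scale, whereas the free-marginal variant suits the binary asymmetry judgement because its chance term does not depend on how often each rater marks an asymmetry.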
Systematic associations between movement pattern quality (assessed via the FMA) and subjective disability (QBPDS), pain intensity (von Korff’s CPG) as well as fear of movement (TSK scale) were examined using Spearman’s rho correlations. The significance level for all analyses was set to α = .05. All statistical calculations were done with SPSS 22.0 (SPSS Inc., Chicago, IL, USA).
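As an illustration of the correlation analysis, Spearman's rho can be computed as the Pearson correlation of the rank-transformed data, with tied values receiving their average rank. The sketch below is pure Python with hypothetical names and invented data; it returns the coefficient only and does not perform the significance test against α = .05.

```python
def average_ranks(values):
    """Rank values from 1 upwards; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1
        for pos in range(start, end + 1):
            ranks[order[pos]] = shared
        start = end + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rho as the Pearson correlation of the two rank vectors."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    sd_x = sum((a - mean_x) ** 2 for a in rx) ** 0.5
    sd_y = sum((b - mean_y) ** 2 for b in ry) ** 0.5
    return cov / (sd_x * sd_y)


# Hypothetical FMA total scores and QBPDS sum scores of six participants.
fma_totals = [31, 25, 40, 35, 28, 33]
qbpds_sums = [12, 30, 4, 10, 22, 8]
rho = spearman_rho(fma_totals, qbpds_sums)
```

A rank-based coefficient is a natural choice here because the FMA total and the questionnaire scores are built from ordinal items.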
No participant had to be excluded after study enrolment, and no participant withdrew his/her consent. Twenty-one participants (females = 12; males = 9; 43 ± 14 years) were included. Psychometric data are displayed in Table 2. Overall, the sample displayed low levels of pain, disability and kinesiophobia. The mean sum score achieved in the FMA was 31.0 ± 6.2 points. On average, 2.5 ± 1.3 asymmetries were detected.
The ICC values of the total score for interrater reliability ranged between .92 and .94. The expert video rating reached the highest ICC, the novice video rating reached an ICC of .93, and the live rating reached the lowest ICC value.
The concordance of the individual item scores ranged from ‘slight agreement’ to ‘almost perfect agreement’. On average, three of the eleven items showed fair agreement (squat, rotary stability, side plank), three moderate (inline lunge, forward bending, pelvic stability), three substantial (hurdle step, push-up, pull-up) and two almost perfect agreement (thoracic mobility, shoulder mobility) (Table 3).
The Free Marginal Kappa values for the ratings of movement asymmetry are displayed in Table 4. The agreement between the raters regarding asymmetries ranged from ‘poor’ to ‘substantial’. On average, of the seven items, one reached ‘poor’ (side plank), one ‘slight’ (inline lunge), three ‘fair’ (hurdle step, pelvic stability, rotary stability), one ‘moderate’ (thoracic mobility) and one ‘substantial’ (shoulder mobility) agreement.
The ICC value of the total score for intrarater reliability was .91. The Free Marginal Kappa values for the ratings of movement asymmetry ranged from ‘poor’ to ‘substantial’ agreement. The corresponding values are displayed in Table 4.
No significant associations between total sum score of the functional movement analysis and measures of subjective movement disability or fear of movement were detected (p > .05; Table 5).
Our results show excellent values for interrater (video as well as live rating) and intrarater reliability (live rating) for the functional movement analysis. The interrater agreement for the eleven individual items ranged from ‘slight’ to ‘almost perfect’. Six of the seven items with asymmetries reached ‘slight’ to ‘substantial’ agreement for both inter- and intrarater reliability. More experience in the use of the FMA resulted in only minimally improved agreement. No correlations between the subjective outcomes and the functional movement analysis were identified.
Previous studies investigating the reliability of other systematic movement analysis approaches (e.g. the Functional Movement Screen, FMS) show inconsistent results (ICCs ranging from 0.38 to 0.92) [20, 21, 22]. A systematic review found a mean ICC of 0.81 for interrater reliability of the FMS. The overall reliability of the present FMA is higher than in these other findings. Also, the individual items of the present FMA showed better reliability than those of the FMS: at least moderate agreement (k > 0.4) in 73% (8/11) compared to only 57% (4/7).
In other visual movement analyses, good interrater reliability values were reached when different degrees of experience (expert versus novice) and assessment modes (live versus video rating) were compared. In a movement analysis designed to investigate the lower limbs with a standardised procedure, good interrater reliability (trained beginners vs. experts) was reached. Similar results were found for the FMS: the ratings of trained novice raters were in agreement with those of expert raters. These findings are in line with the results of the present FMA.
Other studies comparing live versus video rating showed varying reliability, ranging from poor (ICC = .23) to excellent (ICC = .92) [21, 26]. Our results support the latter. In the present study, the videos were recorded in the frontal and the sagittal plane to cover the entire movement. Advantages of this video-based evaluation include the possibility to watch movements as often as necessary and at different velocities, as well as to pause the video at critical positions. The excellent reliability between live and video rating in the present FMA indicates that the described standardised video recording with two cameras from two different perspectives is suitable.
We found no significant associations between the total score of the functional movement analysis and measures of subjective movement disability or fear of movement. One previous study demonstrated that the movement analysis can discriminate between back pain patients and healthy individuals. Patients show a lower sum score and display more movement asymmetries. The patients with back pain in the study of Wilke and Buhmann (2013) reached a mean sum score of 32.0 points and had 3.8 asymmetries. The population of the present study reached a comparable sum score of 31 points, but showed considerably fewer asymmetries (2.5). Reasons for the finding of no association may be found in (1) the low subjective movement disability and pain intensity levels in our participants, in particular when compared to other study populations with CLBP [16, 17], (2) the low variability of the data (QBPDS values ranging only from 0 to 36 of a possible 0–100 points), and (3) the high physical activity of the participants. The study population showed a mean amount of physical activity of 3888 MET-minutes in the week before study inclusion. This can be interpreted as a very high activity level. One may speculate that active people with chronic back pain do not change their movement behavior as strongly as their inactive counterparts and therefore experience only low subjective movement disability in their everyday life. In any case, as physical activity is, to a certain extent, associated with reduced back pain and movement disability, this might also explain the lack of correlations. Further research should, therefore, aim to examine whether associations of movement quality and self-reported parameters of pain and function become manifest in patients with increased disability and movement fear.
The present findings have implications for clinical practice. So far, systematic visual investigations of functional movement patterns have only been implemented in competitive sports [28, 29]. As alterations of fundamental movement patterns have been demonstrated in patients with low back pain [8, 30], the present tool may be an interesting addition to the available pool of diagnostic instruments. The present FMA shows good interrater reliability and can be confidently used by both novice and experienced raters using live and video scoring. It provides a straightforward estimate of fundamental movement quality. Regarding validity, the FMA could discriminate between patients and healthy adults, but could not distinguish between different disability levels in participants with a low overall level of pain. Consequently, and as our target population did not indicate pain during movements, it can safely be applied in participants with chronic pain, yet its discriminative validity is at least questionable. Nonetheless, further research that is not limited to patients with low disability levels, limited pain and high physical activity should be conducted in order to conclusively judge its value regarding an association with self-reported measures. Moreover, future studies are warranted to examine whether specific task failures due to impaired movement patterns can be addressed and improved by particular corrective exercises.
The standardised functional movement analysis displays good interrater reliability when different levels of rater experience and observation methods are compared. The functional movement analysis is usable in patients with chronic low back pain, but it remains unclear whether it can identify different degrees of subjective disability in tasks of daily living; this should be clarified in additional studies.
Availability of data and materials
The datasets used and analysed during the current study are available from the corresponding author on reasonable request.
Abbreviations
CLBP: Chronic Low Back Pain
CPG: Chronic Pain Grade
FMA: Functional Movement Analysis
FMS: Functional Movement Screen
ICC: Intraclass Correlation Coefficient
IPAQ-SF: International Physical Activity Questionnaire – Short Form
MET: Metabolic Equivalent of Task
QBPDS: Quebec Back Pain Disability Scale
TSK-GV: Tampa Scale of Kinesiophobia – German Version
Breivik H, Collett B, Ventafridda V, Cohen R, Gallacher D. Survey of chronic pain in Europe: prevalence, impact on daily life, and treatment. Eur J Pain. 2006;10:287–333. https://doi.org/10.1016/j.ejpain.2005.06.009.
Allegri M, Montella S, Salici F, Valente A, Marchesini M, Compagnone C, et al. Mechanisms of low back pain: a guide for diagnosis and therapy. F1000Res. 2016. https://doi.org/10.12688/f1000research.8105.2.
Wilke J, Buhmann HW. Qualität grundlegender Bewegungsmuster bei Patienten mit chronischen lumbalen Rückenschmerzen: Eine quasi-experimentelle Querschnittsstudie. Sportverletz Sportschaden. 2013;27:219–25. https://doi.org/10.1055/s-0033-1355855.
Deyo RA, Bryan M, Comstock BA, Turner JA, Heagerty P, Friedly J, et al. Trajectories of symptoms and function in older adults with low back disorders. Spine. 2015;40:1352–62. https://doi.org/10.1097/BRS.0000000000000975.
Brown SHM, McGill SM. The intrinsic stiffness of the in vivo lumbar spine in response to quick releases: implications for reflexive requirements. J Electromyogr Kinesiol. 2009;19:727–36. https://doi.org/10.1016/j.jelekin.2008.04.009.
Borghuis J, Hof AL, Lemmink KAPM. The importance of sensory-motor control in providing core stability: implications for measurement and training. Sports Med. 2008;38:893–916. https://doi.org/10.2165/00007256-200838110-00002.
Burnett AF, Cornelius MW, Dankaerts W, O'Sullivan PB. Spinal kinematics and trunk muscle activity in cyclists: a comparison between healthy controls and non-specific chronic low back pain subjects-a pilot investigation. Man Ther. 2004;9:211–9. https://doi.org/10.1016/j.math.2004.06.002.
O'Sullivan PB, Mitchell T, Bulich P, Waller R, Holte J. The relationship between posture and back muscle endurance in industrial workers with flexion-related low back pain. Man Ther. 2006;11:264–71. https://doi.org/10.1016/j.math.2005.04.004.
Declaration of Helsinki. Recommendations guiding doctors in clinical research. Adopted by the world medical association in 1964. Wis Med J. 1967;66:25–6.
World Medical Association. Declaration of Helsinki: ethical principles for medical research involving human subjects. JAMA. 2013;310:2191–4. https://doi.org/10.1001/jama.2013.281053.
IPAQ scoring protocol. Guidelines for Data Processing and Analysis of the International Physical Activity Questionnaire (IPAQ) – Short and Long Forms; Version: November 2005. 2005. https://www.researchgate.net/file.PostFileLoader.html?id=56f92d66615e27d49a658031&assetKey=AS%3A344600888791041%401459170662924. Accessed Nov 2017.
Craig CL, Marshall AL, Sjöström M, Bauman AE, Booth ML, Ainsworth BE, et al. International physical activity questionnaire: 12-country reliability and validity. Med Sci Sports Exerc. 2003;35:1381–95. https://doi.org/10.1249/01.MSS.0000078924.61453.FB.
Hagströmer M, Oja P, Sjöström M. The international physical activity questionnaire (IPAQ): a study of concurrent and construct validity. PHN. 2006;9:461. https://doi.org/10.1079/PHN2005898.
von Korff M, Ormel J, Keefe FJ, Dworkin SF. Grading the severity of chronic pain. Pain. 1992;50:133–49.
Klasen BW, Hallner D, Schaub C, Willburger R, Hasenbring M. Validation and reliability of the German version of the chronic pain grade questionnaire in primary care back pain patients. Psychosoc Med. 2004;1:Doc07.
Riecke J, Holzapfel S, Rief W, Lachnit H, Glombiewski JA. Cross-cultural adaption of the German Quebec Back pain disability scale: an exposure-specific measurement for back pain patients. J Pain Res. 2016;9:9–15. https://doi.org/10.2147/JPR.S92615.
Rusu AC, Kreddig N, Hallner D, Hülsebusch J, Hasenbring MI. Fear of movement/(re) injury in low back pain: confirmatory validation of a German version of the Tampa scale for Kinesiophobia. BMC Musculoskelet Disord. 2014;15:280. https://doi.org/10.1186/1471-2474-15-280.
Fleiss JL. The design and analysis of clinical experiments. Hoboken: John Wiley & Sons, Inc; 1999.
Brennan RL, Prediger DJ. Coefficient kappa: some uses, misuses, and alternatives. Educ Psychol Meas. 2016;41:687–99. https://doi.org/10.1177/001316448104100307.
Minick KI, Kiesel KB, Burton L, Taylor A, Plisky P, Butler RJ. Interrater reliability of the functional movement screen. J Strength Cond Res. 2010;24:479–86. https://doi.org/10.1519/JSC.0b013e3181c09c04.
Shultz R, Anderson SC, Matheson GO, Marcello B, Besier T. Test-retest and interrater reliability of the functional movement screen. J Athl Train. 2013;48:331–6. https://doi.org/10.4085/1062-6050-48.2.11.
Teyhen DS, Shaffer SW, Lorenson CL, Halfpap JP, Donofry DF, Walker MJ, et al. The functional movement screen: a reliability study. J Orthop Sports Phys Ther. 2012;42:530–40. https://doi.org/10.2519/jospt.2012.3838.
Bonazza NA, Smuin D, Onks CA, Silvis ML, Dhawan A. Reliability, validity, and injury predictive value of the functional movement screen: a systematic review and meta-analysis. Am J Sports Med. 2017;45:725–32. https://doi.org/10.1177/0363546516641937.
Moran RW, Schneiders AG, Major KM, Sullivan SJ. How reliable are functional movement screening scores? A systematic review of rater reliability. Br J Sports Med. 2016;50:527–36. https://doi.org/10.1136/bjsports-2015-094913.
Harris-Hayes M, Steger-May K, Koh C, Royer NK, Graci V, Salsich GB. Classification of lower extremity movement patterns based on visual assessment: reliability and correlation with 2-dimensional video analysis. J Athl Train. 2014;49:304–10. https://doi.org/10.4085/1062-6050-49.2.21.
Mischiati CR, Comerford M, Gosford E, Swart J, Ewings S, Botha N, et al. Intra and inter-rater reliability of screening for movement impairments: movement control tests from the Foundation matrix. J Sports Sci Med. 2015;14:427–40.
Gordon R, Bloxham S. A systematic review of the effects of exercise and physical activity on non-specific chronic low back pain. Healthcare (Basel). 2016. https://doi.org/10.3390/healthcare4020022.
Chorba RS, Chorba DJ, Bouillon LE, Overmyer CA, Landis JA. Use of a functional movement screening tool to determine injury risk in female collegiate athletes. N Am J Sports Phys Ther. 2010;5:47–54.
Kiesel K, Plisky PJ, Voight ML. Can serious injury in professional football be predicted by a preseason functional movement screen? N Am J Sports Phys Ther. 2007;2:147–58.
Vogt L, Pfeifer K, Banzer W. Neuromuscular control of walking with chronic low-back pain. Man Ther. 2003;8:21–8. https://doi.org/10.1054/math.2002.0476.
Ethics approval and consent to participate
The local Ethics Committee of the Department of Psychology and Sports Sciences of the Goethe University in Frankfurt approved this study. All participants gave written informed consent before data collection began.
Consent for publication
Competing interests
Jan Wilke and Daniel Niederer are members of the editorial board.