This article has Open Peer Review reports available.
Test-retest reliability of knee kinesthesia in healthy adults
© Ageberg et al; licensee BioMed Central Ltd. 2007
Received: 14 November 2006
Accepted: 03 July 2007
Published: 03 July 2007
Sensory information from mechanoreceptors in the skin, muscles, tendons, and joint structures plays an important role in joint stability. A joint injury can lead to disruption of the sensory system, which can be measured by proprioceptive acuity. When evaluating proprioception, assessment tools need to be reliable. The aim of this study was to assess the test-retest reliability of a device designed to measure knee proprioception.
Twenty-four uninjured individuals (14 women and 10 men) were examined with regard to test-retest reliability of knee kinesthesia, measured by the threshold to detection of passive motion (TDPM). Measurements were performed towards extension and flexion from the two starting positions, 20 degrees and 40 degrees knee joint flexion, giving four variables. The mean difference between test and retest together with the 95% confidence interval (test 2 minus test 1), the intraclass correlation coefficient (ICC2,1), and Bland and Altman graphs with limits of agreement, were used as statistical methods for assessing test-retest reliability.
The intraclass correlation coefficients ranged from 0.59 to 0.70 in all variables except one. No difference was found between test and retest in three of the four TDPM variables. TDPM would need to decrease between 10% and 38%, and increase between 17% and 24% in groups of uninjured subjects to be 95% confident of detecting a real change. The limits of agreement were rather wide in all variables. The variables associated with the 20-degree starting position tended to have higher intraclass correlation coefficients and narrower limits of agreement than those associated with 40 degrees.
Three TDPM variables were considered reliable for observing change in groups of subjects without pathology. However, the limits of agreement revealed that small changes in an individual's performance cannot be detected. The higher intraclass correlation coefficients and the narrower limits of agreement in the variables associated with the starting position of 20 degrees knee joint flexion, indicate that these variables are more reliable than those associated with 40 degrees. We, therefore, recommend that the TDPM be measured with a 20-degree starting position.
Sensory information from mechanoreceptors in the skin, muscles, tendons, and joint structures plays an important role in joint stability [1–4]. The sensorimotor system covers the whole process from a sensory stimulus to muscle activation, i.e., acquisition of a sensory stimulus and conversion of the stimulus into a neural signal, transmission of the neural signal via afferent pathways to the central nervous system (CNS), processing and integration of the signal by the various centers of the CNS, and motor response resulting in muscle activation for the performance of various tasks and joint stabilization . Proprioception is the process occurring along the afferent pathways of the sensorimotor system. It is defined as the acquisition of stimuli by peripheral mechanoreceptors (such as joint motion, position, velocity, length and tension of tissue) and the conversion of these mechanical stimuli into a neural signal that is transmitted along the afferent pathways to the CNS for processing .
A joint injury or joint disease, e.g., a knee injury or knee osteoarthritis (OA), can lead to a disturbance in the sensory system. This disturbance can be measured by proprioceptive acuity. Several studies have concluded that subjects with a knee injury or knee OA have impaired proprioception [6–13]. Two common measures of proprioception are kinesthesia, e.g., the threshold to detection of a passive motion (TDPM), and joint position sense (JPS), e.g., the active reproduction test. The TDPM is the most established test, is more reliable, and more sensitive in detecting differences between groups, such as between patients with anterior cruciate ligament (ACL) injury and uninjured controls, than measures of JPS [6, 7]. A relation between impaired kinesthesia, measured by TDPM, and poor functional performance (measured by the one-leg hop test for distance, or balance in single-limb stance) and poor subjective outcome (measured by disease-specific questionnaires or subjective estimation of extremity function on a visual analog scale) has been found in patients with knee injury or knee OA [9, 14–18]. Thus, kinesthesia may be an important indicator of the result of knee injury or knee disease.
When evaluating kinesthesia or the effects of intervention on kinesthesia, the assessment tools used need to be reliable. The two components of measurement error are systematic bias, e.g., learning or fatigue effects during the test, and random error due to inherent subject or instrument variation. To obtain sufficient information about the assessment tool, it has been recommended that several statistics be used; i.e., relative reliability, analysis of systematic change in the mean, and absolute reliability [19–22]. The intraclass correlation coefficient (ICC), which includes the systematic bias, can be used to assess relative reliability [20–22]. However, one disadvantage of the ICC is that it provides a value between 0 and 1, which is difficult to interpret clinically. To detect whether there is a systematic change in the mean, the paired t-test or mean difference between test and retest with a 95% confidence interval (CI) can be used [21, 22]. Methods used to describe absolute reliability include calculations expressing the actual units of measurement, such as the Bland and Altman 95% limits of agreement (LOA) [21–23]. The LOA provide a 95% range of error for individuals, i.e., a real change in an individual's performance (e.g., before and after intervention) would be outside the LOA. The smaller the range, the more sensitive the method is in detecting change [19, 23, 24].
The aim of the present study was to assess the test-retest reliability of a device designed to measure knee proprioception, specifically the TDPM, in uninjured men and women.
Twenty-four individuals (14 women and 10 men) with no history of neurological disease or major orthopedic lesions were included in the study. The sample size was based on the recommendations of Fleiss, i.e., that 15 to 20 subjects would be required for estimating the reliability of a quantitative variable . The subjects' mean age was 41 years (SD 7.9 years), mean height 174 cm (SD 8.4 cm), mean weight 74 kg (SD 12.6 kg), and median activity level 4 (quartiles 4 to 5, range 2 to 9) according to the Tegner activity level scale, equal to moderately heavy work or recreational sports such as jogging, bicycling, or cross-country skiing . The Research Ethics Committee at Lund University approved the study. All subjects gave their written informed consent to participate in the study.
The subject lies in a lateral decubitus position, with the lower leg in the plastic splint. The splint supports the posterolateral part of the leg, but also has a slight anterior curve (to avoid valgus stress at the knee). The oversized construction allows for differences in the girth of the lower leg. Two bars mounted on the platform serve as guides for placing the thigh and trunk in a standard position, with the hip joint semiflexed. The knee joint was carefully positioned at the center of rotation. Markings on the platform allow accurate positioning of the knee in the different starting positions of knee joint flexion: 20° and 40°. Zero degrees is defined as full extension. The upper thigh and hip rest on a foam pillow (which can be adjusted to different heights, due to more extreme varus/valgus angulations), and pillows were also placed under the back to help the subject relax during the test. Care was taken to reduce any external stimuli of limb movement except those from the knee joint and surrounding structures. To minimize cutaneous sensations during the tests, all subjects wore short pants and a thick woolen sock, and the knee had no contact with the underlying surface. Visual cue of the leg was reduced by the subject's position, and closed eyes during the test, and auditory impulses were reduced during the threshold trial by earmuffs and a tape recorder playing a sound imitating the motor.
Measurements of the TDPM were performed towards extension (TE) and flexion (TF) from the two starting positions, 20° and 40° knee joint flexion, giving the variables TE20, TE40, TF20, and TF40. The subjects were asked to close their eyes, concentrate on their knee and respond (by raising their hand) when they felt any sensation of movement in their knee. The tape recorder was then turned on and, after a delay of 5 to 15 seconds (this information was not given to the subjects), the motor started to move the leg at a calibrated angular velocity of 0.5°·s-1. When the subject responded, the assessor stopped the motor and the movement was registered in degrees. The median values of three consecutive measurements of TE20, TE40, TF20, and TF40 were determined [16, 17, 27–29]. Higher values indicate poorer proprioceptive acuity . The subjects were tested twice (test 1 and test 2), at about the same time of day with an interval of approximately one week, median value 7 days (quartiles 6–7, range 2–12 days).
The different starting positions were chosen so as to be within the working range of the knee during ordinary weight-bearing activities/exercise. Since the range of motion may differ between individuals (e.g., some individuals may have an extension deficiency), the most extreme joint positions were excluded. Thus, the tension in the muscles, capsule and ligaments was kept below high levels to avoid more variable tissue tensions between individuals, and to allow the subjects to relax without having their leg forced to maximum extension.
A slow speed was chosen to ensure that the subjects could not detect a sudden onset of motion and to maximally stimulate the joint receptors and minimize the contribution from muscle receptors. The tests were performed on both legs; the right leg being tested first, by shifting the apparatus arrangement from one side of the platform to the other.
Since no differences were found between the men and the women, the results were analyzed together. No statistically significant difference was found between the variables in the right and left legs. To avoid the subjectivity in choosing one of the legs, the average of the right and left leg, i.e., (right+left)/2, for each variable was used for statistical analyses . However, the results were confirmed using the results from the right and left legs separately in the analyses.
Test-retest reliability in the kinesthetic variables, in 24 healthy subjects.
Test session 1 Mean (SD)
Test session 2 Mean (SD)
Mean difference (SDdiff),
0.05 (0.35), -0.10 – 0.19
0.70 (0.42 – 0.86)
0.57 – 1.78
-0.06 (0.84), -0.41 – 0.30
0.16 (-0.27 – 0.53)
0.31 – 3.09
-0.26 (0.55), -0.49 – -0.03
0.63 (0.31 – 0.83)
0.42 – 1.57
0.02 (0.28), -0.10 – 0.14
0.59 (0.25 – 0.80)
0.51 – 2.01
The reliability of the proprioceptive device has been assessed in a previous study by Fridén et al. . However, in that study, only the systematic change in the mean was used to assess test-retest reliability . If several reliability statistics are used, this may provide us with information regarding whether some variables are more reliable than others, and if the assessment tool is reliable for groups of subjects and for individual subjects. We found that three kinesthetic variables (TE20, TF20 and TF40) were sufficiently reliable to observe change in groups of subjects (ICC values ranging from 0.59 and 0.70, and 95% CI ranging from 10% to 38%), but that relatively large differences in an individual's performance would be required to confidently state that a real change had taken place. The TDPM variables from the 20-degree starting position seemed to be more reliable than those from the 40-degree position.
No systematic change in the mean was noted in three of the four kinesthetic variables (TE20, TE40, TF40), as zero was included in the 95% CI. This is in line with the previous study by Fridén et al. . The values of TF20 from test 2 were significantly lower than the values from test 1 (zero is not included in the 95% CI), which may be interpreted as a learning process. However, since the 95% CI was quite close to zero (Table 1), the clinical relevance of this learning effect can be questioned. According to the recommendations of Fleiss , ICC values above 0.75 represent excellent reliability, values between 0.4 and 0.75 represent fair to good reliability, while values below 0.4 represent poor reliability. Three variables (TE20, TF20, and TF40) showed ICC values above 0.40 but below 0.75, indicating good reliability, while one variable (TE40) showed poor reliability (ICC 0.16). Large variations between subjects result in high ICC values and, thus, more homogeneous data would result in lower ICC values . However, the standard deviations of the mean values of TE40 were not markedly smaller than those of the other three variables (Table 1). Thus, the low ICC value for TE40 cannot be explained by more homogeneous data. The high ICC values for TE20, TF20, and TF40, indicate that these variables are likely to observe change in groups of subjects without pathology. These high ICC values for TDPM variables are supported by findings in other studies [6, 32]. To be 95% confident of detecting a real change in groups of subjects, TDPM would need to decrease (i.e., improve) between 10% (TE20) and 38% (TF20), and increase (i.e., decline) between 17% (TF40) and 24% (TE40). In previous studies, patients with ACL injury had over 30% higher TDPM values (i.e., poorer kinesthetic acuity) than uninjured subjects [27, 28].
To evaluate changes over time in an individual, the magnitude of the change must exceed the inherent variability of the measurements. The LOA can be used to assess a "real" change in an individual's performance as a result of, for example, intervention, i.e., if the difference between two measurements is outside the LOA, there is a true change in performance . Since heteroscedasticity was found in the data, a log-transformation and a back-transformation were performed, giving the limits of the ratio between the two tests (LOAratio) . In the kinesthetic variables, TE20 showed the narrowest LOAratio, ranging between 0.57 and 1.78 times, i.e., one test may differ from another by 43% below (i.e., 43% lower value) to 78% above (i.e., 78% higher value). The TE40 showed the widest LOAratio, ranging between 0.31 and 3.09 times, i.e., one test may differ from another by 69% below to 209% above. The LOAratios were all rather wide, indicating that these tests cannot detect small changes in an individual's performance, i.e., a substantial difference in an individual's measurements would be required to confidently state that a change had actually taken place. According to Rankin and Stokes , at least 50 subjects are needed in reliability studies, otherwise the 95% limits of agreement will be too wide. Thus, one reason for the wide LOAratios in our study may be a too small sample size. To evaluate TDPM over time in individuals, it may be important to calculate LOA in a larger group of subjects (n ≥ 50). We have found no studies reporting absolute reliability of TDPM variables for knee kinesthesia. However, in a study by Pincivero et al. , the intra-subject variation was assessed for knee proprioception, by measuring the ability of subjects to "catch their leg" when the knee was dropped into extension from a relaxed position. They also found relatively large intra-subject variation, using SEMs as measures of absolute reliability . Thus, proprioceptive tests may be more useful and appropriate when distinguishing between groups of subjects, such as patients and controls, or when investigating the effect of an intervention in a group of subjects.
The different starting positions when measuring the TDPM were chosen to be within the working range of the knee during ordinary weight-bearing activities/exercise. The tendency towards higher ICC values and narrower LOAratios for the TDPM variables with the starting position at 20 degrees than those from 40 degrees, suggests that the variables TE20 and TF20 may be more reliable. Several other studies have reported higher reliability and/or higher sensitivity in detecting movements, in proprioceptive variables close to the end range of motion compared with in the mid range of motion in patients with ACL injury and uninjured subjects [18, 29, 33]. These findings may be explained by an increased afferent impulse generation near the terminal joint position, which is required to protect the joint from injury . Thus, from the results of the present study and those of others [18, 29, 33], it can be argued that measurements of TDPM close to the end range of motion are probably the most reliable and sensitive.
Three kinesthetic variables (TE20, TF20 and TF40) were found to be reliable in observing change in groups of subjects. TDPM would need to decrease between 10% and 38%, and increase between 17% and 24% in groups of uninjured subjects to be 95% confident of detecting a real change. The LOAratios revealed that small changes in an individual's measurements cannot be detected, i.e., a relatively large difference in an individual's kinesthetic measurements would be required to confidently state that a real change had taken place. These tests may thus be more useful and appropriate for observing change in groups of subjects. The higher ICCs and narrower LOAratios in the TDPM variables obtained with the starting position at 20 degrees knee joint flexion (i.e., closer to terminal extension), indicate that these variables are more reliable than those obtained with the starting position at 40 degrees. We, therefore, recommend that the TDPM variables from 20 degrees be used in future studies on subjects without pathology.
We would like to thank all the subjects who volunteered for this study, Per-Erik Isberg at the Department of Statistics, Lund University for statistical advice, and Henrik Nilsson for participating in data collection. The person in Figure 1 has approved publication of the picture. This study was supported by the Swedish National Centre for Research in Sports, and the Faculty of Medicine, Lund University.
- Riemann BL, Lephart SM: The Sensorimotor System, Part II: The Role of Proprioception in Motor Control and Functional Joint Stability. J Athl Train. 2002, 37 (1): 80-84.PubMedPubMed CentralGoogle Scholar
- Riemann BL, Lephart SM: The Sensorimotor System, Part I: The Physiologic Basis of Functional Joint Stability. J Athl Train. 2002, 37 (1): 71-79.PubMedPubMed CentralGoogle Scholar
- Solomonow M, Krogsgaard M: Sensorimotor control of knee stability. A review. Scand J Med Sci Sports. 2001, 11 (2): 64-80. 10.1034/j.1600-0838.2001.011002064.x.View ArticlePubMedGoogle Scholar
- Johansson H, Sjölander P, Sojka P: Receptors in the knee joint ligaments and their role in the biomechanics of the joint. Crit Rev Biomed Eng. 1991, 18 (5): 341-368.PubMedGoogle Scholar
- Lephart SM, Riemann BL, Fu FH: Introduction to the sensorimotor system. Proprioception and neuromuscular control in joint stability. Edited by: Lephart SM, Fu FH. 2000, Champaign, IL , Human Kinetics, xvii-xxiv.Google Scholar
- Reider B, Arcand MA, Diehl LH, Mroczek K, Abulencia A, Stroud CC, Palm M, Gilbertson J, Staszak P: Proprioception of the knee before and after anterior cruciate ligament reconstruction. Arthroscopy. 2003, 19 (1): 2-12.View ArticlePubMedGoogle Scholar
- Fridén T, Roberts D, Ageberg E, Waldén M, Zätterström R: Review of knee proprioception and the relation to extremity function after an anterior cruciate ligament rupture. J Orthop Sports Phys Ther. 2001, 31 (10): 567-576.View ArticlePubMedGoogle Scholar
- Koralewicz LM, Engh GA: Comparison of proprioception in arthritic and age-matched normal knees. J Bone Joint Surg Am. 2000, 82-A (11): 1582-1588.PubMedGoogle Scholar
- Pai YC, Rymer WZ, Chang RW, Sharma L: Effect of age and osteoarthritis on knee proprioception. Arthritis Rheum. 1997, 40 (12): 2260-2265. 10.1002/art.1780401223.View ArticlePubMedGoogle Scholar
- Sharma L, Pai YC, Holtkamp K, Rymer WZ: Is knee joint proprioception worse in the arthritic knee versus the unaffected knee in unilateral knee osteoarthritis?. Arthritis Rheum. 1997, 40 (8): 1518-1525. 10.1002/art.1780400821.View ArticlePubMedGoogle Scholar
- Barrett DS: Proprioception and function after anterior cruciate reconstruction. J Bone Joint Surg [Br]. 1991, 73 (5): 833-837.Google Scholar
- Barrett DS, Cobb AG, Bentley G: Joint proprioception in normal, osteoarthritic and replaced knees. J Bone Joint Surg [Br]. 1991, 73 (1): 53-56.Google Scholar
- Barrack RL, Skinner HB, Cook SD, Haddad RJ: Effect of articular disease and total knee arthroplasty on knee joint- position sense. J Neurophysiol. 1983, 50 (3): 684-687.PubMedGoogle Scholar
- Roberts D, Ageberg E, Andersson G, Friden T: Clinical measurements of proprioception, muscle strength and laxity in relation to function in the ACL-injured knee. Knee Surg Sports Traumatol Arthrosc. 2006Google Scholar
- Ageberg E, Roberts D, Holmström E, Fridén T: Balance in single-limb stance in patients with anterior cruciate ligament injury: relation to knee laxity, proprioception, muscle strength, and subjective function. Am J Sports Med. 2005, 33 (10): 1527-1535. 10.1177/0363546505274934.View ArticlePubMedGoogle Scholar
- Fridén T, Roberts D, Zätterström R, Lindstrand A, Moritz U: Proprioceptive defects after an anterior cruciate ligament rupture -- the relation to associated anatomical lesions and subjective knee function. Knee Surg Sports Traumatol Arthrosc. 1999, 7 (4): 226-231. 10.1007/s001670050153.View ArticlePubMedGoogle Scholar
- Roberts D, Fridén T, Zätterström R, Lindstrand A, Moritz U: Proprioception in people with anterior cruciate ligament-deficient knees: comparison of symptomatic and asymptomatic patients. J Orthop Sports Phys Ther. 1999, 29 (10): 587-594.View ArticlePubMedGoogle Scholar
- Borsa PA, Lephart SM, Irrgang JJ, Safran MR, Fu FH: The effects of joint position and direction of joint motion on proprioceptive sensibility in anterior cruciate ligament-deficient athletes. Am J Sports Med. 1997, 25 (3): 336-340. 10.1177/036354659702500311.View ArticlePubMedGoogle Scholar
- Lexell JE, Downham DY: How to assess the reliability of measurements in rehabilitation. Am J Phys Med Rehabil. 2005, 84 (9): 719-723. 10.1097/01.phm.0000176452.17771.20.View ArticlePubMedGoogle Scholar
- Rousson V, Gasser T, Seifert B: Assessing intrarater, interrater and test-retest reliability of continuous measurements. Stat Med. 2002, 21 (22): 3431-3446. 10.1002/sim.1253.View ArticlePubMedGoogle Scholar
- Atkinson G, Nevill AM: Statistical methods for assessing measurement error (reliability) in variables relevant to sports medicine. Sports Med. 1998, 26 (4): 217-238. 10.2165/00007256-199826040-00002.View ArticlePubMedGoogle Scholar
- Rankin G, Stokes M: Reliability of assessment tools in rehabilitation: an illustration of appropriate statistical analyses. Clin Rehabil. 1998, 12 (3): 187-199. 10.1191/026921598672178340.View ArticlePubMedGoogle Scholar
- Bland JM, Altman DG: Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986, 1 (8476): 307-310.View ArticlePubMedGoogle Scholar
- Bland JM, Altman DG: Measuring agreement in method comparison studies. Stat Methods Med Res. 1999, 8 (2): 135-160. 10.1191/096228099673819272.View ArticlePubMedGoogle Scholar
- Fleiss JL: Reliability of measurements. The design and analysis of clinical experiments. 1986, New York , John Wiley & Sons, 2-31.Google Scholar
- Tegner Y, Lysholm J: Rating systems in the evaluation of knee ligament injuries. Clin Orthop. 1985, 198: 43-49.PubMedGoogle Scholar
- Roberts D, Friden T, Stomberg A, Lindstrand A, Moritz U: Bilateral proprioceptive defects in patients with a unilateral anterior cruciate ligament reconstruction: a comparison between patients and healthy individuals. J Orthop Res. 2000, 18 (4): 565-571. 10.1002/jor.1100180408.View ArticlePubMedGoogle Scholar
- Fridén T, Roberts D, Zätterström R, Lindstrand A, Moritz U: Proprioception after an acute knee ligament injury: a longitudinal study on 16 consecutive patients. J Orthop Res. 1997, 15 (5): 637-644. 10.1002/jor.1100150502.View ArticlePubMedGoogle Scholar
- Fridén T, Roberts D, Zätterström R, Lindstrand A, Moritz U: Proprioception in the nearly extended knee. Measurements of position and movement in healthy individuals and in symptomatic anterior cruciate ligament injured patients. Knee Surg Sports Traumatol Arthrosc. 1996, 4 (4): 217-224. 10.1007/BF01567966.View ArticlePubMedGoogle Scholar
- Ranstam J: Repeated measurement and analysis units. Review of basic principles. Acta Orthop Scand. 1998, Norway , 69 (4): 345-346.Google Scholar
- Shrout PE, Fleiss JL: Intraclass correlations: Uses in assessing rater reliability. Psychol Bull. 1979, 86 (2): 420-428. 10.1037/0033-2909.86.2.420.View ArticlePubMedGoogle Scholar
- Safran MR, Allen AA, Lephart SM, Borsa PA, Fu FH, Harner CD: Proprioception in the posterior cruciate ligament deficient knee. Knee Surg Sports Traumatol Arthrosc. 1999, 7 (5): 310-317. 10.1007/s001670050169.View ArticlePubMedGoogle Scholar
- Pincivero DM, Bachmeier B, Coelho AJ: The effects of joint angle and reliability on knee proprioception. Med Sci Sports Exerc. 2001, 33 (10): 1708-1712. 10.1097/00005768-200110000-00015.View ArticlePubMedGoogle Scholar
- The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2474/8/57/prepub
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.