- Research article
- Open Access
Reference chart for knee flexion following total knee arthroplasty: a novel tool for monitoring postoperative recovery
BMC Musculoskeletal Disorders volume 21, Article number: 482 (2020)
Clinicians and patients lack an evidence-based framework by which to judge individual-level recovery following total knee arthroplasty (TKA) surgery, thus impeding personalized treatment approaches for this elective surgery. Our study aimed to develop and validate a reference chart for monitoring recovery of knee flexion following TKA surgery.
Retrospective analysis of data collected in routine rehabilitation practice for patients following TKA surgery. Reference charts were constructed using Generalized Additive Models for Location Scale and Shape. Various models were compared using the Schwarz Bayesian Criterion, Mean Squared Error in 5-fold cross validation, and centile coverage (i.e. the percent of observed data represented below specified centiles). The performance of the reference chart was then validated against a test set of patients with later surgical dates, by examining the centile coverage and average bias (i.e. difference between observed and predicted values) in the test dataset.
A total of 1173 observations from 327 patients were used to develop a reference chart for knee flexion over the first 120 days following TKA. The best fitting model utilized a non-linear time trend, with smoothing splines for median and variance parameters. Additionally, optimization of the number of knots in smoothing splines and power transformation of time improved model fit. The reference chart performed adequately in a test set of 171 patients (377 observations), with accurate centile coverage and minimal average bias (< 3 degrees).
A reference chart developed with clinically collected data offers a new approach to monitoring knee flexion following TKA.
Total knee arthroplasty (TKA) is one of the most common inpatient elective surgeries worldwide. There are approximately 700,000 procedures performed per year in the United States, and surgical rates are comparable in many European countries (120–200 per 100,000 people) [1, 2]. Despite the prevalence of TKA, there is little agreement in the clinical community regarding postoperative care and rehabilitation [3,4,5]. Rehabilitation practices vary widely by clinical site, and the content and goals of therapy are based largely on clinicians’ experience and intuition . Postoperative protocols typically indicate recovery milestones based on the expected clinical course for the average person, but surgical populations are heterogeneous [6,7,8]. Fundamentally, clinicians and patients lack an evidence-based framework by which to judge individual patients’ recovery following TKA surgery, thus impeding efforts to advance personalized or patient-centered treatment approaches for this elective surgery.
Reference charts are commonly used in healthcare settings to assist in monitoring patients’ progress and to inform clinical decisions at the level of the individual patient. The concept of monitoring the clinical course and adapting treatment decisions according to observations at the individual level has been promoted in psychotherapy [9, 10], in patients with chronic disease , and more recently in research designs (one-person trials) . In rehabilitation, reference charts have been proposed as a means of assessing patients’ response to preoperative inspiratory muscle training . The key concept in all cases is to base clinical decisions on the comparison of individual-level observations to an evidence-based reference .
The goal of this study was to develop and validate a reference chart with clinically collected data, to inform monitoring of knee flexion active range of motion (AROM) following TKA surgery. Knee flexion is frequently assessed following TKA and is widely cited in postoperative protocols as a marker of progress throughout recovery [3, 15]. Additionally, we sought to describe a systematic approach to generating reference charts, including model fitting and selection procedures, so that future investigations could extend this methodology to other outcomes following TKA or to clinical trajectory data for other patient populations. Ultimately, this work could serve as a template for developing evidence-based references to aid in monitoring and decision-making for a variety of health conditions.
Data were collected in the context of routine clinical practice at three sites (ATI Physical Therapy, in partnership with Greenville Health Systems) in South Carolina. As part of ongoing quality improvement efforts for patients undergoing knee arthroplasty, outcomes data were recorded on a semi-weekly basis throughout postoperative rehabilitation. Therapists were trained in the standardized application of outcomes assessments. The postoperative rehabilitation regime was also standardized across clinic locations and therapists. Outcomes data were compiled in a quality improvement (QI) database, housed at the University of Colorado Denver, using Research Electronic Data Capture (REDCap), a secure web-based software for database development. All analyses complied with a non-human subject research designation and were approved by the Colorado Multiple Institutional Review Board (COMIRB #: 15–1797).
For the purposes of this analysis, the QI database was queried for all available (de-identified) patient records. At the time of data extraction, a total of 897 patient records were available, with surgical dates between January 2013 and May 2016. However, 321 records could not be used because they lacked postoperative flexion AROM data. This was not unexpected, as patients commonly travel to the surgical center for preoperative consultation and surgery but subsequently undergo postoperative rehabilitation at a clinic closer to home. An additional 59 records were excluded because patients underwent a procedure other than primary unilateral TKA (17 patients underwent revision arthroplasty and 42 patients underwent unicompartmental arthroplasty). For the remaining records, postoperative flexion AROM data were available between 2 and 857 days (median 36 days) following surgery. We utilized the first 120 postoperative days for this project. We reasoned that rehabilitation typically occurs during this time window, and recovery plateaus between months 1 and 3 following surgery . Thus, we felt a reference chart describing recovery over the first 120 days would adequately capture the relevant time frame. A total of 498 patient records (1550 observations of flexion AROM) were used for this analysis.
Knee flexion active range of motion (AROM)
Knee flexion AROM (in degrees) was measured in a supine position via long-arm goniometry (see supplementary material). Briefly, patients were allowed to practice bending their knee 5 times, with therapist-assist as needed, prior to the therapist making the final assessment. For the final assessment the knee was placed in extension, and the patient was instructed to flex the knee as far as possible using only muscle power, leaving the heel on the surface. The fulcrum of the goniometer was placed at the medial joint line, with the lateral malleolus of the fibula and greater trochanter of the femur as distal landmarks . Physical Therapists were trained on a quarterly basis in this protocol, to standardize the collection of outcomes measures. Flexion AROM was measured on a semi-weekly basis throughout postoperative rehabilitation.
We divided the sample of patient records temporally (by surgical date) into a development set and test set. By this approach, the development set contained the earlier 75% of the knee flexion measurements and the test set contained the later 25% of the available knee flexion measurements. Model fit was adjudicated and a reference chart was constructed using the development set, and the performance of this chart was subsequently examined using the test set. The steps for generating and assessing reference curves were analogous to those used to develop reference charts for childhood growth, emphasizing procedures that would also yield a simple solution that limits the risk of overfitting the data (Table 1) [18,19,20].
Reference chart development
Using data from the development set, a series of statistical models were examined describing the variation of flexion AROM over the first 120 days following surgery. Generalized Additive Models for Location Scale and Shape (GAMLSS, version 4.4.0)  were used to obtain estimates of the median and other fitted centiles as smooth functions of the measurements in days. In GAMLSS, a variety of distributions can be used to fit the mean/median, variance, skewness, and kurtosis of the outcome. We selected 6 candidate distributions, of increasing complexity, for which to model knee flexion AROM. The Normal (NO) and Gamma (GA) distributions modeled 2 parameters (the median and variance) of the outcome. The t-family (TF) and Box-Cox Cole and Green (BCCG) distributions modeled the median, variance, and skewness of the outcome. The Box-Cox t distribution (BCT) and Box-Cox Power Exponential (BCPE) distributions modeled the median, variance, skewness and kurtosis of the outcome [21, 22]. Model fit was adjudicated numerically by the Schwarz Bayesian Criterion (SBC) . To protect against over-fitting, we also calculated the Mean Square Error (MSE) via 5-fold cross validation of each model (i.e. by developing the model in 80% of the development set and testing the model in the left-out 20%). Based on these metrics, we pursued model selection by the following approach:
First, we examined whether fitting cubic splines for each of the different parameters (i.e. median, variance, skewness, kurtosis) improved model fit. Next, we optimized: 1) the number of knots specified in splines for each parameter, and 2) the power-transformation of time, using the “find.hyper” function in GAMLSS. We then constructed reference charts and calculated the percentage of observed values captured below each of the specified centiles (5th, 10th, 25th, 50th, 75th, 90th, and 95th centile). Of the candidate models, the best solution would minimize the SBC, demonstrate low MSE by within-sample cross validation, and accurately describe percentiles in the dataset, both within the development set as well as when applied to the test dataset (e.g. 5% of the observed data would be captured below the 5th percentile, 10% below the 10th percentile, etc.).
Reference chart performance was examined by applying the reference curves to a test set of patients with later surgical dates. This approach was designed to mimic the process for development and subsequent use of the reference chart in practice. The accuracy with which the reference curves fit the new data was examined by z-test for proportions, and the average bias (difference between predicted and observed values) was calculated. Ideal performance would be reflected by accurate representation of the test set data (e.g., 5% of the observed data captured below the 5th percentile, 10% below the 10th percentile, etc.), and zero bias.
Demographic and anthropometric characteristics did not differ significanty between the development and test datasets (Table 2). The development set included 327 patients, with surgical dates between January 2013 and August 2015, while the test set included 177 patients with surgical dates between August 2015 and May 2016.
For each of the 6 selected GAMLSS distributions (NO, GA, TF, BCCG, BCT, BCPE) we examined model fit under a variety of conditions. According to SBC, it was beneficial to model the location (mean/median) and variance parameters with cubic splines (Table 3). However, fitting splines to skewness and kurtosis parameters did not yield additional improvements in model fit (Table 3). Further improvements in SBC were achieved by optimizing the power transformation of time and the number of knots in smoothing splines (Table 4). Additionally, the process of optimization yielded simpler models with fewer overall degrees of freedom. The optimized models specified 2.2 knots for the median and 1.2 knots for variance, with a power transformation approximating the square root of time (0.56).
Overall, reference charts for the optimized models demonstrated similar fit statistics and within-sample performance (i.e. within the development set). The BCCG, BCT, and BCPE distributions performed marginally better than the NO, GA, and TF distributions in representing percentiles of the development set data. For example, 45.7% of the observed data fell below the median for the reference chart built with a GA distribution, whereas 49.9% of the observed data fell below the median for the reference chart built with the BCCG distribution (Fig. 1a). The BCCG distribution performed incrementally better than BCPE in centile fit, and incrementally better than BCT according to SBC and MSE. Additionally, the BCCG model was the simpler model, with fewer overall degrees of freedom. Based on these metrics we chose to build the final reference chart for flexion AROM using the BCCG distribution.
The performance of the curves in the test set was slightly diminished relative to the performance in the development set. For example, 80% of the observations fell below the 75th percentile (p = 0.03 for the difference between the observed and expected proportions). However, other observed proportions were similar to expected proportions, and the average bias remained minimal at − 2.5 degrees of flexion AROM (Fig. 1b).
Reference values and chart
Figure 2 shows the reference chart we developed for monitoring flexion AROM after TKA. It gives a sense of the general trajectory and variability in postoperative recovery of flexion AROM over the first 120 days following surgery. The typical recovery trajectory demonstrates flexion AROM between 70 and 90 degrees (interquartile range) immediately following surgery, 95–115 degrees 1 month following surgery, and 109–122 degrees 3 months following surgery.
This reference chart illustrates how flexion AROM changes following surgery. In general, there is a period of rapid increase within the first 40 days, followed by a gradual plateau. The shape of the recovery trajectory differs depending on the centile of the reference chart. Higher centiles plateau more rapidly than lower centiles, with the lowest centiles continuing to demonstrate improvements up until 60–80 days following surgery. Moreover, there is substantial variability between individual patients in recovery of knee flexion. The interquartile range is between 13 and 22 degrees throughout the first 4 months of recovery. This variability illustrates the limitations of a one-size-fits-all approach to postoperative rehabilitation, as both the content of therapy and resource requirements are likely to differ between individuals with fast versus slow recovery of flexion. Tools such as this reference chart extend the work of previous studies (which have modeled the trajectory of recovery of the sample mean) [16, 24] to include an intuitive display of the variability in recovery, which may facilitate decisions at the level of the individual patient .
This flexion AROM reference chart represents a departure from one-size-fits-all protocols, which have traditionally been used to guide decisions following TKA surgery . A typical TKA rehabilitation protocol might stipulate a postoperative flexion AROM goal of 110 degrees, as this is the amount of knee flexion required for many activities such as symmetrical stair gait or cycling . However, at an individual level, patients may have more or less ambitious functional goals, and this reference chart provides an indication of the likelihood (as well as the timing) of whatever is best for the individual. Moreover, the variability in knee flexion AROM outcomes, readily apparent on the reference chart, further illustrates the problems in stipulating a single goal for all patients. According to the reference chart, a patient who demonstrates > 90 degrees of flexion AROM early after surgery should ultimately achieve an outcome much greater than 110 degrees, whereas a patient who demonstrates < 60 degrees of flexion AROM early after surgery has a high likelihood of not achieving 110 degrees. Thus, the reference chart accommodates a more nuanced picture of the clinical course, which could augment or replace protocol-based approaches to rehabilitation .
A strength of our study is the use of data collected in routine clinical practice, as the results are less likely to be influenced by volunteer bias or research eligibility criteria. This potentially enhances the future translation to rehabilitation clinical settings. The temporal validation used in this study is also a strength; it suggests the reference chart remains accurate over time. However, as our data were collected in a single clinic system (3 sites), the geographic generalizability is unknown. The references reported here could be compared to data collected at other sites in future work. Another potential limitation is the time frame selected for chart development. We limited our dataset to the first 4 postoperative months to cover the typical time frame of rehabilitation, but recovery may persist at a slower rate for many additional months . Thus, there may be value in expanding this reference chart to cover a longer post-operative time frame. Finally, the reference chart presented here describes knee flexion, but multiple outcome measures are likely to be important to individuals and clinicians following TKA surgery [28, 29]. Future work could focus on developing a menu of reference charts describing a range of clinical outcome measures (e.g. physical functioning, pain, quality of life) to support recovery monitoring .
We have developed and tested a reference chart for knee flexion AROM following TKA surgery. The final reference chart was accurate via both within-sample and out-of sample testing. It is designed to be easy to use in practice to track postoperative recovery of knee flexion AROM for individual patients, relative to others who have previously undergone TKA surgery.
Availability of data and materials
The datasets used for this analysis may be available upon reasonable request. Please contact the senior author, Dr. Jennifer Stevens-Lapsley, for details.
Total knee arthroplasty
Centers for Disease Control
World Health Organization
Active range of motion
Research Electronic Data Capture
Generalized Additive Models for Location Scale and Shape
Box-Cox Cole and Green
Box-Cox t distribution
Box-Cox Power Exponential
Schwarz Bayesian Criterion
Mean Square Error
Facts and Figures 2009. Healthcare Cost and Utilization Project (HCUP). 2017. 2010 May 22; Available from: http://www.hcup-us.ahrq.gov/reports/factsandfigures/2009/exhibit3_1.jsp.
Kurtz SM, Ong KL, Lau E, Widmer M, Maravic M, Gomez-Barrena E, et al. International survey of primary and revision total knee replacement. Int Orthop. 2011;35(12):1783–9.
Peter WF, Nelissen RG, Vlieland TP. Guideline recommendations for post-acute postoperative physiotherapy in total hip and knee arthroplasty: are they used in daily clinical practice? Musculoskeletal Care. 2014;12(3):125–31.
NIH Consensus Statement on total knee replacement. NIH consensus and state-of-the-science statements 2003;20(1):1–34.
Naylor JM, Hart A, Harris IA, Lewin AM. Variation in rehabilitation setting after uncomplicated total knee or hip arthroplasty: a call for evidence-based guidelines. BMC Musculoskelet Disord. 2019;20(1):214.
Weiss JM, Noble PC, Conditt MA, Kohl HW, Roberts S, Cook KF, et al. What functional activities are important to patients with knee replacements? Clin Orthop Relat Res. 2002;404:172–88.
Naylor JM, Harmer AR, Heard RC, Harris IA. Patterns of recovery following knee and hip replacement in an Australian cohort. Aust Health Rev. 2009;33(1):124–35.
Lugano G, Gianola S, Castellini G, Banfi G, Seil R, Denti M, et al. Evidence-based medicine (EBM) is properly perceived but its application is still limited in the orthopedic clinical practice: an online survey among the European Society of Sports Traumatology, knee surgery and arthroscopy (ESSKA) members. Knee Surg Sports Traumatol Arthrosc. 2020;28(5):1665–72.
Lutz W, Rafaeli E, Howard KI, Martinovich Z. Adaptive modeling of progress in outpatient psychotherapy. Psychother Res. 2002;12(4):427–43.
Lueger RJ, Howard KI, Martinovich Z, Lutz W, Anderson EE, Grissom G. Assessing treatment progress of individual patients using expected treatment response models. J Consult Clin Psychol. 2001;69(2):150–8.
Glasziou P, Irwig L, Mant D. Monitoring in chronic disease: a rational approach. Brit Med J. 2005;330(7492):644–8.
Schork NJ. Personalized medicine: time for one-person trials. Nature. 2015;520(7549):609–11.
van Buuren S, Hulzebos EH, Valkenet K, Lindeman E, van Meeteren NL. Reference chart of inspiratory muscle strength: a new tool to monitor the effect of pre-operative training. Physiotherapy. 2014;100(2):128–33.
Kittelson AJ, Hoogeboom TJ, Schenkman M, Stevens-Lapsley JE, van Meeteren NLU. Person-centered care and physical therapy: a “people-like-me” approach. Phys Ther. 2020;100(1):99–106.
Kisner C, Colby LA. Therapeutic exercise: foundations and techniques. 6th ed. Philadelphia: F.A. Davis; 2012. p. 1023.
Kennedy DM, Stratford PW, Riddle DL, Hanna SE, Gollish JD. Assessing recovery and establishing prognosis following total knee arthroplasty. Phys Ther. 2008;88(1):22–32.
Brosseau L, Balmer S, Tousignant M, O'Sullivan JP, Goudreault C, Goudreault M, et al. Intra- and intertester reliability and criterion validity of the parallelogram and universal goniometers for measuring maximum active knee flexion and extension of patients with knee restrictions. Arch Phys Med Rehabil. 2001;82(3):396–402.
Hayes DJ, van Buuren S, ter Kuile FO, Stasinopoulos DM, Rigby RA, Terlouw DJ. Developing regional weight-for-age growth references for malaria-endemic countries to optimize age-based dosing of antimalarials. Bull World Health Organ. 2015;93(2):74–83.
Flegal KM, Cole TJ. Construction of LMS parameters for the Centers for Disease Control and Prevention 2000 Growth charts. Nat Health Stat Rep. 2013;63:1–3.
Stasinopoulos DM, Rigby RA, Akantziliotou C. Instructions on how to use the gamlss package in R. 2nd ed; 2008.
Rigby RA, Stasinopoulos DM. Smooth centile curves for skew and kurtotic data modelled using the box-cox power exponential distribution. Stat Med. 2004;23(19):3053–76.
Rigby RA, Stasinopoulos DM. Using the box-cox t distribution in GAMLSS to model skewness and kurtosis. Stat Model. 2006;6(3):209–29.
Cavanaugh JE, Neath AA. Generalizing the derivation of the schwarz information criterion. Commun Stat Theory Methods. 1999;28(1):49–66.
Kennedy DM, Stratford PW, Hanna SE, Wessel J, Gollish JD. Modeling early recovery of physical function following hip and knee arthroplasty. BMC Musculoskelet Disord. 2006;7:100.
Kraemer HC, Frank E, Kupfer DJ. Moderators of treatment outcomes: clinical, research, and policy importance. JAMA. 2006;296(10):1286–9.
Rowe PJ, Myles CM, Walker C, Nutton R. Knee joint kinematics in gait and other functional activities measured using flexible electrogoniometry: how much knee motion is sufficient for normal daily life? Gait Posture. 2000;12(2):143–55.
Bade MJ, Kittelson JM, Kohrt WM, Stevens-Lapsley JE. Predicting functional performance and range of motion outcomes after total knee arthroplasty. Am J Phys Med Rehabil. 2014;93(7):579–85.
Yoshida Y, Mizner RL, Ramsey DK, Snyder-Mackler L. Examining outcomes from total knee arthroplasty and the relationship between quadriceps strength and knee function over time. Clin Biomech. 2008;23(3):320–8.
Mizner RL, Petterson SC, Clements KE, Zeni JA Jr, Irrgang JJ, Snyder-Mackler L. Measuring functional improvement after total knee arthroplasty requires both performance-based and patient-report assessments: a longitudinal analysis of outcomes. J Arthroplast. 2011;26(5):728–37.
This study is funded by the Agency for Healthcare Research and Quality (AHRQ R03 HS024316, AHRQ R01 HS025692) and the National Institutes of Health (NIH K12 HD055931). The funding body had no role in the study design, data collection, data analysis, and interpretation of data, nor any role in writing the manuscript.
Ethics approval and consent to participate
Ethics approval to conduct this study was obtained from the Colorado Multiple Institutional Review Board (COMIRB #: 15–1797).
Consent for publication
The authors have no conflicts of interest to report.
About this article
Cite this article
Kittelson, A.J., Elings, J., Colborn, K. et al. Reference chart for knee flexion following total knee arthroplasty: a novel tool for monitoring postoperative recovery. BMC Musculoskelet Disord 21, 482 (2020). https://doi.org/10.1186/s12891-020-03493-x
- Generalized additive models for location scale and shape
- Clinical decision making