Hospital volume and the risk of revision in Oxford unicompartmental knee arthroplasty in the Nordic countries -an observational study of 14,496 cases

Background High procedure volume and dedication to unicompartmental knee arthroplasty (UKA) has been suggested to improve revision rates. This study aimed to quantify the annual hospital volume effect on revision risk in Oxfordu﻿ ﻿nicompartmental knee arthroplasty in the Nordic countries. Methods 14,496 cases of cemented medial Oxford III UKA were identified in 126 hospitals in the four countries included in the Nordic Arthroplasty Register Association (NARA) database from 2000 to 2012. Hospitals were divided by quartiles into 4 annual procedure volume groups (≤11, 12-23, 24-43 and ≥44). The outcome was revision risk after 2 and 10 years calculated using Kaplan Meier method. Multivariate Cox regression analysis was used to assess the Hazard Ratio (HR) of any revision due to specific reasons with 95% confidence intervals (CI). Results The implant survival was 80% at 10 years in the volume group ≤11 procedures per year compared to 83% in other volume groups. The HR adjusted for age category, sex, year of surgery and nation was 0.87 (95% CI: 0.76-0.99, p = 0.036) for the group 12-23 procedures per year, 0.78 (95% CI: 0.68-0.91, p = 0.002) for the group 24-43 procedures per year and 0.82 (95% CI: 0.70-0.94, p = 0.006) for the group ≥44 procedures per year compared to the low volume group. Log-rank test was p = 0.003. The risk of revision for unexplained pain was 40-50% higher in the low compared with other volume groups. Conclusion Low volume hospitals performing ≤11 Oxford III UKAs per year were associated with an increased risk of revision compared to higher volume hospitals, and unexplained pain as revision cause was more common in low volume hospitals.


Background
The Oxford unicompartmental knee arthroplasty (UKA) has been investigated in numerous studies due to the deviant results comparing registry results to studies from high volume centers and surgeons. Data from national registries show a significantly higher revision rate for both short and long term results for UKA than for total knee arthroplasty (TKA) [1][2][3][4][5]. Other studies from highvolume Oxford developing centers, however, show excellent long-term results [6,7]. The existing variability in practice regarding indication and usage of UKA results in low volumes in hospitals using strict criteria [8], and higher volumes in hospitals offering UKA to patients using less strict criteria [9]. The Nordic Arthroplasty Register Association is a collaboration of arthroplasty registers in Sweden, Denmark, Norway and Finland established in 2007. The cooperation has produced a common defined set of variables agreed upon, enabling analyses of larger statistical material [10]. This is an advantage especially for uncommon methods and procedures, such as the UKA constituting only 11% of the knee arthroplasties in the Nordic countries [11]. The advantage of a registry study for our purpose was the representation of all surgeons in all hospitals in Sweden, Denmark, Finland and Norway resulting in more generalizable findings. The UKA is utilized at similar lower percentage than TKA in the majority of countries with registries worldwide for the treatment of osteoarthritis [2,12]. The aim of this study was to investigate how the patient risk for revision surgery after Oxford III UKA varied as a function of hospital procedure volume. Adding to the analyses for all causes of revision, the second objective was to assess any differences in the proportion of the specific causes of revision according to volume groups.

Data sources
We used the NARA database, containing a common defined code set to identify patients undergoing primary cemented medial Oxford III UKA between January 1, 2000 and December 31, 2012 in this population-based register study [11,13]. Every year all uniform variables from each national register are re-coded according to common definitions and anonymized and then merged into the NARA database. The linkage between primary procedure and subsequent revision or death on individual data is performed in each national register before merged into the NARA database. The first studies focused on differences in patient demographics, surgical methods and implant brands [10,11,14]. The main purpose of NARA was the ability to analyze a larger statistical material, which is an advantage especially for uncommon methods and implants. It reflects the current practice in 4 different countries. The knee dataset currently includes 390,525 primary knee arthroplasty operations performed during 1995-2012 [13]. The Oxford UKA was the most commonly registered UKA implant in the NARA.

Study population
Implant brand and type could be a source of confounding in comparison to revision rate according to hospital, and therefore all other brands and types than Oxford III UKA were excluded. Diagnoses other than osteoarthritis (OA) were excluded as inflammatory disease is a contraindication in UKA. The inclusion criteria for this study, to obtain comparable groups for analysis, are shown in the flowchart (Fig. 1). In NARA revision is defined as removal/exchange/addition of one or more implant component(s) and is linked to the primary procedure by the unique national identification number of the patient.
We  (Table 1). The inclusion of bilateral knee arthroplasty can be a violation of the assumption of independent observations in survival analyses, but studies have shown that the effect is minor regarding statistical precision for survival analysis of knee replacements [15]. In this study, 14% of the patients had bilateral knee arthroplasty.

Exposure
All Oxford III UKA procedures were entered into one of four different annual hospital volume groups. We used quartiles to divide into equal numbered volume groups; ≤11, 12-23, 24-43 and ≥44 procedures per year. Hospitals with inconsistent procedure volume over time may have contributed to different volume groups according to the number of procedures at their hospital in the year of surgery. Thus, for each hospital each year was examined individually. This categorization of the exposure assumes that unspecified hospital-level effects are trumped by a potential volume effect on revision rates. Revision due to any reason as well as specific causes for revision was analysed.

Statistics
Survival analyses were performed with any revision of the implant as endpoint. Kaplan Meier cumulative survival at 2 and 10 years was reported. A 2 year follow-up was chosen to assess early revisions. The follow-up started at the day of primary UKA procedure and ended at the day of first revision, death, emigration or the end of follow-up time (December 31st 2012). The two highest volume groups had shorter follow-up compared to the lower (chi-square test p-value <0.001). Log-rank test was performed, p = 0.003. Differences for categorical variables such as sex, age categories, year of surgery and nations were assessed by Pearson's chi-squared test. Any p-values less than 0.05 were considered significant. To estimate differences in continuous variables the student t-test was used. The Cox regression model was used to calculate Hazard Ratios (HR) with 95% confidence interval (CI) for the 10 year follow-up period to investigate the association between four hospital procedure volume groups and implant survival time. P-values were presented relative to the lowest volume group (≤ 11). All p-values less than 0.05 were considered to be statistically significant. The Cox model included sex, age category, year of surgery, nation and hospital volume. Death is to be considered a possible competing risk to revision. We studied the influence of death by performing a competing risk analysis using the statistical software R [16,17]. The results for the volume groups did not change significantly when accounting for death as a competing risk for revision (Table 2). Cox regression analyses were made for the different confounding variables and are presented in Table 3.
The various reasons for revision were organized hierarchically with infection first and unexplained pain last, as shown in Table 4. Loosening and wear were second in the list and instability and dislocation third. The group 'other reasons' contained new diseases occurring in the joint such as osteoarthritis or osteonecrosis laterally or joint fibrosis with stiffness. Surgical errors such as incorrect sizing of components were also included in this group. When more than one reason was reported, the top reason in the hierarchy was used as endpoint in the analyses. Pain as a cause of revision was used as endpoint only when pain was the only reason reported. HR with 95% CI was reported for different revision causes with 10 years follow-up. The proportional hazards assumption of the Cox model was tested based on log-minus-log plot and found to be valid. SPSS version 23 and R statistical software package version 3.2.1 were used for the statistical analyses.  The Kaplan Meier 2 year survival was 95% for the three hospitals groups with annual procedure volume > 11 and 93% for the hospitals performing ≤11 Oxford III UKA per year. The Kaplan Meier estimated survival had dropped to 80% at 10 years follow up with poorest result for the ≤11 per year group ( Table 2). The three hospital volume groups of >11 had an estimated survival of 83% at 10 years. The Log-rank test was statistically significant with p = 0.003.

Revision causes
The distribution of revision causes among the 1519 revised cemented medial Oxford III implants from 2000 to 2012-according to hospital volume-is shown in Table  4. We found a difference in the risk of revision for unexplained pain among the volume groups. The volume groups performing >11 Oxford III UKA per year revised 40-50% fewer patients for unexplained pain than the lowest volume hospitals (≤11 per year). The other revision causes did not show any statistically significant differences between the groups (Table 4).

Discussion
In this large population based study based on 14,496 cemented medial Oxford III unicompartmental knee arthroplasty performed in four Scandinavian countries; we showed that high procedure volumes (>11 procedures per year) were associated with a decreased risk for revision.
This study contributes to the knowledge of other previously published results. There are available studies on the impact of procedure volume in UKA, and the common denominator is the Oxford implant since its usage is widespread. The Swedish study from 2001 found that performing less than 23 UKA per year was associated with a higher risk of revision [18], whereas Baker et al. [19] suggested a minimum annual volume of 13. Our previous study from Norway indicated fewer revisions  with an annual caseload of more than 40 [20]. A study from the National Joint Registry of England and Wales (NJR) regarding determinants of revision following UKA supported the importance of experience measured at the unit level as well, and also favoring consultants rather than trainees [21]. A recent study from the NJR recommended surgeons to perform at least 20% of their knee arthroplasties as UKAs to achieve lower rates of revisions [22]. They also found that 81.4% of the surgeons performed less than 10 UKA per year. This corresponds to our findings of extreme skewness with dominance of low-volume performance. Some registers on the other hand recommend the use of fewer UKA due to higher failure rates [23]. Our study from 4 countries suggests a minimum hospital volume per hospital of 11. However, considering the variety of the previously mentioned studies and results, a threshold value of 11 per year could be considered a conservative value. Our study included data from 4 different national registers with multiple surgeons and hospitals with varying experience and volume, suggesting high external validity. It reflects the practice in 4 different countries. Due to complete follow up of all patients in the study population with censoring at the time of death, emigration, or at the end of follow up, selection bias is unlikely. Additionally, only patients who received an Oxford III UKA with the diagnosis OA were selected (Fig. 1). We limited the analyses to the latest time period from 2000, excluding older implants and techniques. Using previously described methods of analysing the impact of procedure volume also strengthen the study [20,22,24,25]. The advantage of analyzing each year separately is the reflection of the procedure volume that particular year.
Revision was less likely in older patients compared to the younger in our study. Other studies have shown that young patients experience an increased risk of revision after UKA compared to older patients [21,[26][27][28]. W-Dahl et al. [29] and Liddle et al. [21] also found that older patients had the greatest benefits and the lowest revision rates. In addition, UKA has been associated with lower rates of morbidity and mortality compared to TKA [30]. Sweden had the best implant survival of all the 4 countries. This could be a result of longer training of Swedish surgeons, starting unicompartmental knee arthroplasty surgery and a knee arthroplasty register before the other Nordic countries, and thereby gaining more experience. Sweden differs from the other nations with less than 50% of the implanted UKAs being Oxford and thus their learning curve could be improved by surgical experience performing other types of UKA. Denmark had inferior results compared to the other countries and contributed to the majority of patients in high volume hospitals (52% in the ≥44 group). We performed sensitivity analysis with and without data from Denmark. The tendency in the results for the volume groups did not change excluding Denmark. Denmark also has poorer results in the low volume groups. The cause of poorer results in Denmark is not possible to verify, but learning curve, threshold for  revision and patient selection could be explanation factors.
Theoretically, an increase in inexperienced surgeons implementing a new technique could initially lead to many revisions, but if continued, an expected improvement should occur. This could also explain the deteriorating results in the last time period. Analyses of specific revision causes revealed a higher risk of revision for unexplained pain in low volume hospitals as compared to higher volume hospitals. We found minor differences for the other revision causes ( Table 4). Baker et al. found that while more unicompartmental knee implants than total knee implants were revised for unexplained pain, when these revisions for unexplained pain were discounted, unicompartmental knee arthroplasty still had a significantly greater risk of revision from other reasons than did total knee arthroplasty [31]. However the numbers of revisions in each group were too small to allow making any conclusions regarding the differences between the volume groups.
There has been an on-going discussion regarding the threshold for revision due to unexplained pain [32]. Similarly, the incidence of radiolucent lines at the bone-implant interface [33] could be misinterpreted as loosening by unexperienced Oxford-users, and thereby leading to unnecessary revisions. Nevertheless, in cases with concurrent pain or symptomatology, it could be argued that revision is motivated. These could be explanations to the differences in revision rates, suggesting a lower revision-threshold in lowvolume users. However, even the highest volume hospitals could not match the outcomes reported by developers [6,7,34] or the results after TKA regarding revision rates [24,35]. A retrospective independent sample of failures reported to the registers could be one approach to evaluate the indication for revision surgery and identifying critical errors in the primary surgical technique and patient selection. Precise surgical indications for both primary and revision surgery are still debated [8,22]. Furthermore, whether emphasis should be put on the higher revision rates of UKA compared to TKA or the lower risk of postoperative death and complications comparing UKA to TKA is also important to take into consideration [35].
Limitations to the study may be unmeasured factors such as decision-making regarding pre-operative radiographic changes leading to primary indication for surgery [36]. In addition, information on life style factors and physical activity was not available. The selection of patients considered suitable for UKA surgery is debatable regarding radiographic findings, age and BMI [8,22]. Only hospital procedure volume was available for analysis in the NARA database, surgeon caseload and experience were not available. Theoretically, a high volume surgeon in a high volume center would gain the best results according to a systematic review regarding surgery volume [37]. However, the volume of a center had an equal if not greater effect on patient outcome than surgeon volume. Categorization of the volume exposure assumes that any (unspecified) hospital-level effects (e.g. the care that patients within a specific hospital receive, independent of volume) are trumped by a potential volume effect on revision rates. The analyses in this study are limited to the cemented medial Oxford III UKA and may limit the generalizability of the results to be valid for other UKA implant types.

Conclusion
Hospitals performing ≤11 Oxford III UKA per year had a higher risk of revision, and were more likely to perform revisions due to unexplained pain.