Loosening and revision rates after total shoulder arthroplasty: a systematic review of cemented all-polyethylene glenoid and design of the modern metal-backed glenoid


 Background: Modern designs of metal-backed glenoids (MBG) have been devised to overcome flaws such as loosening and a high failure rate. This review aimed to compare rates of complications and revision surgeries between the modern metal-backed glenoid (MBG) and the cemented polyethylene glenoid (PEG).Methods: Literature search was carried out using PubMed, Cochrane Library, EMBASE, and Google Scholar using MeSH terms and natural keywords. A total of 1186 articles were screened. We descriptively analyzed numerical data between the groups and statistically analyzed the categorical data, such as the presence of radiolucent line, loosening, and revision surgery (failure). Articles were divided into 3 groups based on follow-up duration: short-term < 36 months, midterm 36–72 months, and long-term > 72 months.Results: This study included 35 articles (3769 shoulders); 25 on cemented PEG and ten on the modern MBG. Mean age was 66.4 (21–93) and 66.5 years (31–88). The mean duration of follow-up was 73.1 (12–211) and 56.1 months (24–100). Overall, the rate of the radiolucent line was 354/1302 (27%) and 47/282 (17%), the loosening rate was 465/3185 (15 %) and 22/449 (5%), and the failure rate was 189/3316 (6%) and 11/457 (2%), for PEG and MBG, respectively. The results of short- and mid-term FU studies showed lower rates of radiolucency and loosening in the cemented PEG group, but there was no significant difference in failure rate (P=0.754 and 0.829, respectively). In long-term FU, MBG was better in terms of loosening (P <0.001) and failure rates (P = 0.006).Conclusions: The modern MBG component, especially TM glenoid, seems to be a promising alternative to cemented PEGs, based on subgroup revision rates according to the follow-up duration and overall results of ROM and clinical scores. All polyethylene glenoids tend to increase loosening and failure over time. The modern MBG seems to have no difference in failure, at least in the short- and mid-term compared to the cemented PEG. More long-term follow-up studies on modern MBG should be ultimately conducted.

term follow-up studies on modern MBG should be ultimately conducted.

Background
Although numerous studies on total shoulder arthroplasty (TSA) have aimed to find the optimal TSA design, no definite conclusions have been made [1]. The glenoid component of TSA is divided into keel type and peg type according to its shape, and can be made of all polyethylene (PE) or be metal-backed. Both metal-backed glenoids (MBG) and cemented polyethylene glenoids (PEG) were initially used, however due to the nature of the initial MBG design, the polyethylene liner was very thin and resulted in a high wear and failure rate [2]. A systematic review conducted in 2014 concluded that MBGs are not recommended as they show higher failure rates [3].
However, advanced MBG designs were devised to address these shortcomings, increasing the chance of good clinical outcomes [4][5][6]. We aimed to summarize and compare the results of TSA using cemented PEG and modern MBG by examining radiolucency, loosening, and failure rate. Our null hypothesis was that radiolucency, loosening, and failure rates of modern MBGs would be similar to those of cemented PEG.

Methods
This systematic review was conducted in accordance with the preferred reporting items for systematic reviews and meta-analyses (PRISMA) guidelines [7]. Additionally, we have registered the current review on the website of International prospective register of systematic reviews (PROSPERO, CRD42019137134).

Inclusion and exclusion criteria
We regulated various factors that could cause heterogeneity using strict inclusion and exclusion criteria determined by group discussion. Articles eligible for inclusion had to be a study on adults (> 18 years old), be a clinical study presenting the results of TSA using the cemented PEG or modern MBG with more than a two year mean follow-up (FU), a study including any type of shoulder arthritis, and be written in English. Case reports or articles with fewer than 5 cases were excluded. Also, articles that show the results of hybrid cage glenoids, mixed cases of revision arthroplasty, or mixed cases structural bone graft, and articles which do not present the main outcomes (number of revisions or failure) were excluded.

Search strategy and study selection
PubMed, Embase, Google Scholar, and Cochrane Library were searched to find a large number of relevant articles. We conducted group discussions and consulted medical informatics experts for an effective search strategy. After such discussions and consultations, we decided to search for final articles using individual search terms for MBG and PEG, respectively. The search terms for articles on cemented PEG were "total AND shoulder AND (replacement OR arthroplasty) AND polyethylene". Search terms for articles on modern MBG were "total AND shoulder AND (replacement OR arthroplasty) AND (metal OR backed OR (cementless glenoid))". After excluding duplicated documents, two independent reviewers screened the title and abstract, and finally selected articles through full-text review. We also performed citation tracking and search updates to find additional related articles using Google Scholar as an additional tool. All disagreements were resolved through group discussions of three or more authors.

Methodological assessment and data extraction
Levels of evidence were assessed according to the Oxford Center for Evidence Based Medicine [8]. The methodological quality of the studies included in this review was assessed using the methodological index for non-randomized studies (MINORS) [9]. A total of 8 items were evaluated for non-comparative studies, and 12 items for comparative studies. As 0, 1, or 2 points can be assigned to each item, non-comparative studies can have a total of 16 points, while comparative studies can have a total of 24 points. A study that obtained more than 60% of the total score was considered as a high-quality article, and the distribution of high-quality articles was analyzed between the two groups.
In order to define "modern design", the core topic of this study, the most up to date articles on the glenoid component were reviewed in group discussion. The advanced MBG designs presented by Castagna and Garofalo, who comprehensively assessed the product development year, conformity, rod, keel shape, and material, were defined as modern MBGs [10]. We included three designs in the modern MBG group: 1) second-generation SMR MBG (SMR System, Lima Corporate, Villanova, di San Daniele, Udine, Italy), 2) firstgeneration trabecular metal (TM) glenoid which consists of a soft MBG, the Sulmesh (Zimmer, Winterthur, Switzerland), and 3) the second-generation TM glenoid (Zimmer, Winterthur, Switzerland). If studies on the recent MBG design (after 2010) which was not one of the three designs mentioned above were found, we decided to conduct a group discussion. No such study was found, so the three designs were finally considered "modern design". The remaining designs were considered conventional designs and were excluded from this study.
Three independent reviewers extracted the number of shoulders, age, sex, FU duration, surgery procedures, medical and surgical history, preoperative diagnosis, name of implant and manufacturer, clinical score, range of motion (ROM), radiologic FU such as radiolucent lines, loosening, other complications, and revision or failure from the articles. The radiolucent line was defined as a radiolucency of 1 mm or more, grade 2 or more on the Lazarus radiolucency scoring system, or seven or more points out of a total of 18 points [11]. Failure was defined as complications that resulted in revision surgery involving an implant-related procedure. Loosening included both radiological and clinical loosening.
Data presented by other methods and ambiguous data were not extracted.

Statistical analyses
We used strict criteria to minimize heterogeneity. However, trends in age, FU duration, and preoperative diagnosis could be identified after data extraction. In particular, FU duration was considered to be the most important variable associated with implant failure.
We collaborated with medical statisticians on data interpretation and data analysis (including scatter plot and subgroup analysis). For categorical variables such as the presence of radiolucent lines, loosening, and failure or revision surgeries, statistical analysis was performed on the difference between cemented PEG and modern MBG.
Since the FU duration varies from study to study, we determined that a simple overall comparison between 2 groups was not sufficient, and therefore two additional analyses were performed according to the FU duration. Firstly, a scatter plot was used that plots the mean FU duration and loosening and revision rates of each study. Trend lines were weighted according to the number of cases to identify trends of loosening and revision rates between the two groups. Secondly, a subgroup analysis was performed that divided the FU duration into three groups based on 36 and 72 months as the statistician suggested. The three subgroups were defined in terms of short-term (FU < 36 months), midterm (36-72 months), and long-term (72 months) to ease the representation of subgroups. Subsequently, we analyzed the radiolucency, loosening and revision rates overall, and for the short-term FU, mid-term FU, and long-term FU. We analyzed the radiolucency, loosening and revision rates overall, and for the short-term FU, mid-term FU, and long-term FU. All statistical analyses were performed using R version 3.5.1 (R Foundation for Statistical Computing, Vienna, Austria). P-values less than 0.05 were determined to be statistically significant. Since numerical data were often missing important values such as standard deviation, a meta-analysis could not be performed.
Therefore, descriptive analysis and weighted means were performed on the numerical data of 2 groups.

Search results
Two hundred forty-one articles on cemented PEG were found in PubMed, 371 in Embase, and 24 articles in Cochrane Library. Subsequently, 177 articles on modern MBG were found in PubMed, 324 articles in Embase, and 29 articles in Cochrane Library. Through screening titles and abstracts and using full-text review, 25 PEG and 9 MBG articles were included. One article was added through citation tracking of selected articles, and no additional articles were found in the search update (Fig. 1). The final cemented PEG group included 3312 patients (25 articles) , and the modern MBG group included 457 patients (10 articles) [4-6, 37-43].

Assessment of methodological quality and heterogeneity between two groups
Levels of evidence and MINORS scores were determined by agreement between the two investigators, and there was no disagreement; one randomized controlled trial (Level I), one prospective comparative study (Level II), five Level III studies, and 28 Level IV studies were included. The mean MINORS scores, except for one Level I study, were 9.75 ± 1.38 for non-comparative studies and 16.8 ± 1.57 for comparative studies. Fifteen of the 25 studies on the cemented PEG (including Level I study, 60%) and 6 of the ten studies on the modern MBG (60%) were classified as high-quality articles (Fig. 2).
We analyzed the distribution of three factors that could introduce heterogeneity. Age and FU duration are shown using the summary plot (Figs 3A and 3B). Age showed a similar pattern except for three studies in the PEG group with young adults, whereas the cemented PEG group tended to have a longer FU period than the modern MBG group. The distribution of preoperative diagnosis was similar between the two groups ( Fig. 4), and the proportion of primary osteoarthritis was not statistically different (P = 0.310). Table 1 shows the demographic data and the outcome measurements of each study. Each study used a variety of measures; commonly used items were forward elevation (FE, 18 and 5 articles for cemented PEG and modern MBG, respectively), external rotation (ER, 18 and 5 articles), Constant score (13 and 3 articles), and ASES scores (7 and 6 articles), pain visual analogue scale (VAS, 5 and 7 articles), complications (most articles), and revision surgeries or failure (all articles) (Fig. 5). The results for each article for each commonly used item are shown in Table 2.

Clinical outcomes and complications of cemented PEG and modern MBG groups
Based on the data obtained in Table 2, an overall comparison between the two groups was performed ( Table 3). The mean gain of the arc of flexion-extension (F-E) was 48.6° and 61.7° and the ER increase was 24.2° and 39.2°, the mean Constant score increase was 34.8 and 40.4, and the ASES score was 44.5 and 56.5 for cemented PEG and modern MBG, respectively (Fig. 6). Rates of radiolucent lines, loosening, and revision surgery were lower in the modern MBG group, although incomplete results did not resolve heterogeneity. The causes of the revision are summarized in Fig. 7; the most common cause of reoperation for the cemented PEG group was loosening of glenoids (83 out of 141 known causes, 59.0%), and fractures of glenoid components for the modern MBG group (6 out of 11 known causes, 54.5%).

Scatter plots and subgroup analysis according to the FU duration
We performed additional scatter plot and subgroup analyses according to the FU duration, which showed a heterogeneous pattern. The trend lines showed that the MBG group tended to have lower loosening and revision rates than the PEG group over time (Figs 8A and 8B). Table 4 shows the results of subgroup analysis by short-term, mid-term, and long-term FU. The results of the short-and mid-term FU studies showed that cemented PEG showed good results in terms of radiolucency and loosening, but that there was no significant difference in failure rate (P=0.754 and 0.829 for short-term and mid-term FU).
In contrast, in long-term FU, modern MBG showed better results in terms of loosening (P <0.001) and revision rates (P = 0.006). We additionally compared two groups, after excluding three studies which included only young adults [14,17,24]. The scatter plot analysis and subgroup analysis according to the FU duration showed the same trend as that of the main analysis (Table 5, fig. 9A and fig. 9B).

Discussion
Although failure rates did not differ significantly between the two glenoid type groups in short-term and mid-term FU, modern PEGs were found to have lower radiolucency, loosening, and failure rates than cemented PEG (P = 0.033, < 0.001, and 0.006, respectively). This is in line with the results obtained from the scatter plot analysis. Also, the gains of FE and ER, Constant score, and ASES score of the modern MBG group were not lower than those of cemented PEG. Taken together, these results show that the modern MBG is comparable to the cemented PEG, with promisingly better results in a few of these aspects.
The trends in outcomes were found to differ between the two groups as the FU duration increased. In the cemented PEG group, loosening and failure rate typically increased as the FU duration increased. In contrast, long-term FU studies were comparable to shortterm FU studies in the modern MBG group. This may be because it is possible that the MBG was stably fixed, and that bony ingrowth was sufficient. If the modern MBG design caused stable fixation and bony ingrowth as the design originally intended, it makes sense that there were some initial failures in the modern MBG group and that the long-term FU results of the modern MBG were better than PEG. Moreover, it is possible that the error occurred due to the small number of studies with a long-term FU on modern MBGs. In order to confirm this conclusion, more long-term FU studies on modern MBGs should be performed.

A previous systematic review by Papadonikolakis and Matsen compared rates of complications and revision surgeries between MBG and PEG. They included all designs of
MBGs up to 2013 in the same group and reported that MBGs showed significantly higher revision rates than PEG [3]. Categorical data such as loosening and revision were analyzed by crosstab analysis as in this study. The review is a well-performed study that has served as a reference for the selection of glenoid components. We tried to increase the credibility of the analytical results by conducting heterogeneity assessments and adjustments that did not appear to be performed in the previous review.
The MBG was designed to induce bone ingrowth using the porous-coated component on the glenoid contact surface, and smooth ROM on the joint surface using the PE component.
These failures were caused by several factors. First, MBG failure is often associated with PE wear, which is often caused by thinner PE thickness in these designs due to the metal back [16]. Second, overstuffing of joints can be induced to ensure sufficient PE thickness, resulting in loosening and rotator cuff tears, which ultimately leads to joint instability.
Third, breakages of rods and screws may occur that are not caused by cemented PEG. This study has several limitations. First, there is no clear global consensus on the distinction between modern design and conventional design. However, a rationale was found through a review article on glenoid components by Castagna and Garofalo [10] and three models were defined as modern designs. Second, there is the possibility of remaining heterogeneity between studies. We thoroughly discussed this point at the research design stage and conducted data analysis and pooling after sufficient distribution analysis and adjustment of bias. Third, studies use different criteria for the definition of radiolucency and loosening in the main outcomes. Here, these were summarized using the most common and objective items as was possible, and credibility was increased by eliminating ambiguous data. Ultimately, the most objective and ultimate outcome indicator is failure or revision rate, and the failure rate results presented in this review suggest that modern MBG is promising. The fourth limitation is the lack of longer-term FU data across all implants. It is especially true in the modern MBG group, leading to the shortened criteria for dividing subgroups (36 and 72 months). This is because many surgeons still prefer cemented PEGs, and the modern designs of MBGs are still quite new.