The most relevant diagnostic criteria for developmental dysplasia of the hip: a study of British specialists
BMC Musculoskeletal Disorders volume 17, Article number: 38 (2016)
Developmental dysplasia of the hip (DDH) is the most common orthopaedic disorder in newborns. Despite this considerable variation in practice exists. The aim of this study was to determine the clinical relevance and a ranking order for the diagnostic criteria in DDH amongst paediatric orthopaedic surgeons practicing in the UK.
One hundred members of the British Society of Children’s Orthopaedic Surgery (BSCOS) were asked to rate the importance of 37 criteria useful in the diagnosis of DDH in newborns, using a 10 cm visual analogue scale. We determined the consistency among specialists in rating the criteria with the intraclass correlation coefficient (ICC) and compared the results to a group of international peers.
Ortolani/Barlow tests, asymmetry in abduction ≥20° and a first-degree relative treated for DDH ranked among the top ten. Participants demonstrated poor consistency in rating the 37 criteria (ICC 0.39; 95 % CI 0.29, 0.52), but for clinical examination criteria alone their consistency improved (ICC 0.52; 0.35, 0.75). The importance ratings of members of BSCOS and members of the European Paediatric Orthopaedic Society differed for 15/37 (41 %) criteria (p <0.05).
Members of BSCOS had a preference for criteria relating to clinical examination and ultrasound.
Early recognition of developmental dysplasia of the hip (DDH) associated with better outcomes . Clearly defined, well formulated diagnostic criteria are vital to identify infants needing observation or treatment. This is particularly important during the first 8 weeks of life when there is perhaps greatest uncertainty as to the capacity for spontaneous resolution of abnormal findings of the hip [2–4]. In an attempt to elicit clearly defined, well-formulated diagnostic criteria for DDH in this age group, a Delphi consensus study of paediatric orthopaedic surgeons from 34 countries was conducted . It identified 37 standardized diagnostic criteria for DDH in this age group. Details of criteria are shown in Additional file 1. In the present study we sought to discern the opinions of British paediatric orthopaedic surgeons on these 37 criteria. Specifically, we wanted to determine (1) a ranking order of clinical relevance of these criteria reflecting the opinions of surgeons practicing in the UK, (2) the consistency with which British paediatric orthopaedic surgeons agree about the importance of these criteria and (3) how their opinions compare to a group of international paediatric orthopaedic surgeons .
The study was approved by the institutional review board for Great Ormond Street Hospital and the Institute of Child Health, British Society of Children’s Orthopaedic Surgery (BSCOS) and European Paediatric Orthopaedic Society (EPOS). We surveyed members of BSCOS and presented them with a set of 37 criteria grouped in 4 domains; patient history, clinical examination, ultrasonography, radiography. These had been compiled in an international consensus study . We asked survey participants to rate each criterion on a 10-cm visual analogue scale (VAS) for its relative importance in making the diagnosis of DDH in infants not older than 8 weeks. We defined DDH as a condition requiring either treatment or follow-up with an orthopaedic surgeon. We employed Dillman’s tailored design method for survey design and conduct ; it entails making up to 4 contacts with participants by first-class mail or e-mail, personalized correspondence and additional contacts by telephone or fax.
We surveyed 148 eligible members of BSCOS. All surgeons in this study were specialists who examine and treat infants for DDH as part of their routine practice. We were solely interested in the surgeons’ opinion on each criterion in isolation, rather than in determining how surgeons establish the diagnosis of DDH using combinations of these criteria. As such, we asked the participants to rate each of the 37 criteria irrespective of any other abnormalities. We recognize that this may not reflect how clinicians arrive at a diagnosis; however, because each criterion was rated in isolation we assumed that the relative importance rating would be stable . We compiled the VAS means (a ratio scale measurement of the perceived value on the VAS provided a continuous outcome) for all criteria based on the responses of all members of BSCOS. Based on these VAS means we generated a ranking list and defined the top ten criteria. We compared VAS means to those of members of EPOS using the signed rank test at the 5 % significance level.
We determined the consistency of members of BSCOS in assigning the importance rating to each criterion with the ICC. The concept of consistency is defined as the agreement of two quantitative measurements in settings where neither one is assumed correct . Multiple raters evaluated all of the criteria and the case 2 model according to Shrout and Fleiss  was employed. ICC is interpreted as follows: ≤0.40, poor consistency or large variation in opinion; 0.41 to 0.74, acceptable consistency; and ≥0.75 good consistency . A sample size of 37 items with 148 raters for each criterion achieves 80 % power to detect an ICC of 0.80 under the alternative hypothesis when the ICC under the null hypothesis is 0.69, using an F-test and a 5 % significance level .
68 % (100/148) members of BSCOS responded to the survey. BSCOS members expressed a preference for clinical examination criteria, which constituted 6 of the top ten raking criteria (Table 1). They included Ortolani and Barlow tests, asymmetry in abduction ≥20° and leg length discrepancy. Among the top ten ranked 3 ultrasonographic criteria: sonographically dislocatable hip, α angle <45°, and femoral head displacement >2 mm from the medial aspect of the acetabulum. A first degree relative treated for DDH was the only risk factor ranking in the top ten. The details for the ranking of each criteria by BSCOS members are shown in Table 2.
VAS mean values assigned to individual criteria were largely similar between members of BSCOS and EPOS (Fig. 1). Of the 15 criteria that differed significantly, the largest differences were seen (p <0.001) for the criteria postural foot deformity, torticollis and abduction ≤70°, with members of BSCOS assigning higher mean ratings (Table 3).
Members of BSCOS demonstrated poor consistency in rating the 37 criteria, with an ICC of 0.39 (95 % CI = 0.29, 0.52). Better consistency was found for criteria relating to the clinical examination – the ICC was 0.52 (0.35, 0.75). Members of BSCOS were least consistent in their opinions about the importance of criteria relating to hip ultrasound, with an ICC of 0.25 (0.14, 0.52). Poor consistency was noted also for criteria of patient history (ICC 0.39; 0.23, 0.69) and radiography (ICC 0.31; 0.10, 0.95).
This study determined the opinions of British paediatric orthopaedic surgeons about a set of 37 criteria which have been identified as the most relevant features for diagnosing DDH in the first 8 weeks of life . Because all 37 criteria cannot be equally important, we wanted to delineate those identified as most and least important by British specialist surgeons and determine to what degree their opinions differ compared to specialists from other countries.
We note the potential limitations of this study. As clinical experience and exposure accumulate the symptoms and signs associated with a diagnosis are “chunked” together and not taken in isolation. By asking experts to rank individual criteria we have established the opinions of surgeons and this may not reflect their normal practice. However, individual criteria are important as they can act as a trigger to activate the relevant knowledge.
Surveys are an effective means of evaluating physicians’ attitudes  and evidence suggests that physicians act as they indicate in surveys . The response rate of this survey was 68 %, however, this is reasonable considering that the mean response rate of surveys involving physicians is 54 % .
Members of BSCOS rated historically well-established diagnostic criteria such as the Ortolani test highest and controversial ones such as hip click lowest despite this being a common reason for referral. The opinions of British surgeons were consistent with an international group of paediatric orthopaedic surgeons – the top ten ranking criteria were identical with the exception that breech presentation was not included in the top-ten of the BSCOS panellists. The pattern of importance ratings was almost identical between BSCOS and EPOS (Fig. 1).
The fact that criteria related to the clinical examination were among the highest ranking in this study reflects other studies from the UK about the diagnosis of DDH in early infancy. Clarke et al.  based triage decisions of infants not older than 3 days on Ortolani, Barlow and Galeazzi tests. Talbot et al.  and Price et al. , examining the same age group, used Ortolani and Barlow tests but placed no emphasis on the Galeazzi test. Limited hip abduction was not reported as a diagnostic criterion in either of these 3 large studies. However, limited abduction ≥20° ranked third amongst BSCOS members. In fact, 3 criteria relating to hip abduction ranked top ten in our study; asymmetry in abduction ≥20°, abduction limited to 45° and any asymmetry in abduction. In contrast, a study of infants aged 3 to 10 months highlights the lack of reliability when relying on clinical examination alone, with 46 % of infants without DDH exhibiting a limit to hip abduction . Of note, members of EPOS placed less value on the criterion hip abduction.
In terms of risk factors, members of BSCOS ranked highest family history and breech presentation (Table 2). This is in keeping with current practice: 3 recent studies on screening in DDH [16–18] utilized these 2 criteria to select at-risk patients prompting specialist referrals. In a study of 64 670 births, Talbot et al.  evaluated the incidence of DDH in patients with these 2 risk factors; the incidence was 3.2 % with a family history of DDH and 2.5 % with a breech presentation. Price et al.  examined, amongst others, the risk factors oligohydramnios and foot deformity but did not comment on their associations with DDH, suggesting that they had little value in predicting DDH. This is consistent with our survey these 2 criteria are among the lowest ranking.
While our study showed that members of BSCOS regarded ultrasound criteria as important in general, it also confirmed the ongoing controversy [2, 21] about the nature of ultrasound criteria. Three recent studies about hip screening showed that the ultrasound criteria by which surgeons defined DDH varied in the UK. While one study  utilized criteria based on dynamic ultrasound, another study  relied on the α angle in combination with dynamic criteria, and a third study  relied solely on the α angle. Our survey reflected this controversy – a dislocatable hip seen on dynamic ultrasound and an α angle <45° ranked among the top ten criteria, similar to the opinions of members of EPOS, but also the femoral head coverage as measured by ultrasound was rated highly (Table 2).
In quantifying how consistent members of BSCOS were in rating the 37 criteria we used the ICC. It provides a measure of the extent to which any single member identified at random would compare to any other randomly selected member. Coefficients for judgments on individual patients should reach values of 0.70 to 0.80 . The best value that members of BSCOS reached was 0.52, indicating acceptable agreement about clinical examination criteria. Similar patterns were seen in an international study where paediatric orthopaedic surgeons were most consistent about clinical examination criteria . In contrast, for criteria relating to patient history, ultrasound and radiography, large variations in the opinions of UK surgeons were seen. Members of BSCOS were least consistent about the ultrasonographic criteria; this may be related to the inconsistent evidence in terms of the use of this diagnostic modality. It also reflects current practice in the UK: 3 recent studies about hip screening employed different ultrasonographic criteria in defining DDH [16–18].
The ranking order of criteria generated in this survey offers information for clinicians in primary and secondary care about the opinions of expert diagnosticians. Clinicians can determine how their personal preferences for diagnostic criteria differ from those experts. Such a comparison may reassure clinicians that their practice is mirrored by others, or, if not, can provide a basis for reconsidering their practice.
Developmental dysplasia of the hip
British society of children’s orthopaedic surgery
Intraclass correlation coefficient
European paediatric orthopaedic society
Visual analogue scale
Weinstein SL, Mubarak SJ, Wenger DR. Developmental hip dysplasia and dislocation: Part II. Instr Course Lect. 2004;53:531–42.
Roposch A, Moreau NM, Uleryk E, Doria AS. Developmental dysplasia of the hip: quality of reporting of diagnostic accuracy for US. Radiology. 2006;241:854–60.
Roposch A, Wright JG. Increased diagnostic information and understanding disease: uncertainty in the diagnosis of developmental hip dysplasia. Radiology. 2007;242:355–9.
Woolacott NF, Puhan MA, Steurer J, Kleijnen J. Ultrasonography in screening for developmental dysplasia of the hip in newborns: systematic review. BMJ. 2005;330:1413.
Roposch A, Liu LQ, Hefti F, Clarke NM, Wedge JH. Standardized diagnostic criteria for developmental dysplasia of the hip in early infancy. Clin Orthop Relat Res. 2011;469:3451–61.
Roposch A, Liu LQ, Protopapa E. Variations in the use of diagnostic criteria for developmental dysplasia of the hip. Clin Orthop Relat Res. 2013;471:1946–54.
Dillman DA. Mail and internet surveys: the tailored design method. 2nd ed. New York: Wiley; 2000.
Graham B, Regehr G, Wright JG. Delphi as a method to establish consensus for diagnostic criteria. J Clin Epidemiol. 2003;56:1150–6.
Newman-Toker DE, Pronovost PJ. Diagnostic errors--the next frontier for patient safety. JAMA. 2009;301:1060–2.
Shrout PE, Fleiss JL. Intraclass correlations: Uses in assessing rater reliability. Psychol Bulletin. 1979;86:420–7.
Portney LG, Watkinks MP. Statistical measures of reliability. In: Foundations of clinical research: applications in practice. 2nd ed. Princeton: Princteon-Hall; 2000.
Walter SD, Eliasziw M, Donner A. Sample size and optimal designs for reliability studies. Stat Med. 1998;17:101–10.
Epstein AM, McNeil BJ. Relationship beliefs and behavior in test ordering. Am J Med. 1986;80:865–70.
Moskowitz AJ, Kuipers BJ, Kassirer JP. Dealing with uncertainty, risks, and tradeoffs in clinical decisions: a cognitive science approach. Ann Intern Med. 1988;108:435–49.
Kellerman SE, Herold J. Physician Response to Surveys: A Review of the Literature. Am J Prev Med. 2001;20:61–7.
Clarke NMP, Reading IC, Corbin C, Taylor CC, Bochmann T. Twenty years’ experience of selective secondary ultrasound screening for congenital dislocation of the hip. Arch Dis Child. 2012;97:423–9.
Talbot CL, Paton RW. Screening of selected risk factors in developmental dysplasia of the hip: an observational study. Arch Dis Child. 2013;98:692–6.
Price KR, Dove R, Hunter JB. The use of X-ray at 5 months in a selective screening programme for developmental dysplasia of the hip. J Child Orthop. 2011;5:195–200.
Castelein RM, Korte J. Limited hip abduction in the infant. J Pediatr Orthop. 2001;21:668–70.
Price KR, Dove R, Hunter JB. Current screening recommendations for developmental dysplasia of the hip may lead to an increase in open reduction. Bone Joint J. 2013;95-B:846–50.
Sucato DJ, Johnston CE, Birch JG, Herring JA, Mack P. Outcome of ultrasonographic hip abnormalities in clinically stable hips. J Pediatr Orthop. 1999;19:754–9.
Bland JM, Altman DG. Cronbach's alpha. BMJ. 1997;314:572.
All phases of this study were supported by grants from BUPA foundation and Great Ormond Street Hospital Children’s Charity. We would like to thank all of the BSCOS surgeons who contributed to this study.
The authors declare that they have no competing interests.
DW: data analysis, interpretation of data, drafting and revisions of the manuscript. KS: data collection, critical appraisal and interpretation of data and manuscript. EP: data collection and statistical analysis, drafting and critical revision of the manuscript. JH: interpretation of data, critical review of manuscript. AR: conception and design of study, supervision of statistical analysis & drafting of manuscript; critical revisions of manuscript. All authors read and approved the manuscript.
About this article
Cite this article
Williams, D., Protopapa, E., Stohr, K. et al. The most relevant diagnostic criteria for developmental dysplasia of the hip: a study of British specialists. BMC Musculoskelet Disord 17, 38 (2016). https://doi.org/10.1186/s12891-016-0867-4