- Research article
- Open Access
Validity of the self-reported five-part questionnaire as an assessment of generalized joint hypermobility in early pregnancy
BMC Musculoskeletal Disorders volume 21, Article number: 514 (2020)
The assessment of generalized joint hypermobility is difficult due to differences in classification methods and in the performance of joint mobility assessment. The primary aim was to evaluate the validity of the self-reported five-part questionnaire, 5PQ, for identifying generalized joint hypermobility using the Beighton score as reference test. The secondary aim was to describe how joint angles measured in degrees included in the Beighton score varied in different cut-off levels in the self-reported 5PQ and the Beighton score.
A cross-sectional validity study with a total of 301 women in early pregnancy, mean age of 31 years, were included in the study. The participants answered the self-reported 5PQ before the joint angles were measured. To standardize the joint mobility measurement, a structural protocol was used. The sensitivity, specificity, receiver operating characteristic curve, area under curve, positive- and negative predictive value, positive likelihood ratio and Spearman’s rank correlation between the self-reported 5PQ ≥ 2 and the Beighton score ≥ 5 were used as main outcome measures in the validity analyses. Joint angles, measured in degrees, were calculated with means in relation to different cut-off levels.
There was moderate correlation between the self-reported 5PQ and the Beighton score. The highest combined sensitivity, 84.1%, as well as specificity, 61.9%, was on 5PQ cut-off level ≥ 2, with a 38% false-positive rate, a moderate area under curve, a low positive predictive value and likelihood ratio, and a high negative predictive value. The odds of a self-reported 5PQ, cut-off level ≥ 2, among women with generalized joint hypermobility, Beighton ≥5, was low indicating a low post-test probability. The mean for all joint angles measured in degrees increased with increased cut-off levels, both in the Beighton score and in the self-reported 5PQ. However, there was a significant variation for each cut-off level.
There is uncertainty in identifying generalized joint hypermobility in young women using the self-reported 5PQ with a cut-off level of ≥2 when the Beighton score ≥ 5 is used as the reference test. The strength of the self-reported 5PQ is to rule-out women without generalized joint hypermobility.
Generalized joint hypermobility (GJH) is a collagen phenotype that impacts the entire body and is the main criterion for heritable connective tissue disorders [1,2,3]. The definition is the ability of a group of joints, usually five or more, to move beyond their normal range of motion (ROM). GJH is part of the term joint hypermobility (JH). JH can also be divided into localized Joint Hypermobility (LJH) when JH exists at a single site, peripheral joint hypermobility (PJH), when JH exist in hands or feet or as historical joint hypermobility (HJH), the prior existence of JH in older adults who on clinical assessment have lost their JH [4, 5].
GJH can be asymptomatic but when symptomatic the clinical consequences are joint instability resulting in increased frequency of joint dislocation, subluxations, soft tissue overload, or injuries, defective neuromuscular control with reduced proprioception and balance, muscle weakness, prolonged and widespread musculoskeletal pain, pregnancy-related pelvic girdle pain, skin disorders, impaired effects of local anaesthesia, stomach disorders, and psychiatric disorders [2,3,4, 6,7,8,9,10]. Therefore, assessment of GJH is urged in clinical practice . Although there is currently insufficient evidence, the clinical consequences might be relieved with a combination of physical therapy and pain management [11,12,13].
The prevalence of GJH in the general population is uncertain and seems to vary greatly. GJH is more common among women, in certain ethnic groups and decreases with increasing age body mass and waist circumference. The prevalence also depends on which assessment instruments and cut-off levels are used [14,15,16,17,18]. In Sweden, the prevalence of GJH among adults, mixed gender, assessed with a modification of the Beighton score with a cut-off value of three out of five, has been estimated to about 10% .
There is as yet no consensus on a gold standard for assessing GJH. The Beighton score (BeS) is the most recognised instrument [4, 20] and is used in assessing GJH in hereditary connective tissue disorders, with a cut-off level of 5/9 for adults . The BeS consists of a dichotomous assessment of five joint movements. The BeS is reliable and valid, but with conflicting results in studies due to differing methods and populations [16, 21,22,23,24,25,26]. A noted limitation of the BeS is the lack of assessment of common clinical sites of hypermobility, such as the cervical spine, shoulders, hips, and ankles . However, for patients with a lower Beighton score, the assessment of other joints not included in the Beighton score is recommended , although the importance of these joints in GJH assessment needs further investigation. The Hospital del Mar criteria is a less commonly used instrument that also includes ball-and-socket joints, which might be beneficial in assessment of GJH .
To identify hypermobility, Hakim and Grahame constructed a self-reported instrument in the form of a five-part questionnaire (5PQ) that includes five aspects of past or present hypermobility . The 5PQ correctly identified 84% of individuals with hypermobility with a cut-off level of two positive answers to any of the five questions with a sensitivity and specificity of 83 and 89%, respectively, using the revised Brighton criteria 1998 for Benign joint hypermobility syndrome, BJHS , as the reference test . Previous studies showed that the self-reported 5PQ, cut-off level 2/5, is a valid and a reliable instrument to identify GJH, compared with the BeS with a cut-off level of 4/9 or 5/9, however, with populations that differed with respect to genders and ages [29, 30].
To identify, diagnose, and establish a rehabilitation plan for patients, valid and reliable diagnostic clinical assessment methods and criteria for classifying GJH are essential . However, a recent systematic review stated that validity was insufficiently studied for the joint hypermobility instruments currently used . Since the self-reported 5PQ and the BeS are the instruments predominantly used to identify GJH in clinical settings and in research, it is important to know if they identify GJH in a similar manner. A clinical examination of joints is time-consuming and the self-reported 5PQ offers a more practical way to identify GJH. It is also important to evaluate which cut-off level to use for each instrument.
This study is part of a longitudinal study of women during and after pregnancy, with the overall aim of investigating GJH and its relation to pregnancy-related pain. Therefore, it would be of value to investigate if the self-reported 5PQ for identifying GJH, could easily identify pregnant women with hypermobility.
The primary aim of this study was to evaluate the validity of the self-reported five-part questionnaire for identifying GJH, using the Beighton score as a reference test.
The secondary aim was to describe how the joint angles measured in degrees included in the Beighton score varied in different cut-off levels of the self-reported five-part questionnaire and the Beighton score.
This was a cross-sectional validity study.
The study is part of a longitudinal study of women during and after pregnancy, with the overall aim of investigating GJH and its relation to pregnancy-related pain.
Pregnant women in Sweden have the right to regularly visit a maternal health care centre, free of charge. The centres are operated by the county councils, or are subcontracted by the councils. The centres are staffed by general practitioners, midwives, and administrative personnel.
The majority of the 8029 women attending three maternal health care centres, two in Uppsala (5312) and one in Borlänge (2717), between February 2014 and December 2018, were invited to participate. Information about the study was given by their midwife or via a letter sent home. The women were consecutively recruited. A total of 339 pregnant women were included in the study (101 from Borlänge and 238 from Uppsala). All included women gave written informed consent to participate in the study. The inclusion criterion was duration of gestation less than 15 completed weeks and exclusion criteria were history of joint inflammation, spasticity, joint replacement, musculoskeletal injury in joints relevant for the assessment during the preceding 3 months, and not being fluent in the Swedish language. The duration of gestation was calculated after a midwife estimated date of birth.
Data collection was performed before gestational week 15. The women completed a web-based questionnaire covering age, completed weeks of gestation, BMI, marital status, education, country of birth, previous pregnancies, previous children and tobacco use. The women also completed the 5PQ. The self-reported 5PQ encompasses five aspects regarding past or present information on joint hypermobility. The questions are “Can you now (or could you ever) place your hands flat on the floor without bending your knees?”, “Can you now (or could you ever) bend your thumb to touch your forearm?”, “As a child, did you amuse your friends by contorting your body into strange shapes or could you do the splits?”, “As a child or teenager, did you dislocate your shoulder or kneecap on more than one occasion?” and “Do you consider yourself double-jointed?”. Each positive response adds 1 point, with a total score ranging from 0 to 5 . In this study, all cut-off levels, 1–5, on the self-reported 5PQ were included in the evaluation.
The BeS, used as the reference test , includes five assessments of joint movements: passive dorsiflexion of the fifth metacarpophalangeal joint ≥90 degrees, passive apposition of the thumb to the flexor aspects of the forearm, passive hyperextensions of the elbow and the knee ≥10 degrees, and forward flexion of the trunk with knees straight, with the palms of the hands rested on the floor. The first four are assessed bilaterally. A positive result in any assessment adds 1 point, with a total score ranging from 0 to 9. In this study, a cut-off level of ≥5 was used for identifying GJH according to the 2017 criteria to assess GJH in hereditary connective tissue disorders, with a cut-off level of 5/9 for adults . All joint mobility measurements were performed in accordance with a structured protocol used in a previous study .
Three experienced physiotherapists and one experienced general practitioner working in primary care (termed raters from now on) performed all joint mobility measurements. To standardise performance and ensure similar assessments, the four raters trained un-blinded on three occasions, for a total of 24 h, with 21 people. The protocol, illustrated with photos, described start position, position of goniometer, anatomic landmarks, stabilisation of adjacent structures, and how to perform the test using active or passive movement. Two different goniometers (Medema Brodin, Kista, Sweden, 31 cm and 21 cm with a 180° protractor and movable arms) were used. The shorter goniometer was used for measurement of the fifth finger. Each joint angle was registered to the nearest 1 degree.
All measurements for each participant were performed on the same occasion. The participants filled out the 5PQ and provided their sociodemographic data, without the presence of the rater, before measurement of joint range of motion, ROM, was performed. The raters were blinded to the answers in the self-reported 5PQ during clinical examination.
The rater started each joint mobility assessment by providing the participant with verbal and visual instructions about the test procedure. No warm-up session preceded the test. To measure passive ROM, the rater moved the joint to its end-range position without causing pain. To measure active ROM, the participant moved the joint to its end-range position and the rater asked the participant: “Is this your maximum range of motion?”
Clinimetric properties and criterion validity (concurrent validity) were used to evaluate the validity of the self-reported 5PQ versus the BeS [33, 34]. The BeS is the most commonly used and recognised test for assessing GJH and was therefore used as the reference test [2, 4]. The validity of the self-reported 5PQ was tested versus a cut-off level of ≥5 for the BeS.
Data from the clinical measurements were manually entered into a web-based tool by the raters. The sociodemographic data and the self-reported 5PQ were entered into the web-based tool by the participants. Descriptive statistics were used for sociodemographic data. The normality of the distribution was investigated using graphs.
Spearman’s rho with 95% confidence intervals (CI) was used for evaluation of the correlation between the self-reported 5PQ and the BeS. The optimal combinations of cut-off levels for the population studied, giving the highest combined sensitivity and specificity with 95% confidence intervals (CI) of the self-reported 5PQ versus the BeS, were determined [35, 36].
A receiver operating characteristic (ROC) curve was created to illustrate the proportions of sensitivity and specificity for the test at different cut-off levels, in order to determine the most appropriate cut-off level for the self-reported 5PQ for the studied population. The ROC is based on cross-tabulation of categorical outcomes, where all possible cut-offs were studied. The most appropriate cut-off level was selected from the sensitivity since the 5PQ is used to assess people with GJH and is also used in research as exposure variable.
The area under the ROC curve (AUC) with 95% confidence intervals (CI) was calculated. The highest possible value of AUC is 1.0, and the higher the value, the better a test is at discriminating between positive or negative cases [36, 37].
To define the proportion of participants with a positive self-reported 5PQ who were correctly identified with GJH, a positive predictive value (PPV) was used. To define the proportion of participants with a negative self-reported 5PQ who were correctly identified without GJH, a negative predictive value (NPV) was used. PPV and NPV were calculated with 95% confidence intervals (CI). A value of 100% is considered to be perfect. The PPV and NPV depend on the GJH prevalence and were therefore adjusted with an estimated GJH prevalence of 10% estimated from a previous study within the general population in Sweden [19, 36, 38].
The positive likelihood ratio (LR+) was used, calculated with 95% confidence intervals (CI), to determine how many times more likely a person with GJH was to have a positive self-reported 5PQ than a person without GJH. LR+ above 10 is considered strong evidence predicting the presence or absence of a diagnosis [36, 39].
Fagan’s nomogram was used to illustrate how much a result from a diagnostic test changed the probability that a person had the diagnosis . The post-test probability was calculated using an estimated GJH prevalence of 10% within the general population in Sweden as pre-test probability.
All joint angles included in the BeS were measured in degrees, with means calculated, and related to the different cut-off levels for the self-reported 5PQ and the BeS.
All analyses were carried out using Stata/IC 15.1 (StataCorp LLC, Texas, USA) and R version 3.5.2.
The study was approved by the Regional Ethical Review Board, Box 2110, 750 02 Uppsala, Sweden, dnr 2013/186. Department of Public Health and Caring Sciences, Uppsala University, Uppsala, Sweden.
Table 1 shows sociodemographic data and the distribution of the BeS and 5PQ among included participants. A total of 339 women fulfilled the sociodemographic data, with a mean (SD) age of 31 (4.4) years. Of these, 86.1% were born in Sweden.
Of the 339 participants, 306 completed the self-reported 5PQ, and 64.3% of these answered the self-reported 5PQ before their clinical assessment. Three participants did not fill out the questionnaire. One participant did not answer question 1. Six participants answered, do not remember, on question 1, six participants answered, do not remember, on question 2, eight participants answered, do not remember, on question 3 and six participants answered, do not remember, on question 4. Three participants answered only three out of five questions. These participants where therefore excluded from the analysis.
Of the 339 participants, 333 were assessed using the BeS. Two participants were not assessed. In one participant, the left and right elbow extensions were not assessed, in another, the left thumb was not assessed and in two participants the right thumb were not assessed. A total of 301 participants were included in the validity analyses.
In the self-reported 5PQ the most prevalent positive responses was, “Can you now (or could you ever) place your hands flat on the floor without bending your knees?” (52.5%). The question with fewest positive responses was, “As a child or teenager, did you dislocate your shoulder or kneecap on more than one occasion?” (6.5%).
The frequency of each item in the BeS ranged from 11 to 32%, with the most common abilities being left thumb apposition (32%) and palms to floor (29%).
Table 2 shows the cumulative frequency of total scores for the self-reported 5PQ and the BeS. Among the 306 participants answering the 5PQ, 137 (40.4%) had scores of ≥2 and 76 (22.4%) had scores of ≥3. Of the 333 participants assessed by the BeS, 54 (15.9%) had scores of ≥5.
There was a positive correlation (r = 0.62, p < 0.001, CI 0.53–0.68) between the self-reported 5PQ and the clinically assessed BeS.
After studying the ROC curve, the most appropriate cut-off level for the 5PQ was found to be ≥2, with an AUC of 0.73 (95% CI 0.67–0.79), Fig. 1.
Table 3 shows a cross tabulation result of index test, 5PQ ≥ 2, against reference standard, BeS ≥ 5. Table 4 shows the highest combined sensitivity, 84.1% (95% CI 69.9–93.4), and specificity, 61.9% (95% CI 55.6–67.8). The false-positive rate was 38%. The PPV and NPV ranged between 13.4 and 42.4% and between 90.3 and 99.2%, respectively. The LR+ increased by 2.2 when using a cut-off level of ≥2 for the self-reported 5PQ, which is illustrated with the Fagan’s nomogram in Fig. 2 showing a post-test probability of 19.6%.
Range of motion, measured in degrees, in joints included in the BeS are shown in Tables 5 and 6, presented as means and ranges. The higher the scores on the self-reported 5PQ and in the BeS, the higher the joint ROM except for the thumb apposition, where the joint ROM should decrease. The measured joint angles varied in each cut-off level.
There was a moderate correlation between the self-reported 5PQ and the BeS. The highest values of sensitivity, specificity and an acceptable area under curve were observed on cut-off level ≥ 2 in the 5PQ. The false-positive rate was 38% with low positive predictive value and high negative predictive value. The odds and the post-test probability having generalized joint hypermobility with a positive self-reported five-part questionnaire, using a cut-off level of ≥2, was low.
The mean for all joint angles measured in degrees increased with increased cut-off levels, both in the BeS and in the self-reported 5PQ with a significant variation for each cut-off level.
GJH is a complex clinical condition, for which identification requires a broad clinical assessment. Which joints to include, specific cut-off levels for each included joint, how the measurement of ROM in joints should be performed, and what cut-off levels to use for GJH are all points of an ongoing discussion [15, 16]. In diagnostic accuracy studies, the presence or absence of the condition is ideally determined using a “gold standard” assessment method . There is no gold standard for assessing GJH; therefore, the use of criterion validity and the BeS as a reference test could be discussed. However, the BeS is the instrument most used and since 2017 there has been consensus in assessing GJH in hereditary connective tissue disorders with the BeS. To assess GJH in clinical and research settings, the BeS was therefore considered to be a reasonable “gold standard” .
Depending on which statistical method was used in the present study, the usability of the self-reported 5PQ differed. Using sensitivity and specificity, the ideal situation is that both are high, to maximise the discriminative ability. However, there is a certain trade-off between them. This is related to the severity of the diagnosis, whether the test should detect or exclude a disease, and the costs and suffering for the patient if misdiagnosed . A cut-off level of ≥2 for the self-reported 5PQ had the highest combined sensitivity, specificity, and AUC when compared with the BeS cut-off level of ≥5. This is, in agreement with previous studies [29, 30] using a cut-off level of ≥4 and ≥ 5 respectively for the BeS. Although a cut-off level of ≥2 for the self-reported 5PQ demonstrated the highest combined sensitivity and specificity in this study, an associated risk of including a high rate of false positives cannot be eliminated. Furthermore, the sensitivity and specificity say nothing about the probability of presence or absence of GJH, only the test result is known [35, 36]. In clinical practice, it is important to know if a test result predicts the risk of having a diagnosis. Therefore, the assessment of predictive value and LR+ was used [36, 38]. The LR+, illustrated with the Fagan’s nomogram estimates, is rarely used in studies evaluating diagnostic tests .
The prevalence affects the PPV and NPV and our results are only useful for studies with the same population and prevalence . There are no new, large population studies in Sweden, so the estimated GJH prevalence of 10% that was used here was based on a study from 1993 . The prevalence of GJH depends on gender, age, ethnicity, how the test is performed, as well as the cut-off values used, which will differ between studies. The PPV decreases and the NPV increases if the prevalence is low [36, 38]. However, when calculating the PPV and the NPV, different prevalence values were tested (15 and 20%) and neither the PPV nor the NPV changed significantly.
Ideally, a high sensitivity is required for a test assessing people with GJH, or if used as an exposure variable in research, however, including a low false-positive rate and a high positive predictive value.
Possible advantages of the self-reported 5PQ are that it increases the ability to identify individuals with joint hypermobility, it is easy to use, it does not rely entirely on specific joints, and the questions take into account a possible history of joint hypermobility . However, it is important to distinguish between identifying joint hypermobility using a questionnaire and assessing it clinically with regard to the degree of joint mobility and pathology in multiple joints. A more thorough assessment of joint hypermobility is suggested to identify which type of JH, e.g. GJH and to set up a rehabilitation and treatment plan [4, 12]. The assessment of the presence and degree of joint hypermobility should not be arbitrary.
In this study, the highest positive response rate to a single question in the self-reported 5PQ was to question 1, “Can you now (or could you ever) place your hands flat on the floor without bending your knees?”, which was in accordance with the study by Moraes et al. . In the current study, the difference was 20% when the answers to question 1 were compared in the self-reported 5PQ with the clinical assessment of forward flexion of the trunk with knees straight and the palms of the hands rested on the floor. In this study the prevalence of GJH in the self-reported 5PQ was 40.4% with a cut-off level of ≥2, and 22.4% with a cut-off level of ≥3. Compared with the BeS cut-off level of ≥5, the prevalence of GJH in this study was 15.9%. In a recent study  using a cut-off level ≥ 2 on the self-reported 5PQ and ≥ 5 for the BeS in females with the same age as in the present study the prevalence of GJH was 38.2 and 11.8% respectively, which is in accordance with our study. Compared with the study by Moraes et al.  having included mixed genders and ages, the prevalence rates based on the self-reported 5PQ, with cut-off levels of ≥2 and 3, were 37 and 9.9% respectively . In the study by McKay et al.  age, sex, and waist circumference in adults, were found to be the strongest predictors for joint flexibility. We are of the opinion that by including only women in fertile age, mainly born in Sweden and having normal weight, might not have affected the difference in prevalence found in this study. Differences between answering the 5PQ as opposed to a clinical joint assessment could be that some people had been able to do the test historically, while responding to a questionnaire often results in an overestimation of an individual’s ability. Alternatively, participants might also not have known how to perform the test properly. Finally, the ROM assessed in a questionnaire is subjectively estimated and is not always passively assessed when self-performed. Passive performance of ROM is needed to achieve an end feel, and the adjacent joint should be stabilised. All these aforementioned factors could have affected the prevalence of GJH. Maybe the cut-off value of ≥2 for the self-reported 5PQ, may have been to low for this group and should have been adjusted to at least ≥3. In addition, we did not differentiate between asymptomatic and symptomatic GJH in this study which also could have influenced the prevalence.
As this study is part of a longitudinal study of women during and 9 months after pregnancy, with the overall aim of investigating joint mobility and its relation to pain during pregnancy, the result is representative only for young women. The ROM in joints was measured in early pregnancy. Previous studies, using various methodologies, show no clear evidence that joint mobility increases in early pregnancy [44, 45].
Our ambition was that the clinical assessments would take place after the participants had answered the self-reported 5PQ. This was the case for 64.3% of participants. However, the raters were blinded to the answers when performing the clinical assessments.
The mean angle for all specific joints increased with increased cut-off levels, both in the BeS and in the self-reported 5PQ, which is in accordance with another study . However, the joint angles measured in degrees varied significantly in the different cut-off levels. Because of this variation, there is possibly a need to further study age-and sex specific cut-offs for the specific joint angles measured in degrees that differentiate those with and without GJH. Perhaps those classified as having GJH should be at the extreme, the uppermost 5%, of joint mobility for their population as suggested in the study by Singh et al. . This study confirmed the current cut-off level of ≥5 in the BeS, for young women, although a different joint measurement methodology was used. A further conclusion, in the study by Singh et al. was that when using the BeS when assessing GJH, age- and sex-specific BeS cut-off levels based on the uppermost 5% should be used.
There are several limitations in this study. The first is the use of the BeS as the reference test. However, this study is done pending further research on the assessment of GJH. Another limitation in the study could be the estimated prevalence of GJH in the population studied, as mentioned above. Diagnostic accuracy should be tested on a population including those with both mild and severe abnormality in regard to what the instrument is supposed to measure, in this case GJH . In the studied population, 10.6% had scores of ≥4 for the self-reported 5PQ and 7.1% had scores of ≥7 for the BeS. This might be too low a prevalence of severe GJH, which may have affected the result.
Another limitation of this study was that the Swedish translation of the self-reported 5PQ was not scientifically validated as were performed in the studies by Moraes et al. and Glans et al. [29, 30]. In this study the 5PQ was translated by a native English-speaking physician, who is also fluent in the Swedish language and familiar with the terminology. The translation was discussed until a consensual version was achieved together with the co-workers of the project. However, after the clinical assessments were completed, it emerged that there was some uncertainty in relation to question 2: “Can you now (or could you ever) bend your thumb to touch your forearm?” Some participants answered the question negatively, even though they had a positive clinical test, implicating a lack of understanding on how to perform the test. This is in line with the results in the studies by Moraes et al. and Glans et al. [29, 30]. To promote understanding, the question should be rewritten or illustrated, as in the study by Moraes et al. . Additionally, in this study, as well as in the studies by Moraes et al. and Glans et al. [29, 30] there was also difficulty when translating the word “double-jointed” in the last question in the questionnaire. In this study, the Swedish word for “agile” was used while Moraes et al. , used the Brazilian Portuguese word for “flexible” and Glans et al.  used the Swedish word for “to have joint hypermobility” in their studies. Overall, our translation did not differ significantly compared with the version of Glans et al., and we believe that it did not affect the prevalence of GJH.
Both previous studies did a test-retest as a first phase in their studies, showing moderate to almost perfect reliability for each item on the 5PQ [29, 30]. This study did not include a test-retest investigation of the 5PQ, which is a limitation since high validity requires high reliability.
The strengths of the study include the use of PPV, NPV, and LR+ to describe the clinical usability of the test. Also, including only young women could be a strength in studies of GJH, as GJH is more prevalent in this group. Another strength of this study was that the measurement of ROM in joints was performed in accordance with a structured protocol and by experienced raters. The same protocol was used in a previous reliability study of the BeS, the Hospital del Mar criteria, and the Contompasis score, which showed good inter- and intra-rater reliability . The validity of a clinical measurement instrument is dependent not only on its validity, but also on the data quality . This depends for example on the experience of the rater and the amount of attention paid by the participant in answering the questionnaire . Although, since 2017 there has been a consensus about assessing GJH in hereditary connective tissue disorders with the BeS [2, 26], it has been performed in many different ways in different studies. How joint ROM is measured is not always clearly described, which hampers comparison of results with other studies and with clinical assessments. However, there is still a need for further discussion about the use of measuring instruments such as goniometers and whether adjacent joints should be stabilized.
GJH should always be considered in the differential diagnosis for patients with widespread pain and soft tissue lesions. It should be easy to screen for joint hypermobility, but the assessment of the degree of joint hypermobility is probably more complex. Until now, clinimetric research has been about assessing joint ROM and angular mobility. The current assessment methods might not be sufficient for determining the severity of joint hypermobility. Further research should also include the assessment of joint instability and joint translation, especially joint gliding in the transverse-or sagital plane, both in the spine and the extremity joints. There is maybe also a need to evaluate if other joints than those included in the BeS should be included in the assessment of GJH. However, a single error-free gold standard is probably not the way to establish presence or absence of GJH.
There is uncertainty in identifying generalized joint hypermobility in young women using the self-reported five-part questionnaire with cut-off level ≥ 2 using the Beighton score cut-off level ≥ 5 as the reference test. The strength of the self-reported five-part questionnaire is that it rules-out women without generalized joint hypermobility.
The mean for all joint angles measured in degrees increased with increased cut-off levels, both in the BeS and in the self-reported 5PQ. There was, however, a significant variation for each cut-off level.
Availability of data and materials
The datasets used and/or analysed in the current study are available from the corresponding author on reasonable request.
Generalised joint hypermobility
Range of motion
Body mass index
Receiver operating characteristic
Area under curve
Positive predictive value
Negative predictive value
Positive likelihood ratio
Grahame R. Hypermobility: an important but often neglected area within rheumatology. Nat Clin Pract Rheumatol. 2008;4(10):522–4.
Malfait F, Francomano C, Byers P, Belmont J, Berglund B, Black J, et al. The 2017 international classification of the Ehlers-Danlos syndromes. Am J Med Genet C Semin Med Genet. 2017;175(1):8–26.
Gazit Y, Jacob G, Grahame R. Ehlers-Danlos syndrome-hypermobility type: a much neglected multisystemic disorder. Rambam Maimonides Med J. 2016;7(4):e0034.
Castori M, Tinkle B, Levy H, Grahame R, Malfait F, Hakim A. A framework for the classification of joint hypermobility and related conditions. Am J Med Genet C Semin Med Genet. 2017;175(1):148–57.
Castori M, Hakim A. Contemporary approach to joint hypermobility and related disorders. Curr Opin Pediatr. 2017;29(6):640–9.
Bulbena A, Gago J, Pailhez G, Sperry L, Fullana MA, Vilarroya O. Joint hypermobility syndrome is a risk factor trait for anxiety disorders: a 15-year follow-up cohort study. Gen Hosp Psychiatry. 2011;33(4):363–70.
Baeza-Velasco C, Pailhez G, Bulbena A, Baghdadli A. Joint hypermobility and the heritable disorders of connective tissue: clinical and empirical evidence of links with psychiatry. Gen Hosp Psychiatry. 2015;37(1):24–30.
Hakim AJ, Grahame R, Norris P, Hopper C. Local anaesthetic failure in joint hypermobility syndrome. J R Soc Med. 2005;98(2):84–5.
Scheper MC, Juul-Kristensen B, Rombaut L, Rameckers EA, Verbunt J, Engelbert RH. Disability in adolescents and adults diagnosed with hypermobility-related disorders: a meta-analysis. Arch Phys Med Rehabil. 2016;97(12):2174–87.
Mogren IM. BMI, pain and hyper-mobility are determinants of long-term outcome for women with low back pain and pelvic pain during pregnancy. Eur Spine J. 2006;15(7):1093–102.
Keer R, Simmonds J. Joint protection and physical rehabilitation of the adult with hypermobility syndrome. Curr Opin Rheumatol. 2011;23(2):131–6.
Engelbert RH, Juul-Kristensen B, Pacey V, de Wandele I, Smeenk S, Woinarosky N, et al. The evidence-based rationale for physical therapy treatment of children, adolescents, and adults diagnosed with joint hypermobility syndrome/hypermobile Ehlers Danlos syndrome. Am J Med Genet C Semin Med Genet. 2017;175(1):158–67.
Simmonds JV, Herbland A, Hakim A, Ninis N, Lever W, Aziz Q, et al. Exercise beliefs and behaviours of individuals with Joint Hypermobility syndrome/Ehlers-Danlos syndrome - hypermobility type. Disabil Rehabil. 2019;41(4):445–55. https://doi.org/10.1080/09638288.2017.1398278.
Remvig L, Engelbert RH, Berglund B, Bulbena A, Byers PH, Grahame R, et al. Need for a consensus on the methods by which to measure joint mobility and the definition of norms for hypermobility that reflect age, gender and ethnic-dependent variation: is revision of criteria for joint hypermobility syndrome and Ehlers-Danlos syndrome hypermobility type indicated? Rheumatology. 2011;50(6):1169–71.
Remvig L, Flycht L, Christensen KB, Juul-Kristensen B. Lack of consensus on tests and criteria for generalized joint hypermobility, Ehlers-Danlos syndrome: hypermobile type and joint hypermobility syndrome. Am J Med Genet A. 2014;164A(3):591–6.
Juul-Kristensen B, Schmedling K, Rombaut L, Lund H, Engelbert RH. Measurement properties of clinical assessment methods for classifying generalized joint hypermobility-a systematic review. Am J Med Genet C Semin Med Genet. 2017;175(1):116–47.
Singh H, McKay M, Baldwin J, Nicholson L, Chan C, Burns J, et al. Beighton scores and cut-offs across the lifespan: cross-sectional study of an Australian population. Rheumatology. 2017;56(11):1857–64.
McKay MJ, Baldwin JN, Ferreira P, Simic M, Vanicek N, Burns J, et al. Normative reference values for strength and flexibility of 1,000 children and adults. Neurology. 2017;88(1):36–43.
Larsson LG, Baum J, Mudholkar GS, Srivastava DK. Hypermobility: prevalence and features in a Swedish population. Br J Rheumatol. 1993;32(2):116–9.
Beighton P, Solomon L, Soskolne CL. Articular mobility in an African population. Ann Rheum Dis. 1973;32(5):413–8.
Bulbena A, Duro JC, Porta M, Faus S, Vallescar R, Martin-Santos R. Clinical assessment of hypermobility of joints: assembling criteria. J Rheumatol. 1992;19(1):115–22.
Ferrari J, Parslow C, Lim E, Hayward A. Joint hypermobility: the use of a new assessment tool to measure lower limb hypermobility. Clin Exp Rheumatol. 2005;23(3):413–20.
Erkula G, Kiter AE, Kilic BA, Er E, Demirkan F, Sponseller PD. The relation of joint laxity and trunk rotation. J Pediatr Orthop B. 2005;14(1):38–41.
Smits-Engelsman B, Klerks M, Kirby A. Beighton score: a valid measure for generalized hypermobility in children. J Pediatr. 2011;158(1):119–23.e1-4.
Junge T, Jespersen E, Wedderkopp N, Juul-Kristensen B. Inter-tester reproducibility and inter-method agreement of two variations of the Beighton test for determining Generalised Joint Hypermobility in primary school children. BMC Pediatr. 2013;13:214.
Juul-Kristensen B, Rogind H, Jensen DV, Remvig L. Inter-examiner reproducibility of tests and criteria for generalized joint hypermobility and benign joint hypermobility syndrome. Rheumatology. 2007;46(12):1835–41.
Hakim AJ, Grahame R. A simple questionnaire to detect hypermobility: an adjunct to the assessment of patients with diffuse musculoskeletal pain. Int J Clin Pract. 2003;57(3):163–6.
Grahame R, Bird HA, Child A. The revised (Brighton 1998) criteria for the diagnosis of benign joint hypermobility syndrome (BJHS). J Rheumatol. 2000;27(7):1777–9.
Moraes DA, Baptista CA, Crippa JA, Louzada-Junior P. Translation into Brazilian Portuguese and validation of the five-part questionnaire for identifying hypermobility. Rev Bras Reumatol. 2011;51(1):53–69.
Glans M, Humble MB, Elwin M, Bejerot S. Self-rated joint hypermobility: the five-part questionnaire evaluated in a Swedish non-clinical adult population. BMC Musculoskelet Disord. 2020;21(1):174.
Mokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL, et al. The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes. J Clin Epidemiol. 2010;63(7):737–45.
Schlager A, Ahlqvist K, Rasmussen-Barr E, Bjelland EK, Pingel R, Olsson C, et al. Inter- and intra-rater reliability for measurement of range of motion in joints included in three hypermobility assessment methods. BMC Musculoskelet Disord. 2018;19(1):376.
Mokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL, et al. The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: an international Delphi study. Qual Life Res. 2010;19(4):539–49.
de Vet HC, Terwee CB, Bouter LM. Current challenges in clinimetrics. J Clin Epidemiol. 2003;56(12):1137–41.
Altman DG, Bland JM. Diagnostic tests. 1: Sensitivity and specificity. BMJ. 1994;308(6943):1552.
Eusebi P. Diagnostic accuracy measures. Cerebrovasc Dis. 2013;36(4):267–72.
Altman DG, Bland JM. Diagnostic tests 3: receiver operating characteristic plots. BMJ. 1994;309(6948):188.
Altman DG, Bland JM. Diagnostic tests 2: predictive values. BMJ. 1994;309(6947):102.
Deeks JJ, Altman DG. Diagnostic tests 4: likelihood ratios. BMJ. 2004;329(7458):168–9.
Fagan TJ. Letter: Nomogram for Bayes theorem. N Engl J Med. 1975;293(5):257.
Terwee CB, Bot SD, de Boer MR, van der Windt DA, Knol DL, Dekker J, et al. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007;60(1):34–42.
Björk J. Praktisk statistik för medicin och hälsa. 1st ed. Liber AB; 2011. p. 249-265.
Hayden SR, Brown MD. Likelihood ratio: a powerful tool for incorporating the results of a diagnostic test into clinical decisionmaking. Ann Emerg Med. 1999;33(5):575–80.
Cherni Y, Desseauve D, Decatoire A, Veit-Rubinc N, Begon M, Pierre F, et al. Evaluation of ligament laxity during pregnancy. J Gynecol Obstet Hum Reprod. 2019;48(5):351–7.
Lindgren A, Kristiansson P. Finger joint laxity, number of previous pregnancies and pregnancy induced back pain in a cohort study. BMC Pregnancy Childbirth. 2014;14:61.
This work was supported by grants provided by the Stockholm County Council (NSV Project), the Aleris Research Fund, the Primary Care Research Fund in Region Uppsala and the County Council of Uppsala. The role of the funders was to finance the researcher / author during the research period. Open access funding provided by Uppsala University.
Ethics approval and consent to participate
Informed written consent was obtained from all participants upon inclusion in the study.
The study was approved by the Regional Ethical Review Board, Box 2110, 750 02 Uppsala, Sweden, dnr 2013/186. Department of Public Health and Caring Sciences, Uppsala University, Sweden.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Schlager, A., Ahlqvist, K., Pingel, R. et al. Validity of the self-reported five-part questionnaire as an assessment of generalized joint hypermobility in early pregnancy. BMC Musculoskelet Disord 21, 514 (2020). https://doi.org/10.1186/s12891-020-03524-7
- Generalised joint hypermobility
- Diagnostic accuracy
- Beighton score
- Five-part questionnaire