Ehlers-Danlos

The Beighton Score Problem

A 1973 epidemiological tool from a survey of Tswana villagers became the gatekeeper for hEDS evaluation. Why it misses adults and what has been proposed to replace it.

The screening tool that decides whether an adult with multi-system connective tissue symptoms gets a hypermobile Ehlers-Danlos evaluation was designed in 1973 to count joints in 1,081 villagers at the foot of the Magaliesburg Hills, sixty miles northwest of Johannesburg. Peter Beighton, Lewis Solomon, and Charles Soskolne published "Articular mobility in an African population" in the Annals of the Rheumatic Diseases that year. The paper was an epidemiological survey of the Tswana people of Phokeng. It was not designed to diagnose anyone with anything. Fifty-three years later, a 9-point checklist derived from that survey is the primary gatekeeper for hEDS evaluation across rheumatology clinics, geneticist intake forms, and the 2017 international diagnostic criteria.

The score asks five things. Can you bend each little finger backward past 90 degrees. Can you lay each thumb flat against the forearm. Can you hyperextend each elbow past 10 degrees. Can you hyperextend each knee past 10 degrees. Can you place your palms flat on the floor with knees straight. Each side counts. The maximum is 9. The standard adult cutoff for generalized joint hypermobility is 5 of 9.

The score is fast, free, and reproducible. It is also a poor instrument for the job it now does.

What the 1973 paper was for

Beighton and Solomon were rheumatologists studying differences in articular mobility across populations. The Tswana cohort was one of several they assessed. Soskolne handled the statistics. The paper reported how often various joint movements turned up in a representative sample of an African rural population. It compared mobility distributions across age and sex. It did not propose a clinical diagnosis. The 9-point composite was a counting device.

The leap from counting device to diagnostic gatekeeper happened in stages. Rodney Grahame and colleagues incorporated the Beighton score into the 1998 Brighton criteria for benign joint hypermobility syndrome, published by the British Society for Rheumatology working group on heritable disorders of connective tissue. The Brighton criteria used the Beighton score as both a major and a minor item, alongside arthralgia, dislocations, skin findings, marfanoid habitus, and other systemic features. In 2017, Marco Castori, Brad Tinkle, Howard Levy, Rodney Grahame, Fransiska Malfait, Alan Hakim, and the rest of the international working group published the current hEDS criteria in the American Journal of Medical Genetics Part C. Generalized joint hypermobility is criterion 1 of three. The instrument used to establish criterion 1 is the Beighton score.

The score that reached the criteria is the same instrument from 1973. The cohort, the joints, the cut points, the structure. None of it was redesigned for the diagnostic role it now performs.

The age-attenuation problem

Joint mobility falls with age. The decline is steep in childhood and continues through adulthood. A 2021 review in Rheumatology International, "The Beighton Score as a measure of generalised joint hypermobility" by Juul-Kristensen and colleagues, summarized the evidence. The probability of meeting the standard adult cutoff drops roughly 5.5% with each year of age. In a sample of 200 adults over 70, no one scored higher than 2 of 9. A 45-year-old with the same connective tissue laxity she had at 20 will routinely score lower because the contributing tissues have stiffened, the joints have remodeled, and prior surgeries or injuries have reduced range of motion at the joints the score measures.

The 2017 criteria attempted to patch this. The cutoff was tiered: 6 of 9 for prepubertal children, 5 of 9 for adults to age 50, 4 of 9 after age 50. The criteria also introduced a workaround called the historical Beighton. If an adult scores one point below her age-and-sex-specific cutoff, she can still meet criterion 1 by answering yes to at least 2 of 5 questions on the self-report instrument developed by Alan Hakim and Rodney Grahame in 2003 and published in the International Journal of Clinical Practice. Could you ever place your palms flat on the floor with knees straight. Could you bend your thumb to touch your forearm. As a child, did you amuse friends by contorting your body. Did you ever dislocate a shoulder or kneecap on more than one occasion. Do you consider yourself double-jointed.

The 5-part questionnaire, validated against the 1998 Brighton criteria, identified about 84% of hypermobile individuals at a cutoff of 2 positive answers. Sensitivity was 84%. Specificity was 89%. It catches some adults the current Beighton score misses.

The historical workaround narrows by one point. It does not solve the underlying problem. An adult with severe multi-system hEDS who scores 2 or 3 because she is 40, has had two shoulder reconstructions, and lifts weights has no path through the official criterion. She does not enter the diagnostic frame. She is told she is not hypermobile.

Joints the score does not count

The Beighton score weights upper limb mobility heavily. Five of nine points come from the hands and elbows. Two come from the knees. One from the spine and hamstrings combined. Major joints implicated in hEDS sit outside the instrument entirely.

Hips. Shoulders. The temporomandibular joint. The cervical spine. The sacroiliac joints. The small joints of the hands beyond the fifth finger and thumb. The ankles. Subtalar mobility. None of them register.

A person with severe shoulder laxity who dislocates a shoulder doing routine reaches, with hip instability that produces a positive log-roll test, with TMJ dysfunction that affects chewing, and with sacroiliac instability that drives chronic low back pain can score 2 of 9 if her elbows do not hyperextend, her thumbs do not reach her forearms, and her palms do not touch the floor. The score sees five small joints out of dozens that matter. The clinical reality of hEDS is multi-joint, multi-system, and frequently centered in the joints the score does not measure.

Manuel Ferrari and colleagues developed the Lower Limb Assessment Score in 2005 to address this in pediatric athletic populations. Verena Meyer and Mark Scheper validated it in adults in BMC Musculoskeletal Disorders in 2017, with a cutoff of 7 of 12 yielding 86% specificity and 68% sensitivity for lower limb hypermobility. Bernd Nicholson and colleagues developed the Upper Limb Hypermobility Assessment Tool, published in the Journal of Bodywork and Movement Therapies, to capture shoulder and elbow mobility the Beighton score does not. These instruments exist. They are not embedded in the criteria.

The validation gap across populations

Hypermobility distribution varies by ancestry. Studies of athletic populations have reported generalized joint hypermobility at 21% in Asian, 23% in African, and 12% in white cohorts using the standard Beighton cutoff. Australian work cited in the Juul-Kristensen review found higher Beighton scores across the lifespan in non-white residents. The original 1973 paper studied 1,081 Tswana adults and children. The score has not been formally restandardized in a way that asks whether the cutoff that defines pathological hypermobility in one ancestry defines pathological hypermobility in another.

The clinical reading of this varies. Some groups argue that higher mobility distributions in some populations are asymptomatic and that the cutoff should hold. Others argue that a single cutoff applied across populations with different baseline distributions will systematically misclassify in both directions. The 2017 criteria do not address this. The instrument is applied uniformly to every adult who walks into a hypermobility clinic, regardless of ancestry, body habitus, or athletic history.

Alternatives

Clinicians frustrated with the Beighton score have several validated alternatives.

The Hospital del Mar criteria, developed by Antonio Bulbena and colleagues at Hospital del Mar in Barcelona and published in the Journal of Rheumatology in 1992, uses 10 items including thumb apposition, metacarpophalangeal extension, elbow and shoulder rotation, hip abduction, patellar mobility, ankle and metatarsophalangeal mobility, knee hyperflexion, and ecchymoses. Inter-rater reliability has been reported with kappa coefficients above 0.81 in pediatric samples. In a comparison study, Beighton at the standard cutoff identified 12% of a population as hypermobile while Hospital del Mar identified 34%. The instruments are not measuring the same thing. Hospital del Mar reaches more of the relevant joints.

The Brighton criteria, the 1998 framework that preceded the 2017 hEDS criteria, retain a place in clinical practice. They include the Beighton score but also weight symptoms: arthralgia, dislocations, soft tissue rheumatism, marfanoid habitus, abnormal skin, eye signs, varicosities, hernia. They allow a diagnosis of joint hypermobility syndrome based on the constellation rather than the score alone. The 2017 criteria narrowed this by demanding the Beighton score as a hard prerequisite.

The Hakim-Grahame 5-part questionnaire is built into the 2017 criteria as the historical workaround. It is a self-report instrument, fast, and validated. Its weakness is that it is gated by the Beighton score in current practice. An adult who fails the Beighton by more than one point cannot use the questionnaire as an alternate path.

The Lower Limb Assessment Score and the Upper Limb Hypermobility Assessment Tool offer joint-specific, validated alternatives that capture mobility the 9-point score misses. They are used in research. They are uncommon in routine clinical practice.

What the gatekeeper costs

The diagnostic odyssey for hEDS averages 10 to 22 years and 15 clinicians. The Beighton score is one of the chokepoints inside that interval. A 35-year-old woman with three subluxing joints, a positive family history, gastrointestinal dysmotility, and postural orthostatic tachycardia who scores 4 of 9 is told she does not meet criteria. The geneticist documents hypermobility spectrum disorder or non-specified joint hypermobility and refers her back to her primary care physician. Her insurance does not cover a multidisciplinary connective tissue evaluation without the diagnostic code. Her physical therapist is not authorized to bill for hEDS-specific protocols. The clinical management does not change much. The access does.

The 2017 working group knew the instrument was imperfect. The historical Beighton was a concession. The narrowed cutoffs were a concession. The companion category of hypermobility spectrum disorder was a concession to the people who clearly have a heritable connective tissue problem and clearly do not score high enough on a 1973 checklist to clear the diagnostic line. The 2023 pediatric framework for joint hypermobility, published in Orphanet Journal of Rare Diseases, was another attempt to escape the limits of the 9-point score for younger populations.

The deeper problem is that hEDS is clinically heterogeneous and the criteria depend on a single physical-exam score from one cohort. Two adults can have the same constellation of dislocations, autonomic dysfunction, GI dysmotility, and skin findings and end up on opposite sides of the diagnostic line based on whether their fifth fingers happen to bend. The score does not know about the rest of the body.

What better assessment looks like

A better diagnostic instrument for hEDS uses three things the 9-point score does not.

The first is longitudinal data. Joint mobility at age 20 is informative for hEDS at age 45 in a way that joint mobility at age 45 alone is not. A diagnostic frame that incorporates historical hypermobility through self-report, prior medical records, family video, and contemporaneous documentation captures the trajectory the current snapshot misses.

The second is a phenotype that crosses systems. The clustering of hypermobile joints with POTS, mast cell activation, GI dysmotility, dental crowding, and pelvic floor dysfunction is the clinical signature of hEDS. A diagnostic framework that weights the multi-system phenotype, with quantified contributions from each system, captures the disorder more faithfully than a single-system count.

The third is genetic anchoring. hEDS is the only EDS subtype in the 2017 classification without a confirmed gene. The Norris Lab at the Medical University of South Carolina runs the largest public gene-discovery effort. When a gene or genes are identified, the diagnostic frame will shift from a clinical checklist to genotype-phenotype correlation. A confirmed molecular finding makes the joint-counting instrument a screening tool again, which is what it was in 1973, rather than a gatekeeper.

The infrastructure that delivers all three is structured longitudinal phenotypic data, contributed at scale, linked to the genome. Every 9-point score measured over a lifetime, every dislocation, every comorbidity diagnosis, every medication response, contributed by tens of thousands of adults across ancestries and ages. That dataset answers the questions the 1973 instrument cannot. Which joints actually predict the multi-system phenotype. Which cutoffs perform across populations. Which historical features matter most. Which genetic variants segregate with which clinical clusters. The instrument that replaces the Beighton score will be built from data that does not yet exist in any single place.

A single physical-exam score from one cohort is the wrong gatekeeper for a clinically heterogeneous condition. The right one is built from the trajectories of the people who live with it.

Read about the diagnostic odyssey as systems failure.