Birth · Newborn

New Ballard Score (NBS)

The New Ballard Score (Ballard 1991) — 12-item postnatal gestational age assessment combining 6 neuromuscular and 6 physical maturity signs. Accuracy ± 2 weeks; AAP / NRP gold standard when ultrasound dating is unavailable.

Last reviewed 25 May 2026

New Ballard Score — gestational age

Newborn maturity assessment

Each item scored −1 to +5 based on clinical examination of the newborn. Best performed within 12-96 hours of birth.

Neuromuscular maturity (6 items)

Posture
Square window (wrist)
Arm recoil
Popliteal angle
Scarf sign
Heel-to-ear

Physical maturity (6 items — score genitalia as one)

Skin
Lanugo
Plantar surface
Breast
Eye / ear
Genitalia
Estimated GA
20 weeks
Total Ballard score 0  ·  accuracy ± 2 weeks
Educational tool only — not medical advice. NBS is performed by trained NICU staff. This widget shows the scoring framework; actual GA assessment is by an experienced examiner. Used when LMP / first-trimester dating is unavailable (e.g. surrogacy, late presentation, NICU admission from outside).
What does this mean?
The New Ballard Score (Ballard 1991) lets a clinician estimate gestational age from a newborn examination — useful when LMP dating is unreliable or unavailable (late booking, no first-trimester scan, surrogacy, abandoned newborn, infant admitted from outside the system). It uses 6 neuromuscular items (posture, square window, recoil, popliteal angle, scarf, heel-to-ear) and 7 physical maturity items (skin, lanugo, plantar creases, breast bud, eye/ear, genitals). The total maps to a gestational age with accuracy of ± 2 weeks, best when assessed 12–96 h after birth. Ballard’s NBS extension (down to 20 wk) made it usable for extreme preterms. When first-trimester CRL dating is available it ALWAYS overrides Ballard, because ultrasound dating before 14 wk is accurate to ±5–7 days.

Introduction

The New Ballard Score (NBS) is the AAP / NRP gold standard for postnatal gestational age estimation. Developed by Ballard et al. in 1991, it expanded the original 1979 Ballard score to include extremely premature infants and remains widely used in NICU and delivery rooms when antenatal dating is unavailable.

Background

Accurate GA matters for survival prognostication, treatment eligibility (therapeutic hypothermia, surfactant, growth-chart selection), and family counselling. When LMP is uncertain or first-trimester ultrasound was not performed (late presentation, surrogacy, adoption, NICU transfer from elsewhere), postnatal assessment is the next best step.

The 12 items

Neuromuscular maturity (6 items)

  • Posture — degree of limb flexion (extended → fully flexed).
  • Square window (wrist) — angle of wrist flexion (90° → 0°).
  • Arm recoil — speed of return to flexion after extension.
  • Popliteal angle — angle measured at the knee with hip flexed (180° → 90°).
  • Scarf sign — elbow movement across chest when arm is drawn across.
  • Heel-to-ear — flexibility of legs reaching to ears.

Physical maturity (6 items)

  • Skin — sticky / friable (extreme preterm) → cracked / leathery (post-term).
  • Lanugo — absent (extreme preterm or post-term) → abundant (mid-trimester).
  • Plantar surface — heel-toe length and creases.
  • Breast — areola development and bud size.
  • Eye / ear — eyelid fusion (early) → cartilage and recoil (term).
  • Genitals — sex-specific maturation signs.

Scoring and GA conversion

Each item scored from −1 to +5. Sum all 12 items: total score ranges −12 to +60.

Linear conversion (approximate):

GA (weeks) ≈ 20 + ((Total − 10) / 5) × 2

Anchor points:
  Score 10 → 20 wk
  Score 20 → 24 wk
  Score 30 → 28 wk
  Score 40 → 36 wk
  Score 50 → 44 wk

Timing of assessment

  • Before 12 hours: postural and neuromuscular signs still affected by birth — score less reliable.
  • 12-96 hours: optimal window. Most accurate.
  • After 96 hours: peripheral perfusion changes; physical signs evolve with adaptation.
  • For extreme preterm (< 28 wk): best assessed between 12 and 24 hours.

How accurate is it?

± 2 weeks across the 20-44 week range in the original Ballard 1991 validation cohort. Less accurate than first-trimester ultrasound (± 3-5 days) but uniquely useful when no antenatal dating exists.

Limitations

  • Inter-observer variability — trained NICU staff achieve consistency; less experienced examiners may overestimate or underestimate by 2-3 weeks.
  • Less reliable for infants with significant CNS injury (altered tone), congenital syndromes, or severe IUGR (physical maturity disconnected from chronological GA).
  • Should NOT replace first-trimester ultrasound dating when that’s available.
  • Educational widget; actual NBS assessment is by trained clinicians at the cot-side.

Sources

  • Ballard JL, Khoury JC, Wedig K, Wang L, Eilers-Walsman BL, Lipp R. New Ballard Score, expanded to include extremely premature infants. J Pediatr 1991;119:417-23.
  • Ballard JL, Novak KK, Driver M. A simplified score for assessment of fetal maturation of newly born infants. J Pediatr 1979;95:769-74. (Original)
  • American Academy of Pediatrics. Neonatal Resuscitation Program (NRP). 8th edition 2021.
  • ACOG / AAP. Guidelines for Perinatal Care. 8th edition 2017.

Frequently asked questions

What is the New Ballard Score?
Ballard JL et al. (1991) developed an expanded gestational age assessment tool capable of accurately estimating GA from 20 to 44 weeks based on neuromuscular and physical maturity signs. It revised the original 1979 Ballard score to include extremely premature infants. Each of the 12 items is scored −1 to +5; total score correlates linearly with gestational age. Accuracy is ± 2 weeks. Used in NICU and delivery rooms when LMP or first-trimester ultrasound dating is unavailable.
When is NBS most accurate?
Within 12-96 hours of birth. Before 12 hours the postural and neuromuscular signs are still affected by the birth process; after 96 hours peripheral perfusion changes and adaptation alter the physical signs. Outside this window the GA estimate becomes less reliable. For extremely preterm infants (< 28 weeks), assessment is best between 12 and 24 hours of birth.
How do the 12 items contribute?
Six NEUROMUSCULAR items: posture (flexion vs extension), square window (wrist flexibility), arm recoil, popliteal angle, scarf sign, heel-to-ear. Six PHYSICAL items: skin (translucent through cracked/peeling), lanugo (sparse through abundant), plantar surface (smooth through deep creases), breast (flat areola through full areola/5-10mm bud), eye / ear (eyelids fused or pinna recoil), genitals (sex-appropriate maturation). Each item references a maturity diagram in the original publication.
Where does NBS fit alongside LMP and ultrasound dating?
LMP and first-trimester ultrasound are MORE accurate than NBS — ultrasound at 8-13 weeks is accurate to ± 3-5 days. NBS comes into play when dating is unavailable or unreliable: late-presenting pregnancy, surrogacy / adoption, in-utero abandonment, or when LMP is uncertain. ACOG recommends NBS as the gold standard for postnatal GA estimation when ultrasound dating is not available.
How is GA calculated from the score?
Score 10 corresponds to ~20 weeks; each additional 5 points adds 2 weeks. Score 30 ≈ 28 weeks; Score 50 ≈ 44 weeks. The linear approximation we use is: GA (weeks) ≈ 20 + ((Total − 10) / 5) × 2. The original 1991 publication provides a chart that some clinicians still reference; the linear conversion captures it accurately within ± 1 week across the validated range.
Are there other GA assessment tools?
Yes. Capurro score (1978) — Latin American, faster but less validated for extremes. Dubowitz score (1970) — predecessor to Ballard with more items. Eregie's modified Ballard for resource-poor settings. The NBS remains the AAP/NRP gold standard for postnatal GA estimation, particularly in extremely preterm infants.
Why does GA assessment matter postnatally?
Treatment, prognosis, growth-chart selection, and family counselling all depend on accurate GA. A neonate born at 25 vs 27 vs 30 weeks has very different mortality and long-term outcomes — the assessment frames clinical decisions. NICU therapeutic-hypothermia eligibility, ventilation strategy, vaccination timing, and Fenton growth chart use all depend on GA.