Investigations of the reliability of observational gait analysis for the assessment of lameness in horses.

Authors: Hewetson M, Christley R M, Hunt I D, Voute L C

Journal: The Veterinary record

DOI: 10.1136/vr.158.25.852 PubMed: 16798953Log in to save

Summary

# Editorial Summary: Observational Gait Analysis and Lameness Assessment Reliability Observational lameness grading remains a cornerstone of equine clinical practice, yet this 2006 investigation raised important questions about the consistency of subjective assessment methods. Hewetson and colleagues had 16 independent observers grade lameness severity in 20 videotaped horses using both a numerical rating scale (NRS) and verbal rating scale (VRS), then analysed inter-observer and intra-observer agreement, correlation patterns, and systematic bias. Whilst both scales demonstrated high correlation coefficients and acceptable inter-observer consistency at around 56–60 per cent agreement with negligible systematic bias, the clinical utility of this finding was limited: agreement between the two scales themselves proved unacceptable for clinical purposes when scores were compared directly, despite significant statistical correlation. These results underscore that subjective gait assessment scales, whilst reliable within themselves, cannot be used interchangeably and are only moderately reliable overall—a finding with considerable implications for practitioners relying on visual lameness grading for diagnosis, monitoring treatment response, or communicating clinical findings across different settings and observers.

Read the full abstract on PubMed

Practical Takeaways

•Do not switch between numerical and verbal lameness scales in your assessments—the tools produce clinically different results despite statistical correlation
•When evaluating lameness on video or in clinical practice, standardize to one rating scale within your operation and be aware that observer agreement is around 56-60%, meaning re-evaluation by a second opinion is valuable
•These moderate reliability findings suggest that subjective gait analysis alone has limitations; combine visual assessment with additional diagnostic tools (flexion tests, imaging, etc.) for clinical decision-making

Key Findings

•Observer agreement was moderate at 56% for numerical rating scale (NRS) and 60% for verbal rating scale (VRS) with high Kendall coefficient of concordance
•Both scales showed high correlation between and within observers with no significant bias among observers' mean scores
•NRS and VRS scores were significantly correlated with each other but differences between scales were clinically unacceptable
•Both rating scales demonstrated only moderate reliability for assessing lameness severity and should not be used interchangeably

Conditions Studied

lameness

Related References

The intra- and inter-assessor reliability of measurement of functional outcome by lameness scoring in horses.

Fuller Catherine J, Bladon Bruce M, Driver Adam J, Barr Alistair R S(2006)Veterinary journal (London, England : 1997)

Comparison of three acute colic pain scales: Reliability, validity and usability.

Sutton G A, Atamna R, Steinman A, Mair T S(2019)Veterinary journal (London, England : 1997)

Repeatability of subjective evaluation of lameness in horses.

Keegan K G, Dent E V, Wilson D A, Janicek J, Kramer J, Lacarrubba A, Walsh D M, Cassells M W, Esther T M, Schiltz P, Frees K E, Wilhite C L, Clark J M, Pollitt C C, Shaw R, Norris T(2010)Equine veterinary journal

Agreement between veterinarians and three objective evaluation systems in naturally occurring equine lameness.

McPeek Jenna L, Menarim Bruno, Sponseller Beatrice, McClendon Margaret, Adam Emma N, Adams Amanda A, Slone Stacy, Page Allen E(2025)Equine veterinary journal

Rater agreement for assessment of equine back mobility at walk and trot compared to quantitative gait analysis.

Spoormakers T J P, Graat E A M, Serra Bragança F M, Weeren P R van, Brommer H(2021)PloS one