Agreement between subjective evaluations and a markerless AI-based gait analysis system during lungeing assessment in traditional racehorses.

Authors: Meistro F, Ralletti M V, Rinnovati R, Spadari A

Journal: Journal of equine veterinary science

DOI: 10.1016/j.jevs.2025.105704 PubMed: 41022272Log in to save

Summary

# Editorial Summary Subjective lameness assessment during lungeing remains a cornerstone of pre-race clinical evaluation in racehorses, yet its reliability has long been questioned—particularly when identifying mild or complex gait asymmetries. Meistro and colleagues compared traditional clinician-based scoring against a markerless artificial intelligence gait analysis system (OAI-MS) in 24 traditional racehorses evaluated at routine pre-race inspections, with inter-observer agreement measurements and 10-day repeatability testing of the AI system to establish consistency. Inter-observer agreement among experienced clinicians was poor to weak (κ = −0.20 to 0.36), whilst agreement between subjective scores and the OAI-MS ranged from slight to moderate (κ = 0.13–0.47), with the AI system demonstrating fair short-term repeatability (κ = 0.43) and notably better concordance for forelimb assessment than hindlimbs. These findings validate what many practitioners have suspected: human evaluation of lungeing gait is inherently variable and potentially unreliable, particularly for subtle lameness. The OAI-MS offers a practical, objective complement to clinical judgment—most useful in borderline cases where clinician opinions diverge or when documenting mild asymmetries for baseline comparison and monitoring, rather than as a replacement for experienced clinical assessment.

Read the full abstract on PubMed

Practical Takeaways

•Subjective lameness assessments during lungeing have limited reliability, especially for mild cases—consider complementary AI gait analysis when clinical agreement is poor or asymmetry is subtle
•AI-based gait analysis (OAI-MS) shows promise as a repeatable, objective tool for routine pre-race inspections and clinical decision-making
•The technology appears more reliable for forelimb evaluation; use additional assessment methods when hindlimb asymmetry is suspected

Key Findings

•Inter-observer agreement for subjective gait evaluation was poor to fair (κ = -0.20 to 0.36)
•Agreement between subjective evaluations and AI-based gait analysis ranged from slight to moderate (κ = 0.13-0.47)
•The OAI-MS demonstrated moderate repeatability at 10-day interval (κ = 0.43), supporting field usability
•Agreement was higher for forelimbs than hindlimbs, with most discrepancies being of low magnitude

Conditions Studied

lamenessgait asymmetrymild lamenesscomplex asymmetry

Related References

Agreement between veterinarians and three objective evaluation systems in naturally occurring equine lameness.

McPeek Jenna L, Menarim Bruno, Sponseller Beatrice, McClendon Margaret, Adam Emma N, Adams Amanda A, Slone Stacy, Page Allen E(2025)Equine veterinary journal

Agreement between subjective gait assessment and markerless video gait-analysis in endurance horses.

de Chiara, Montano, De Matteis, Guidi, Buono, Auletta, Del Prete, Pasolini(2025)Equine veterinary journal

Farriery

Repeatability of subjective evaluation of lameness in horses.

Keegan K G, Dent E V, Wilson D A, Janicek J, Kramer J, Lacarrubba A, Walsh D M, Cassells M W, Esther T M, Schiltz P, Frees K E, Wilhite C L, Clark J M, Pollitt C C, Shaw R, Norris T(2010)Equine veterinary journal

Comparing Inertial Measurement Units to Markerless Video Analysis for Movement Symmetry in Quarter Horses.

Pfau, Landsbergen, Davis, Kenny, Kernot, Rochard, Porte-Proust, Sparks, Takahashi, Toth, Scott(2023)Sensors (Basel, Switzerland)