Annotation of the Protein Coding Regions of the Equine Genome.

Authors: Hestand Matthew S, Kalbfleisch Theodore S, Coleman Stephen J, Zeng Zheng, Liu Jinze, Orlando Ludovic, MacLeod James N

Journal: PloS one

DOI: 10.1371/journal.pone.0124375 PubMed: 26107351Log in to save

Summary

# Editorial Summary The equine genome annotation has historically relied on computational predictions and comparison with other species, leaving substantial gaps in our understanding of horse-specific gene expression. Researchers sequenced messenger RNA from 43 different equine tissues to build a comprehensive map of 68,594 transcripts and identify 301,829 genetic variations within protein-coding regions, revealing that 780 of these variants actually represent errors in the reference genome sequence itself. By cross-referencing these findings with whole-genome sequencing data, the team confirmed these genomic discrepancies and simultaneously uncovered previously unrecognised genetic diversity across equine tissues. For practitioners, this improved annotation resource enables more accurate interpretation of genetic tests, better understanding of breed-specific disease susceptibilities, and stronger foundations for performance and health-related genomic research. This work substantially upgrades the quality of equine genetic information available for both clinical diagnostics and comparative genomic studies across species.

Read the full abstract on PubMed

Practical Takeaways

•This genomic resource improves the accuracy of gene identification and variant calling for equine research, supporting more reliable biomarker discovery for health and disease.
•The corrected reference genome enhances cross-species comparisons, making equine models more valuable for understanding conserved biological mechanisms.
•Better annotation of the equine genome will accelerate development of genetic tests for inherited conditions and performance traits.

Key Findings

•Sequencing of mRNA from 43 different equine tissues yielded structures for 68,594 transcripts with experimental evidence.
•Identified 301,829 SNP or indel positions within transcripts relative to the EquCab2 reference genome.
•Found 780 variants that extend open reading frames, indicating small errors in the equine reference genome sequence.

Related References

The transcriptome of equine peripheral blood mononuclear cells.

Pacholewska Alicja, Drögemüller Michaela, Klukowska-Rötzler Jolanta, Lanz Simone, Hamza Eman, Dermitzakis Emmanouil T, Marti Eliane, Gerber Vincent, Leeb Tosso, Jagannathan Vidhya(2015)PloS one

Analysis of unannotated equine transcripts identified by mRNA sequencing.

Coleman Stephen J, Zeng Zheng, Hestand Matthew S, Liu Jinze, Macleod James N(2013)PloS one

Whole genome detection of sequence and structural polymorphism in six diverse horses.

Al Abri Mohammed Ali, Holl Heather Marie, Kalla Sara E, Sutter Nathan B, Brooks Samantha A(2020)PloS one

Applied equine genetics.

Finno C J, Bannasch D L(2014)Equine veterinary journal

Equine clinical genomics: A clinician's primer.

Brosnahan M M, Brooks S A, Antczak D F(2010)Equine veterinary journal