Unexpected structural features of the equine major histocompatibility complex

In many ways, alignment of the ELA sequence with HLA demonstrated the expected organizational similarity and conserved synteny (Gustafson et al., 2003). Sequence alignments also confirmed earlier mapping studies that found ELA class I sequences distributed in three clusters, including two class I loci within the class II region (Gustafson et al., 2003; Tallmadge et al., 2005). However, noteworthy differences were also detected. One striking result from the initial assembly of the ELA and extended regions from the equine genome project was that the region spans approximately 6.0 Mb, almost 1.3 Mb larger than the human HLA and extended regions. Furthermore, annotation of the ELA sequence identified 40 class I loci, many more than expected from serology, analysis of BAC clones, and gene expression studies (Ellis et al., 1995; Tallmadge et al., 2005, 2010).

Closer examination of the assembled ELA sequence revealed that the increased size of ELA is largely due to two features apparently unique to the MHCs of horses when compared with other sequenced mammalian taxa. One feature is a gene desert of approximately 550 kb at the boundary of the class II and class III regions, at coordinates ECA20: 31,896,104..32,442,400 (Figure 5.1). This region contains a single annotated gene, C6orf10, also called testis-specific binding protein (TSBP), and two pseudogenes, one related to a tetraspanin 17-like sequence and the other to a Spi-c-like transcription factor. The chromosomal position of ELA C6orf10 and its sequence homology to human C6orf10 indicate that these are orthologous loci. Alignment of the balance of the ELA desert sequence with whole genomic sequences of other species identified no comparable sequence at any location in any other species.
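As a quick check on the stated size, the span of the gene desert can be computed directly from its coordinates. The sketch below assumes the coordinates are 1-based and inclusive (the text does not state the convention), and the function name `span_bp` is illustrative only.

```python
def span_bp(start, end):
    """Length in bp of a 1-based, inclusive interval (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Gene desert coordinates on ECA20 as given in the text.
desert_start, desert_end = 31_896_104, 32_442_400
desert_bp = span_bp(desert_start, desert_end)
desert_kb = desert_bp / 1000

print(f"gene desert: {desert_bp} bp (~{desert_kb:.0f} kb)")
```

Under that assumption the interval is about 546 kb, which agrees with the rounded figure of approximately 550 kb.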

The second distinctive genomic feature in ELA is a large and strikingly conserved segmental duplication of at least 11 units, each about 45 kb in size, at the boundary of the ELA class I and class III regions (Figure 5.2) (Brinkmeyer-Langford et al., 2010). Each unit in the segmental duplication contains one sequence related to a truncated form of the B-associated transcript 1 (Bat1) that aligns to the C-terminal portion of the helicase domain, and a second sequence with strong homology to class I sequences. The Bat1 and class I sequences are regularly interspersed and arrayed largely in head-to-tail arrangement throughout the segmental duplication. The Bat1 sequences are extraordinarily conserved within Equus caballus and among other Perissodactyl species (Brinkmeyer-Langford et al., 2010), arguing for some functional role(s) for these sequences. Twenty-four genes are predicted from the version 2.0 assembly to be contained within the segmental duplication. Three closely related sequences are chromosomally unassigned in the 2.0 assembly, indicating that the segmental duplication may include 14 repeating units and 30 annotatable genes.

Figure 5.2 Dotplot of repeat-masked sequence in the ELA segmental duplication region on ECA20 (NW_001867389.1; positions 30,600,000–31,350,000 bp) aligned against itself using the bl2seq option of the NCBI BLAST website. At least 11 regions contain related sequences generally aligned head to tail. Three additional closely related sequences are found in the ChrUn database of the horse, indicating that 14 repeat units may reside within the segmental duplication.
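The idea behind a self-against-self dotplot can be illustrated with a much simpler technique than the BLAST alignment used for the figure. The sketch below is not the bl2seq procedure: it uses exact k-mer matching on a short toy sequence (not equine sequence), and the function names are hypothetical. Head-to-tail repeats appear as matches on diagonals parallel to the main diagonal, and the most frequent diagonal offset approximates the repeat-unit length.

```python
from collections import Counter, defaultdict


def kmer_self_matches(seq, k):
    """Return (i, j) pairs, i < j, where the k-mers starting at i and j are identical."""
    positions = defaultdict(list)
    for i in range(len(seq) - k + 1):
        positions[seq[i:i + k]].append(i)
    matches = []
    for starts in positions.values():
        for a in range(len(starts)):
            for b in range(a + 1, len(starts)):
                matches.append((starts[a], starts[b]))
    return matches


def dominant_offset(matches):
    """Most frequent diagonal offset (j - i) among the off-diagonal matches."""
    offsets = Counter(j - i for i, j in matches)
    return offsets.most_common(1)[0][0]


# Toy data: a 20-bp unit repeated four times head to tail, with short flanks.
unit = "ACGTTGCAAGCTTAGGCATC"
toy = "GGGG" + unit * 4 + "CCCC"

matches = kmer_self_matches(toy, k=8)
period = dominant_offset(matches)
print(f"dominant diagonal offset: {period} bp")
```

For real genomic sequence, exact k-mer matching would miss diverged copies, which is one reason an alignment tool is the more appropriate choice for a region of this size.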


The remarkable conservation of sequences within the segmental duplication among different Perissodactyl MHCs suggests that this region may contain functional genes. Evidence for gene expression has been sought using reverse transcription PCR (RT-PCR) and chromatin immunoprecipitation sequencing (ChIP-seq).


We used locus-specific RT-PCR to seek transcripts from nine of the class I genes and four of the Bat1-like genes in the segmental duplication (Table 5.1; Brinkmeyer-Langford et al., 2010). Sequence similarities precluded design of locus-specific primers for the remaining predicted genes. Transcripts for five of the nine class I sequences were successfully amplified from peripheral white blood cells of at least some of the horses tested, while no transcripts were detected for any of the four tested Bat1-like sequences. Transcripts from the full-length Bat1 sequence were identified as expected. These results indicate that at least some of the class I genes within the segmental duplication are transcriptionally active, including two genes previously identified (Ellis et al., 1995; Tallmadge et al., 2005). Overall, it seems that most of the predicted genes within the segmental duplication are not expressed as mRNAs, although some of these genes may have tissue-specific expression profiles not assessed by these studies.

Table 5.1 Summary of predicted genes and transcripts within the segmental duplication interrogated for gene expression by H3K4me3 ChIP-seq and reverse transcription PCR.


Chromatin Modifications Associated with Transcription

Another approach to identifying regions of transcribed DNA is chromatin immunoprecipitation (ChIP), in which the DNA bound to nucleosomes is immunoprecipitated with antibodies specific for histone modifications that are predictive of open chromatin and is then sequenced. Using unpublished data graciously provided by S. Dindot and N. Cohen of Texas A&M University, we examined the gene desert and segmental duplication regions of ELA in whole-genome ChIP-seq data from anti-H3K4me3-immunoprecipitated chromatin of neutrophils obtained from a newborn foal. Histone 3 trimethylated at lysine 4 (H3K4me3) is a histone modification predictably associated with the 5′ transcribed regions of actively expressed genes in higher eukaryotes (Santos-Rosa et al., 2002; Schneider et al., 2004) and is useful as a surrogate for identifying expressed genes.
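One simple way to compare how strongly two genomic intervals are represented in ChIP-seq data is to count aligned reads per kilobase of interval. The sketch below illustrates that general idea only; it is not the analysis pipeline used for the data described here. The read positions are hypothetical toy values, and `reads_per_kb` is an illustrative name.

```python
from bisect import bisect_left, bisect_right


def reads_per_kb(sorted_starts, start, end):
    """Reads whose start position lies in [start, end] (inclusive), per kb of interval."""
    n_reads = bisect_right(sorted_starts, end) - bisect_left(sorted_starts, start)
    return n_reads / ((end - start + 1) / 1000)


# Hypothetical read start positions on one chromosome (toy data, not real ChIP-seq).
toy_reads = sorted([1_050, 1_200, 1_210, 1_800, 1_950, 5_100, 9_999])

dense = reads_per_kb(toy_reads, 1_001, 2_000)    # 5 reads in a 1-kb interval
sparse = reads_per_kb(toy_reads, 5_001, 10_000)  # 2 reads in a 5-kb interval
print(f"dense: {dense:.1f} reads/kb, sparse: {sparse:.1f} reads/kb")
```

A real comparison would also need to account for sequencing depth, mappability in repeated sequence, and an input control, none of which this sketch attempts.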

Representation of DNA sequences aligned to the gene desert coordinates (ECA20: 31,896,104..32,442,400) revealed low levels of H3K4me3-captured sequences over the entire span of the gene desert, consistent with the prediction of few or no expressed genes in this region of ELA. In contrast, the region of the segmental duplication (ECA20: 30,600,000..31,336,000) was highly enriched for H3K4me3-bound sequences, suggesting an abundance of actively transcribed DNA. Twenty-four predicted genes are located in the ELA segmental duplication, and 17 of these sequences were located in regions that were moderately to highly represented in anti-H3K4me3 immunoprecipitates. These peaks corresponded to the locations of 13 predicted genes (Ensembl). Of the seven regions highly enriched in H3K4me3, four (57%) were also positive for RT-PCR transcripts. Of the six predicted genes in regions of low H3K4me3, none was detected as a transcript by RT-PCR. A summary of the evidence for gene expression in the segmental duplication is presented in Table 5.1.
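The agreement between the two lines of evidence can be tallied from the counts stated above. This is a minimal sketch that uses only those reported counts; the dictionary layout and the function name are illustrative.

```python
# Counts as reported in the text: RT-PCR results grouped by H3K4me3 enrichment.
rt_pcr_by_h3k4me3 = {
    "high": {"tested": 7, "transcript_detected": 4},
    "low": {"tested": 6, "transcript_detected": 0},
}


def percent_detected(group):
    """Percentage of tested genes in a group with an RT-PCR transcript."""
    return 100 * group["transcript_detected"] / group["tested"]


for level, group in rt_pcr_by_h3k4me3.items():
    print(
        f"{level} H3K4me3: {group['transcript_detected']}/{group['tested']} "
        f"({percent_detected(group):.0f}%)"
    )
```

With so few genes in each group, these percentages are descriptive only.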


As more detailed analyses of the vertebrate MHC become available, the picture emerging is of an organizationally constrained but structurally dynamic region evolving primarily by recombination and gene conversion. These processes act to sort and reshuffle combinations of genes that have been strongly selected for over evolutionary time frames. The early paradigm that mammalian MHCs were predictably arranged by gene content into three regions consisting of class I and class II genes flanking a class III region is proving to be overly simplistic. The disruption of the class II region of the ruminant MHC (Childers et al., 2006) and less dramatic rearrangements in the MHCs of cat (FLA) and dog (DLA) (Yuhki et al., 2007) promise to provide insights into the evolutionary processes at work in this important region of the genome. The unusual features now known to characterize the Perissodactyl MHCs add significantly to the list of diversifying structural changes present in the vertebrate MHC. Access to research material from a common domestic animal such as the horse, with deep pedigrees and well-developed formal biology, provides great potential to help unravel the puzzling structural and functional properties of the MHC.


Antczak, D. F., & Allen, W. R. (1989). Maternal immunological recognition of pregnancy in equids. Journal of Reproduction & Fertility, 37(Suppl.), 69–78.

Bailey, E. (2010). Foreword: Horse genomics and the Dorothy Russell Havemeyer Foundation. Animal Genetics, 41(Suppl. 2), 1.

Brinkmeyer-Langford, C. L., Murphy, W. J., Childers, C. P., & Skow, L. C. (2010). A conserved segmental duplication within ELA. Animal Genetics, 41(Suppl. 2), 186–195.

Childers, C. P., Newkirk, H. L., Honeycutt, D. A., Ramlachan, N., Muzney, D. M. et al. (2006). Comparative analysis of the bovine MHC class IIb sequence identifies inversion breakpoints and three unexpected genes. Animal Genetics, 37(2), 121–129.

Chowdhary, B. P., Raudsepp, T., Honeycutt, D., Owens, E. K., Piumi, F. et al. (2002). Construction of a 5000(rad) whole-genome radiation hybrid panel in the horse and generation of a comprehensive and comparative map for ECA11. Mammalian Genome, 13(2), 89–94.

Chowdhary, B. P., Raudsepp, T., Kata, S. R., Goh, G., Millon, L. V. et al. (2003). The first-generation whole-genome radiation hybrid map in the horse identifies conserved segments in human and mouse genomes. Genome Research, 13(4), 742–751.

Crump, A., Donaldson, W. L., Miller, J., Kydd, J. H., Allen, W. R., & Antczak, D. F. (1987). Expression of major histocompatibility complex (MHC) antigens on horse trophoblast. Journal of Reproduction & Fertility, 35(Suppl.), 379–388.

de Bakker, P. I., McVean, G., Sabeti, P. C., Miretti, M. M., Green, T. et al. (2006). A high-resolution HLA and SNP haplotype map for disease association studies in the extended human MHC. Nature Genetics, 38(10), 1166–1172.

Donaldson, W. L., Zhang, C. H., Oriol, J. G., & Antczak, D. F. (1990). Invasive equine trophoblast expresses conventional class I major histocompatibility complex antigens. Development, 110(1), 63–71.

Doan, R., Cohen, N. D., Sawyer, J., Ghaffari, N., Johnson, C. D., & Dindot, S. V. (2012). Whole-genome sequencing and genetic variant analysis of a quarter horse mare. BMC Genomics, 13, 78.

Guérin, G., Bailey, E., Bernoco, D., Anderson, I., Antczak, D. F. et al. (1999). Report of the International Equine Gene Mapping Workshop: Male linkage map. Animal Genetics, 30(5), 341–354.

Guérin, G., Bailey, E., Bernoco, D., Anderson, I., Antczak, D. F. et al. (2003). The second generation of the International Equine Gene Mapping Workshop half-sibling linkage map. Animal Genetics, 34(3), 161–168.

Gustafson, A. L., Tallmadge, R. L., Ramlachan, N., Miller, D., Bird, H. et al. (2003). An ordered BAC contig map of the equine major histocompatibility complex. Cytogenetic and Genome Research, 102(1–4), 189–195.

Harris, E. E., & Meyer, D. (2006). The molecular signature of selection underlying human adaptations. American Journal of Physical Anthropology, 43(Suppl.), 89–130.

Hedrick, P. W. (1998–1999). Balancing selection and MHC. Genetica, 104(3), 207–214.

Klein, J. (1987). Origin of major histocompatibility complex polymorphism: The trans-species hypothesis. Human Immunology, 19, 155–162.

Lazary, S., Antczak, D. F., Bailey, E., Bell, T. K., Bernoco, D. et al. (1988). Joint Report of the Fifth International Workshop on Lymphocyte Alloantigens of the Horse, October 31–November 1987, Baton Rouge, Louisiana. Animal Genetics, 19(4), 447–456.

Lie, B. A., & Thorsby, E. (2005). Several genes in the extended human MHC contribute to predisposition to autoimmune diseases. Current Opinion in Immunology, 17(5), 526–531.

McCue, M. E., Bannasch, D. L., Petersen, J. L., Gurr, J., Bailey, E. et al. (2012). A high density SNP array for the domestic horse and extant perissodactyla: Utility for association mapping, genetic diversity, and phylogeny studies. PLoS Genetics, 8(1), e1002451.

McGuire, K. L., Duncan, W. R., & Tucker, P. W. (1985). Syrian hamster DNA shows limited polymorphism at class I-like loci. Immunogenetics, 22(3), 257–268.

Miyata, T., Yasunaga, T., & Nishida, T. (1980). Nucleotide sequence divergence and functional constraint in mRNA evolution. Proceedings of the National Academy of Sciences USA, 77(12), 7328–7332.

Nizetić, D., Stevanović, M., Soldatović, B., Savić, I., & Crkvenjakov, R. (1988). Limited polymorphism of both classes of MHC genes in four different species of the Balkan mole rat. Immunogenetics, 28(2), 91–98.

Ohta, Y., Okamura, K., McKinney, E. C., Bartl, S., Hashimoto, K., & Flajnik, M. F. (2000). Primitive synteny of vertebrate major histocompatibility complex class I and class II genes. Proceedings of the National Academy of Sciences USA, 97(9), 4712–4717.

Perrocheau, M., Boutreux, V., Chadi, S., Mata, X., Decaunes, P. et al. (2006). Construction of a medium-density horse gene map. Animal Genetics, 37(2), 145–155.

Raudsepp, T., Gustafson-Seabury, A., Durkin, K., Wagner, M. L., Goh, G. et al. (2008). A 4,103 marker integrated physical and comparative map of the horse genome. Cytogenetic and Genome Research, 122(1), 28–36.

Santos-Rosa, H., Schneider, R., Bannister, A. J., Sherriff, J., Bernstein, B. E. et al. (2002). Active genes are trimethylated at K4 of histone H3. Nature, 419, 407–411.

Sato, A., Figueroa, F., Murray, B. W., Málaga-Trillo, E., Zaleska-Rutczynska, Z. et al. (2000). Nonlinkage of the major histocompatibility complex of class I and class II loci in bony fishes. Immunogenetics, 51, 108.

Schneider, R., Bannister, A. J., Myers, F. A., Thorne, A. W., Crane-Robinson, C., & Kouzarides, T. (2004). Histone H3 lysine 4 methylation patterns in higher eukaryotic genes. Nature Cell Biology, 6(1), 73–77.

Shiue, Y. L., Bickel, L. A., Caetano, A. R., Millon, L. V., Clark, R. S. et al. (1999). A synteny map of the horse genome comprised of 240 microsatellite and RAPD markers. Animal Genetics, 30(1), 1–9.

Swinburne, J., Gerstenberg, C., Breen, M., Aldridge, V., Lockhart, L. et al. (2000). First comprehensive low-density horse linkage map based on two 3-generation, full-sibling, cross-bred horse reference families. Genomics, 66(2), 123–134.

Tait, B. D. (2011). The ever-expanding list of HLA alleles: Changing HLA nomenclature and its relevance to clinical transplantation. Transplantation Reviews (Orlando), 25(1), 1–8.

Tallmadge, R. L., Campbell, J. A., Miller, D. C., & Antczak, D. F. (2010). Analysis of MHC class I genes across horse MHC haplotypes. Immunogenetics, 62(3), 159–172.

Tallmadge, R. L., Lear, T. L., & Antczak, D. F. (2005). Genomic characterization of MHC class I genes of the horse. Immunogenetics, 57(10), 763–774.

Trowsdale, J. (2011). The MHC, disease and selection. Immunology Letters, 137(1–2), 1–8.

Tseng, C. T., Miller, D., Cassano, J., Bailey, E., & Antczak, D. F. (2010). Identification of equine major histocompatibility complex haplotypes using polymorphic microsatellites. Animal Genetics, 41(Suppl. 2), 150–153.

Wade, C. M., Giulotto, E., Sigurdsson, S., Zoli, M., Gnerre, S. et al. (2009). Genome sequence, comparative analysis, and population genetics of the domestic horse. Science, 326(5954), 865–867.

Walsh, E. C., Mather, K. A., Schaffner, S. F., Farwell, L., Daly, M. J. et al. (2003). An integrated haplotype map of the human major histocompatibility complex. American Journal of Human Genetics, 73, 580–590.

Yuhki, N., Beck, T., Stephens, R., Neelam, B., & O’Brien, S. J. (2007). Comparative genomic structure of human, dog, and cat MHC: HLA, DLA, and FLA. Journal of Heredity, 98(5), 390–399.
