# Remotely acting SMCHD1 gene regulatory elements: in silico prediction and identification of potential regulatory variants in patients with FSHD

**Authors:** Mary B. Mayes, Taniesha Morgan, Jincy Winston, Daniel S. Buxton, Mihir Anant Kamat, Debbie Smith, Maggie Williams, Rebecca L. Martin, Dirk A. Kleinjan, David N. Cooper, Meena Upadhyaya, Nadia Chuzhanova

PMC · DOI: 10.1186/s40246-015-0047-x · Human Genomics · 2015-10-07

## TL;DR

This study explores how distant DNA regions might regulate the SMCHD1 gene in facioscapulohumeral dystrophy, identifying potential disease-linked variants.

## Contribution

The study introduces a method to predict and identify remotely acting regulatory elements influencing the SMCHD1 gene in FSHD.

## Key findings

- Two regions ~14 and ~85 kb upstream of SMCHD1 were identified as potential regulatory elements.
- Three heterozygous variants were found near SMCHD1, with one showing potential functional significance.
- The approach offers a new way to screen for disease-relevant mutations beyond the immediate gene vicinity.

## Abstract

Facioscapulohumeral dystrophy (FSHD) is commonly associated with contraction of the D4Z4 macro-satellite repeat on chromosome 4q35 (FSHD1) or mutations in the SMCHD1 gene (FSHD2). Recent studies have shown that the clinical manifestation of FSHD1 can be modified by mutations in the SMCHD1 gene within a given family. The absence of either D4Z4 contraction or SMCHD1 mutations in a small cohort of patients suggests that the disease could also be due to disruption of gene regulation. In this study, we postulated that mutations responsible for exerting a modifier effect on FSHD might reside within remotely acting regulatory elements that have the potential to interact at a distance with their cognate gene promoter via chromatin looping. To explore this postulate, genome-wide Hi-C data were used to identify genomic fragments displaying the strongest interaction with the SMCHD1 gene. These fragments were then narrowed down to shorter regions using ENCODE and FANTOM data on transcription factor binding sites and epigenetic marks characteristic of promoters, enhancers and silencers.

We identified two regions, located respectively ~14 and ~85 kb upstream of the SMCHD1 gene, which were then sequenced in 229 FSHD/FSHD-like patients (200 with D4Z4 repeat units <11). Three heterozygous sequence variants were found ~14 kb upstream of the SMCHD1 gene. One of these variants was found to be of potential functional significance based on DNA methylation analysis. Further functional ascertainment will be required in order to establish the clinical/functional significance of the variants found.

In this study, we propose an improved approach to predict the possible locations of remotely acting regulatory elements that might influence the transcriptional regulation of their associated gene(s). It represents a new way to screen for disease-relevant mutations beyond the immediate vicinity of the specific disease gene. It promises to be useful for investigating disorders in which mutations could occur in remotely acting regulatory elements.

The online version of this article (doi:10.1186/s40246-015-0047-x) contains supplementary material, which is available to authorized users.

## Linked entities

- **Genes:** SMCHD1 (structural maintenance of chromosomes flexible hinge domain containing 1) [NCBI Gene 23347], LOC710545 (double homeobox protein 4C) [NCBI Gene 710545]
- **Diseases:** Facioscapulohumeral dystrophy (MONDO:0001347), FSHD (MONDO:0001347)

## Full-text entities

- **Genes:** RUNX3 (RUNX family transcription factor 3) [NCBI Gene 864] {aka AML2, CBFA3, PEBP2aC}, STAT2 (signal transducer and activator of transcription 2) [NCBI Gene 6773] {aka IMD44, ISGF-3, P113, PTORCH3, STAT113}, CBX3 (chromobox 3) [NCBI Gene 11335] {aka HECH, HP1-GAMMA, HP1Hs-gamma, HP1gamma}, FSHMD1A (facioscapulohumeral muscular dystrophy 1A) [NCBI Gene 2489] {aka FMD, FSHD, FSHD1A, FSHMD}, METTL4 (methyltransferase 4, N6-adenosine) [NCBI Gene 64863] {aka HsT661}, MDFIC (MyoD family inhibitor domain containing) [NCBI Gene 29969] {aka HIC, LMPHM12, MDFIC1}, DUX4 (double homeobox 4) [NCBI Gene 100288687] {aka DUX4L}, IRF1 (interferon regulatory factor 1) [NCBI Gene 3659] {aka IMD117, IRF-1, MAR}, CTCF (CCCTC-binding factor) [NCBI Gene 10664] {aka CFAP108, FAP108, MRD21}, TNFRSF25 (TNF receptor superfamily member 25) [NCBI Gene 8718] {aka APO-3, DDR3, DR3, LARD, TNFRSF12, TR3}, SMCHD1 (structural maintenance of chromosomes flexible hinge domain containing 1) [NCBI Gene 23347] {aka BAMS, FSHD2}, STAT1 (signal transducer and activator of transcription 1) [NCBI Gene 6772] {aka CANDF7, IMD31A, IMD31B, IMD31C, ISGF-3, STAT91}, BATF (basic leucine zipper ATF-like transcription factor) [NCBI Gene 10538] {aka B-ATF, BATF1, SFA-2, SFA2}, SPI1 (Spi-1 proto-oncogene) [NCBI Gene 6688] {aka AGM10, OF, PU.1, SFPI1, SPI-1, SPI-A}, MAK (male germ cell associated kinase) [NCBI Gene 4117] {aka RP62}, DR1 (down-regulator of transcription 1) [NCBI Gene 1810] {aka NC2, NC2-BETA, NC2B, NCB2}
- **Diseases:** genetic defects (MESH:D030342), facial and shoulder girdle muscle weakness (MESH:D018908), FSHD (MESH:D020391), atrophy of the facial and shoulder girdle muscle (MESH:D009133), FSHD2 (MESH:C563557), muscular dystrophies (MESH:D009136), and shoulder girdle weakness (MESH:D020968)
- **Chemicals:** Validate (-), DS (MESH:D003903)
- **Species:** Pan troglodytes (chimpanzee, species) [taxon 9598], Gorilla (genus) [taxon 9592], Callithrix jacchus (common marmoset, species) [taxon 9483], Homo sapiens (human, species) [taxon 9606], Saimiri sciureus (common squirrel monkey, species) [taxon 9521], Macaca mulatta (rhesus macaque, species) [taxon 9544], Macaca fascicularis (crab eating macaque, species) [taxon 9541]
- **Mutations:** 610 T > C, 886 G > A, rs7229070, 858 T > C
- **Cell lines:** GM12878 — Homo sapiens (Human), Transformed cell line (CVCL_7526), GM06990 — Homo sapiens (Human), Transformed cell line (CVCL_9587)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC4597391/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC4597391/full.md

## References

27 references — full list in the complete paper: https://tomesphere.com/paper/PMC4597391/full.md

---
Source: https://tomesphere.com/paper/PMC4597391