# A simple method for mapping the location of cross-β-forming regions within protein domains of low sequence complexity

**Authors:** Jinge Gu, Xiaoming Zhou, Lillian Sutherland, Glen Liszczak, Steven L. McKnight

PMC · DOI: 10.1073/pnas.2503382122 · Proceedings of the National Academy of Sciences of the United States of America · 2025-04-23

## TL;DR

This paper introduces a method to identify regions in proteins that can self-associate and form structures important for cellular organization.

## Contribution

A new method is introduced to pinpoint self-associating regions in low complexity protein domains.

## Key findings

- Cross-β-prone regions suppress fluorescence when attached to the C-terminus of GFP.
- A 20 amino acid sequence in TDP-43's LCD is essential for self-association and phase separation.
- The method can identify self-associating regions in low complexity domains.

## Abstract

This study describes a molecular biological method for analyzing protein domains of low sequence complexity in search of segments that mediate self-association and consequent phase separation both in vitro and in vivo. Small regions allowing for self-association correspond to sequences that specify the formation of labile cross-β structural order. When juxtaposed to the C-terminus of GFP, cross-β-prone regions suppress fluorescence. A tiled scan of overlapping fragments of the low complexity domain (LCD) of the TDP-43 RNA-binding protein pinpointed an evolutionarily conserved sequence of 20 amino acids essential for self-association, phase separation, and the formation of nuclear speckles. The screening method described herein should be useful for the analysis of any LCD believed to function via homotypic self-association.

Protein domains of low sequence complexity are unable to fold into stable, three-dimensional structures. In test tube studies, these unusual polypeptide regions can self-associate in a manner causing phase separation from aqueous solution. This form of protein:protein interaction has been implicated in numerous examples of dynamic morphological organization within eukaryotic cells. In several cases, the basis for low complexity domain (LCD) self-association and phase separation has been traced to the formation of labile cross-β structures. The primary energetic force favoring formation of these transient and reversible structures is enabled by polypeptide backbone interactions. Short, contiguous networks of peptide backbone amino groups and carbonyl oxygens are zippered together intermolecularly by hydrogen bonding as described by Linus Pauling seven decades ago. Here, we describe a simple, molecular biological method useful for the identification of localized, self-associating regions within larger protein domains of low sequence complexity.

## Linked entities

- **Proteins:** NAL1 (Protein NARROW LEAF 1), TARDBP (TAR DNA binding protein)

## Full-text entities

- **Chemicals:** hydrogen (MESH:D006859), oxygens (MESH:D010100)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12054801/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12054801/full.md

## References

49 references — full list in the complete paper: https://tomesphere.com/paper/PMC12054801/full.md

---
Source: https://tomesphere.com/paper/PMC12054801