Learning molecular traits of human pain disease via voltage-gated sodium channel structure renormalization

Markos N. Xenakis; Angelika Lampert

PMC · DOI:10.1016/j.csbj.2025.11.048·December 1, 2025

Learning molecular traits of human pain disease via voltage-gated sodium channel structure renormalization

Markos N. Xenakis, Angelika Lampert

PDF

Open Access

TL;DR

This paper explores how voltage-gated sodium channels function and how their structure relates to chronic pain, using a new computational method to identify mutation hotspots.

Contribution

The paper introduces a novel renormalization group flow paradigm and machine learning approach to study NaVCh thermostability and pain-related mutations.

Findings

01

A critical inflection point regulating thermostability in NaVCh pore domains was identified using a generalized Widom scaling law.

02

A machine learning algorithm successfully identified pain-disease-associated mutation hotspots in the human NaV1.7 channel.

03

The method provides accurate insights for human pain medicine with reduced computational cost.

Abstract

Mammalian neurophysiology vitally depends on the stable functioning of transmembrane, pore-forming voltage-sensing proteins known as voltage-gated sodium channels (NaVChs). Deciphering the principles of NaVCh spatial organization can illuminate fundamental structure-function aspects of pore-forming proteins and offer new opportunities for pharmacological treatment of associated diseases such as chronic pain. Here, we introduce a renormalization group flow paradigm permitting a formal investigation of NaVCh thermostability properties. Our procedures are solidified by deriving an atom-packing entropy and validated over 121 experimentally resolved NaVCh structures of prokaryotic and eukaryotic origin. We uncover the universality of a critical inflection point regulating the thermostability of the pore domain relative to the voltage sensors, summarized in terms of a generalized Widom…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Genes1

SCN9A

Proteins1

Species1

Homo sapiens

Chemicals1

NaVCh

Diseases2

chronic pain pain

Figures9

Click any figure to enlarge with its caption.

Table 1Machine learning experiments summary. Median area-under-the-curve (AUC) and F1 scores obtained during the final classification round (SI 1.10.2). The first and second numbers of each $[eqn]$ -pair are median values obtained during training and testing the final classification model, respectively. The thermostability significance of inertia (iner.) and conductivity (cond.) constraints is probed by the feature inputs $[eqn]$ and $[eqn]$ , respectively. *SCN9A*-gene mutation datasets I, II, and III contain the classes $[eqn]$ class_0: GoF $[eqn]$ LoF, class_1: Neutr. $[eqn]$ , $[eqn]$ clas

Keywords

Voltage-gated sodium channelRenormalizationCriticalityHuman pain diseaseMachine learning

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIon channel regulation and function · Nanopore and Nanochannel Transport Studies · Nicotinic Acetylcholine Receptors Study

Full text

Introduction

1

Voltage-gated sodium channels (NaVChs) play a central role in neurophysiology by initiating and propagating action potentials along neuronal tissue [1]. Functionally, they act as ‘traffic controllers’ for sodium ions crossing the cell membrane, thereby shaping the upstroke of the action potential [2], [3].

A key feature that makes sodium transport entropically favorable within the hydrophobic membrane is hydrophilicity [4]. Hydrophilic groups coordinate the dehydration of sodium ions as they traverse the NaVCh pore, enabling efficient permeation [5], [6], [7]. The prevailing view is that NaVCh selectivity arises from a finely tuned balance of strong and potentially long-range interactions between the selectivity filter (SF) and surrounding residue clusters [8], a concept supported by early electrophysiological studies [9].

Members of the NaVCh superfamily share a conserved structural organization: four radially arranged homologous domains (DI–IV) form a porous membrane environment [10]. The first NaVCh structure solved was the prokaryotic channel from Arcobacter butzleri (NaVAb), captured in a pre-open state [11]. As predicted from previous crystallographic insights into potassium channels [12], [13], NaVAb confirmed that each domain contains a pore module (PM) linked to a voltage-sensor domain (VSD) [11]. The PM consists of two antiparallel $[eqn]$ -helical segments (S5–S6), connected by an extracellular loop and the SF [11]. Together, the four PMs form the pore domain (PD), composed of a narrow, sieve-like SF region, a hydrophobic central cavity, and an intracellular constriction where the putative activation gate (AG) resides [11].

The VSD comprises four transmembrane $[eqn]$ -helices (S1–S4) and is connected to the PM through an S4–S5 linker [11]. Positively charged residues in S4 detect membrane depolarization [14]; their outward displacement exerts a pulling force that opens the pore [14], [15], [16], [17]. Compared with their prokaryotic ancestors, eukaryotic NaVChs display greater structural and functional diversity, reflecting the breaking of radial symmetry [10]. As a result, eukaryotic channels exhibit a richer, more specialized metastable dynamics repertoire and enhanced allosteric efficiency [18], [19], [20].

Complex biomolecules, such as NaVChs, have evolved to balance sensitivity and robustness to perturbations of environmental and genetic origin [21], [22]. For example, the gnomAD database [23] reports thousands of benign variants in human NaVCh genes, such as 729 benign variants in SCN9A [24], which encodes the pain-related NaV1.7 channel [25]. This abundance of tolerated variation indicates that NaVChs possess substantial structural and functional resilience. What physical principle enables this?

Protein systems often rely on self-similarity, or scale invariance [26], to optimally distribute internal energy and external stresses across their structure [27], [28]. This property is characteristic of self-organized criticality (SOC) [29], an evolutionary strategy that preserves high functional sensitivity while maintaining robustness against mutations [21], [22], [27], [28], [30], [31].

Building on early work showing universal transient behavior in the hydropathic radial profile of globular proteins [53], [54], [55], [56], it was suggested that SOC signatures in proteins manifest as patterns of extrema, peaks and valleys, in intrinsic thermostability cost functions linked to water-mediated interactions [30], [31]. Scaling analysis of the atomic environment around the NaVAb and NaV1.7 pores demonstrates that the atomic distribution around the pore is predominantly unimodal, i.e., that the corresponding cumulative distribution function admits a prominent inflection point [34], [35]. In addition, at specific pore points (such as those marking the SF pore region) the hydropathic dipole field magnitude increases self-similarly along the radial direction, exhibiting a distinct peak near this inflection point [34], [35]. Also, the radial location of the inflection point corresponds to the characteristic size of the PD, marking the structural transition from the PD to the voltage-sensor domains (VSDs) [34], [35].

This threefold ‘coincidence’ – the unimodal atomic distribution, the self-similar increment of the hydropathic dipole field magnitude, and the existence of a characteristic molecular length scale describing the transition from the PD to the VSDs – suggests that such inflection points may serve as subtle indicators of SOC [32]. It also supports a conceptual analogy between the spatial atom arrangement surrounding the pore and the archetypal SOC sandpile model [29], offering a useful toy model for conceptualizing NaVCh molecular complexity [33]. Accordingly, residue structural positions are abstracted as lattice sites, with mutations analogous to externally added sand grains perturbing the lattice structure. The slope of the sandpile, inferred from scaling (power-law) exponents [30], [31], [34], [35], predicts whether a mutation at a given site is likely to destabilize the molecule, thereby providing a rationale for distinguishing between pain-disease-associated and benign structural locations in NaV1.7 [35].

A self-consistent scaling theory for proteins must rest on a renormalization group (RG) foundation [36]. Here, we examine this assumption within the NaVCh superfamily and show how it can yield biomedically relevant insights into inherited human pain disorders. Starting from the simplest radial differential equation that can rationally support a pore-forming architecture with a PD/VSD interface, we derive an RG flow equation that enables a substantial yet biophysically meaningful reduction of atom-packing degrees of freedom (DoF) around a NaVCh pore. ‘Atom-packing DoF’ refers to the number of positional possibilities available to atoms surrounding a pore point at any moment. Beyond traditional definitions of molecular entropy, which are informed by global disorder arising from all motions and energy states, we derive a localized measure of entropy to quantify atom-packing DoF in a pore-point-specific manner. This methodological novelty is not merely of theoretical interest; it is motivated by the fact that the performance of machine learning algorithms that predict disease hot-spots in NaVChs largely depends on capturing conserved patterns [37], [38] which encode how geometric and hydropathic characteristics vary across the spatial extent of the NaVCh structure [37], [39]. We report universal trends in dimensionality reduction, atom-packing entropy variation along the pore, and symmetry breaking of hydropathic dipole fields across the NaVCh superfamily, based on analysis of 121 experimentally resolved full-atom structures from both prokaryotic and eukaryotic sources. At the single-molecule level, we characterize mutation clustering relative to the PD/VSD interface in NaV1.7 and establish a transparent machine-learning framework that identifies structural locations where steepening of the sandpile slope may correlate with human pain phenotypes.

Methods

2

Setting the scene

NaVChs are modular biomolecules organized around a central pore axis that defines the ion conduction pathway (Fig. 1(a)). Understanding their functional architecture therefore requires analyzing molecular dynamics trajectories across multiple temporal and spatial scales. A prerequisite for such an analysis is that the NaVCh adopts a sufficiently long-lived or frequently revisited metastable state.Fig. 1Illustrative summary of the renormalization group procedure for a voltage-gated sodium channel. (a), The illustrated molecular side view corresponds to a pre-open NaVAb molecule (PDB code: 3rvy). The pore domain (PD) and voltage sensor domains (VSDs) are illustrated in blue and red, respectively. $[eqn]$ and $[eqn]$ are the membrane-parallel and membrane-perpendicular unit vectors, respectively. We introduce consecutive pore points, $[eqn]$ (SI Eq. (S1)), forming a path through the NaVAb pore. Each pore point serves as the center of an ensemble of nested balls, each of them characterized by a radius $[eqn]$ [Å] (Eq. (1)). An infinite number of radial paths (rays) emanate from each pore point in all directions, making analysis of the atomic environment along individual paths impractical. The renormalization group procedure solves this problem by collapsing radial paths into $[eqn]$ , rendering atomic environment properties inherently dependent on $[eqn]$ . Coarse-graining[41] then enables the computation of relevant scaling exponent. For example, substituting $[eqn]$ for $[eqn]$ in Eq. (30), returns via (31) the order parameter scaling exponent, $[eqn]$ (SI Eq. (S15)). (b), Top view, from the extracellular side (ES). $[eqn]$ [Å] represents the characteristic size of the PD, also marking the inflection point of the cumulative atom number ((5), (6)). $[eqn]$ [Å] is the radius of the smallest ball that contains the entire molecule (SI Eq. (S4)). IS stands for intracellular side.Fig. 1

We focus on observables that remain invariant under transformations of NaVCh size (or scale), conformational state, and subtype. Although this perspective may seem unconventional to molecular biologists accustomed to annotating channel regions based on structural and physicochemical distinctions, our framework complements that detailed view by revealing how apparently disparate features can be unified through mathematically tractable design principles.

To illustrate the approach, imagine a probe designed for the molecular environment. Choose a coordinate of interest, denoted $[eqn]$ , representing a biophysically meaningful location within the molecule. From $[eqn]$ , initiate a trajectory by extending a radial path (ray) outward toward the molecular ‘rim’ (Fig. 1(a) and (b)). By evaluating all such paths, we can characterize how the local environment around $[eqn]$ changes as a function of radial distance. At each step along a given path, the probe records measurement data; once a predefined distance is reached, we return to $[eqn]$ , select a new radial path, and repeat the process. Naturally, $[eqn]$ corresponds to a pore point, a coordinate along which ion transport dynamics unfold.

However, because infinitely many rays pass through any pore point, this probe protocol becomes infeasible: it enters an endless loop over all possible radial directions, preventing completion of the measurement set for any given $[eqn]$ . To avoid this impasse while still obtaining sufficiently informative data, we introduce a renormalization group (RG) technique [36], [40] operating directly in molecular space. The RG procedure collapses the infinitely many rays into a single parameter, $[eqn]$ [Å] (Fig. 1(a) and (b)), making all probe measurements explicit functions of radial distance. This coarse-graining operation [41] provides an unbiased representation of information across all radial directions while eliminating path-specific details.

The objective of this RG approach is to substantially reduce atom-packing degrees of freedom (DoF) while preserving the essential molecular physics. A successful RG implementation should yield only a small number of parameters defining a low-dimensional representation of NaVCh functional architecture. This, in turn, allows us to derive, rather than assume, the machine-learning features used to classify variants according to their structural location [37], [38], [39], [42], which is a crucial advantage, as such methods perform best when the underlying feature space admits a low-dimensional parameterization.

To orient the reader for Section 2.1, Fig. 2 provides a logical-flow diagram summarizing the key mathematical operations.Fig. 2Analytical procedures guide. Our starting point is to retrieve the atom-packing law and its associated intrinsic dimensionality measure via an appropriate maximum entropy procedure (upper box (I)). To understand the physicochemical forces that stabilize atom-packing patterns, we study the scaling behavior of hydropathic moments (upper box (II)). To derive machine learning features, we decompose a hydropathic moment into its hydrophobic and hydrophilic components in order to better understand how the ‘competition’ between oppositely acting physicochemical forces determines a molecule’s susceptibility to environmental and/or genetic perturbations (upper box (III)). For theoretical completeness, we recall that the observables appearing in the upper box adhere to a common renormalization rule (lower box).Fig. 2

Analytical procedures

2.1

Spatial organization principles

2.1.1

We focus on a pore point, $[eqn]$ (SI Eq. (S1) and Fig. 1(a), and introduce the open ball:

[eqn]

where $[eqn]$ [Å] is the probe radius and $[eqn]$ is the vector from $[eqn]$ to a candidate atom coordinate, $[eqn]$ . $[eqn]$ is the Euclidean norm. $[eqn]$ denotes the spherical surface of $[eqn]$ . In a structural biology context, $[eqn]$ is called an interface.

Mathematical functions and associated parameters appearing henceforth depend on $[eqn]$ and $[eqn]$ , respectively. For clarity, this dependence is omitted.

Packing of atoms

2.1.1.1

The number of atoms residing on $[eqn]$ is continuously approximated by the following generalized logistic differential equation [34], [43]:

[eqn]

$[eqn]$ is the number of atoms residing inside $[eqn]$ . $[eqn]$ represents the effective range over which atoms push each other apart (i.e., repel each other). On the other hand, $[eqn]$ represents the effective range over which atoms pull each other together (i.e., attract each other). $[eqn]$ is the atom-packing initial condition realized for some initial $[eqn]$ -value, $[eqn]$ . This means $[eqn]$ is the number of atoms residing in the initial ball, $[eqn]$ . $[eqn]$ [atom] is the carrying capacity of the NaVCh structure. It delimits the number of atoms residing inside $[eqn]$ for $[eqn]$ (see also (3)), thereby setting an upper bound on the number of atoms the structure can accommodate.

We propose an effective phenomenology supporting (2) based on the interplay between hydrophobic attraction and hydrophilic repulsion [44]. The nonlinear term on the right hand side (RHS) of (2) accounts for attractive, hydrophobicity-driven atomic interactions stabilizing an $[eqn]$ -cluster. Namely, it reflects the tendency of hydrophobic constituents to ‘hide’ inside $[eqn]$ . The term $[eqn]$ indicates that any group of $[eqn]$ atoms can form an interaction subnetwork. Since no spatial constraints apply to the formation of such a group, $[eqn]$ also accounts for long-range interatomic interactions. Hence, $[eqn]$ explains the emergence of long-range interactions through bond length adjustment driven by atomic repulsivity.

Accordingly, the linear term appearing in the RHS of Eq. (2) accounts for repulsive, hydrophilicity-driven interactions pushing atoms toward $[eqn]$ . This is attributed to water structuring effects that increase the energy required for efficient packing of atoms, according to the thermodynamic self-assembly principles outlined in [45]. Additionally, $[eqn]$ incorporates the Pauli exclusion principle, as discussed in Ref. [46].

The characteristic size of a pore domain

2.1.1.2

Solving (2) for $[eqn]$ gives:

[eqn]

where

[eqn]

$[eqn]$ imposes an upper bound on the average unsigned hydropathic energy of an individual atom. On the basis of simplicity, $[eqn]$ [kcal/atom] is chosen to be a constant. Accordingly, $[eqn]$ [kcal] delimits the absolute hydropathic energy stored in the initial ball, $[eqn]$ . $[eqn]$ [Å ^-1^] can then be interpreted as the rate at which hydrophobic energy surpluses, stored inside $[eqn]$ , are consumed in useful (stabilizing) interactions as the $[eqn]$ -cluster size increases. We note that this interpretation is simply a reformulation of our initial understanding about $[eqn]$ deduced from (2). Specifically, decreasing $[eqn]$ increases $[eqn]$ , thereby increasing the likelihood of distant atom interactions, reflecting the faster depletion of hydrophobic energy surpluses into stabilizing, potentially long-range bonds.

We do not seek a biophysical interpretation for the logarithmic argument, $[eqn]$ . Instead, we focus on $[eqn]$ .

We emphasize that $[eqn]$ marks the $[eqn]$ -value for which the second-order derivative of $[eqn]$ with respect to $[eqn]$ , i.e., the curvature of $[eqn]$ along the radial direction,

[eqn]

becomes zero. Stated differently, $[eqn]$ is an inflection point of $[eqn]$ . It follows that for $[eqn]$ , (2) is globally maximized, i.e.,:

[eqn]

The biophysical significance of (6) becomes apparent when considering empirical evidence suggesting that $[eqn]$ serves as an approximation of the interface mediating the structural transition from the PD to the VSDs [34], [35]. Accordingly, we treat $[eqn]$ as the characteristic length scale of the PD sub-architecture, i.e., the maximum size at which the PD retains its essential functional and structural characteristics, independent of VSD influence (for an illustration see Fig. 1(b), left). Namely, beyond $[eqn]$ (i.e., for $[eqn]$ ), the coupling interactions between a pore module and a radially succeeding voltage sensor cannot be neglected, driving the structural transition from the former toward the latter.

The irregular cylindrical surface emerging from the dense arrangement of $[eqn]$ balls along $[eqn]$ is parameterized by:

[eqn]

(7) serves as the characteristic cover for the PD, in the sense that it incorporates essential structural elements of the PD sub-architecture.

Mean-field conditioning of the atomic environment

2.1.1.3

The length scale

[eqn]

denotes the $[eqn]$ -value for which the atom-packing condition $[eqn]$ is satisfied, meaning exactly half of the $[eqn]$ atoms are residing inside $[eqn]$ . ‘b’ stands here for ‘bound’, since $[eqn]$ imposes an upper and lower bound on $[eqn]$ , depending on whether the attractive or repulsive interaction range prevails, i.e., whether $[eqn]$ or $[eqn]$ , respectively. Accordingly, the negative and positive sign of $[eqn]$ decides whether the length scale bound applies from below or from above, i.e., whether $[eqn]$ or $[eqn]$ , respectively.

If the attractive interaction range equals the repulsive interaction range (i.e., $[eqn]$ ), then $[eqn]$ and $[eqn]$ , and (3) becomes isomorphic with the Fermi-Dirac (i.e., logistic) distribution. Hydrophobicity-driven attraction and hydrophilicity-driven repulsion are then delicately balanced, imposing a mean-field conditioning on the atomic environment around a pore point. Across an evolutionary time scale, $[eqn]$ promotes isotropic space exploration in the PD and the VSDs, as an equal number of atoms is distributed in the PD and the VSDs (i.e., half of the $[eqn]$ atoms are found inside $[eqn]$ and the other half outside of it). Generally, this favors the emergence of globular-like molecular shapes, as demonstrated in Ref. [46], which, in the context of this study, is driven by a roughly equal and uniform allocation of masses inside and outside $[eqn]$ .

Unit mass fractal dimension

2.1.1.4

Expressing (2) in terms of (3), reveals that packing of atoms around a pore point satisfies the self-similarity relationship:

[eqn]

which implies that:

[eqn]

where $[eqn]$ is the smallest ball that accommodates all NaVCh structure atoms (SI Eq. (S4) and Fig. 1(b) right part).

(10) introduces two distinct yet interrelated constraints. Specifically, (10)(a) and (10)(b) ensure that even if the molecular radial size, $[eqn]$ [Å] (SI Eq. (S5)), becomes infinitely large, the total number of atoms cannot exceed $[eqn]$ , and the hydropathic interaction energy per atom converges to $[eqn]$ [kcal $[eqn]$ Å /atom], respectively.

The intrinsic dimension (i.e., the unit mass fractal dimension [47]) of an $[eqn]$ -cluster is continuously measured with:

[eqn]

(11) describes how ‘intensively’ (or ‘compactly’, as discussed in Ref. [48]) $[eqn]$ atoms fill space inside $[eqn]$ . Generally, the fraction of empty space inside $[eqn]$ increases with decreasing $[eqn]$ .

Atom-packing entropy

2.1.2

Let us now consider the following discretization of $[eqn]$ :

[eqn]

where $[eqn]$ is a scale index [34], [43]. $[eqn]$ is the total number of scale iterations, determining the resolution of the discretization. $[eqn]$ [Å] is the distance separating $[eqn]$ from its nearest-neighbor atom, while $[eqn]$ [Å] remains the molecular radial size (first considered below (10) and given by SI Eq. (S5)). Note that $[eqn]$ is devoid of atoms.

The probability that an atom resides inside $[eqn]$ is given by:

[eqn]

where $[eqn]$ , $[eqn]$ , is identified with the survival function of a Pareto Type-II distribution [49]. $[eqn]$ is the partition sum guaranteeing that $[eqn]$ .

The probability that ( $[eqn]$ )-atoms reside inside $[eqn]$ is given by the Escort probability [50]:

[eqn]

where $[eqn]$ is the corresponding partition sum. $[eqn]$ ’s biophysical importance stands out when we consider the following finite-size corrected version of (10(b)):

[eqn]

where $[eqn]$ is a finite-size corrected version of $[eqn]$ . $[eqn]$ [Å ^2^] and $[eqn]$ [Å] account for interfacial and radial geometric distortions due to $[eqn]$ ’s and $[eqn]$ ’s finiteness, respectively. Note that $[eqn]$ is constant. $[eqn]$ is a normalization factor whose specific value is irrelevant.

Crucially, for $[eqn]$ , the constraint (10)(b) can be recovered:

[eqn]

An entropy functional whose maximization explicitly incorporates (13), (14), (15) is given by:

[eqn]

where $[eqn]$ is known as the nonextensivity index [51], [52] and $[eqn]$ [kcal/( $[eqn]$ atom)] is the equivalent of Boltzmann’s constant. Note that [ $[eqn]$ ] is the physical unit at which the temperature of the current pore point is measured (SI 1.4).

From an information theoretical viewpoint, $[eqn]$ can be trivially interpreted as the average amount of surprise (or ‘unexpectedness’) associated with determining how many atoms reside inside $[eqn]$ .

From an evolutionary perspective, $[eqn]$ illustrates how the competition between hydrophobicity-driven attraction and hydrophilicity-driven repulsion determines the informational content of the atomic environment around a pore point.

Local scarcity of hydrophilicity can attenuate associated repulsion effects (i.e., $[eqn]$ ), promoting the formation of a hydrophobic core. $[eqn]$ then approaches the following upper bound:

[eqn]

where $[eqn]$ is a common exponential (SI Eqs. (S6) and (S10)). Generally, the limit $[eqn]$ does not necessitate a scarcity of hydrophilic components but it can arise from screening, where the surroundings attenuate interatomic repulsivity inside $[eqn]$ , causing the effective interaction range to collapse, even with a finite concentration of repulsive particles still present.

Maximizing $[eqn]$ toward $[eqn]$ expands the NaVCh configuration space volume, since the amount of water required to solvate a hydrophobic atom group inside $[eqn]$ is typically less than that required for solvating $[eqn]$ individually dispersed hydrophobic atoms. The remaining, non-solvating water molecules can then uncoordinatedly engage in hydrogen bond interactions with NaVCh constituents on $[eqn]$ , thereby increasing atom-packing DoF. Simultaneously, we notice that the intrinsic dimension of the PD,

[eqn]

is decreased, since

[eqn]

is monotonically and positively correlated with $[eqn]$ for fixed parameter values $[eqn]$ (SI Fig. S1 inset).

(20) illustrates how the PD sub-architecture can mitigate the potentially disorganizing effect of excess (‘redundant’) atom-packing DoF by lowering its intrinsic dimension.

Thermostability

2.1.3

Empirical measure

2.1.3.1

The thermostability of an $[eqn]$ -cluster is probed with the hydropathic moments toolbox [34], [35], [43], [53], [54], [55], [56]:

[eqn]

where $[eqn]$ is a generally defined Dirac delta function, $[eqn]$ is an atom coordinate with $[eqn]$ denoting the vector from $[eqn]$ to $[eqn]$ , and $[eqn]$ [kcal], $[eqn]$ , are noise-perturbed hydropathic weights originating from the Kapcha $[eqn]$ Rossky atomic hydropathic scale [57]. Noise accounts here for randomly occurring water density fluctuations.

Note that $[eqn]$ and $[eqn]$ act parallel and perpendicular, respectively, to the membrane, surface with $[eqn]$ and $[eqn]$ being the corresponding unit vectors (SI S1.2.3). Conventionally, $[eqn]$ points toward the extracellular side (ES).

The subscript ‘ $[eqn]$ ’ indicates the moment order determining the dimension of the space within which water-mediated interactions are probed. $[eqn]$ -parity determines whether we probe inertia ( $[eqn]$ ) or conductivity ( $[eqn]$ ) constraints, concerning rotational and translational atom-packing DoF expressed around and along the pore point path, $[eqn]$ , as detailed in SI 1.5.1–2, respectively.

Theoretical model

2.1.3.2

Normalizing $[eqn]$ over $[eqn]$ yields the $[eqn]$ -cluster hydropathic density [53], [54], [55], [56] whose oscillatory behavior is described by:

[eqn]

where $[eqn]$ determines the sign changing behavior of $[eqn]$ and $[eqn]$ envelopes the underlying wave packet. The factor $[eqn]$ introduces a finite-size correction, with $[eqn]$ [Å] being the adsorption bond length. $[eqn]$ is dimensionless and purely phenomenological, as it can only be deduced by fitting $[eqn]$ to experimental $[eqn]$ -traces. It generally describes the degree of adhesion of atoms on $[eqn]$ in response to interfacial tensions, with its sign indicating whether these tensions lead to an increase or decrease in $[eqn]$ .

Notably, $[eqn]$ preserves the general hydropathic energy form used in Refs. [58], [59], [60]. Moreover, consistent with Eq. (4), for $[eqn]$ , we verify that $[eqn]$ , illustrating that $[eqn]$ serves as an upper bound for the average unsigned hydropathic energy of an individual atom.

Scaling of hydropathic dipole field amplitude

2.1.3.3

Hydropathic energy wave packet self-modulation effects persisting over sufficiently large $[eqn]$ -intervals covering the PD and the VSDs are described by [34], [35]:

[eqn]

where $[eqn]$ [Å] denotes the $[eqn]$ -value for which the $[eqn]$ -curvature (Eq. (5)) is negatively maximized.

$[eqn]$ are the corresponding scaling exponents. $[eqn]$ and $[eqn]$ set an upper bound on how ‘intensively’ $[eqn]$ -th-order dipole-dipole hydropathic interactions can fill space in the PD and the VSDs, respectively.

The sign of $[eqn]$ indicates the direction of hydropathic interaction network intensification, either inward (i.e., $[eqn]$ ) or outward (i.e., $[eqn]$ ).

Mutational robustness

2.1.4

Decomposability

2.1.4.1

Sign-changes of $[eqn]$ mark $[eqn]$ -clusters of diminished thermostability. Mechanofunctional properties of these $[eqn]$ -clusters are expected to be sensitive to perturbations.

To understand how the multiscale competition between hydrophobic attraction and hydrophilic repulsion can lead to varying thermostability and emergent mechanofunctional properties, we consider the following decomposition ansatz:

[eqn]

where $[eqn]$ is the ansatz cutoff scale.

We verify that for vanishingly small noise, i.e., $[eqn]$ (see (21)), and $[eqn]$ , case A holds. Namely, $[eqn]$ and $[eqn]$ are the contributions of only hydrophilic and hydrophobic atoms, respectively, with $[eqn]$ , $[eqn]$ and $[eqn]$ .

Whether case A or B holds for the pair $[eqn]$ requires computational investigation, as there is no general argument supporting this assertion.

Since $[eqn]$ and $[eqn]$ are cumulative, and thus slowly changing, sign-preserving functions over the $[eqn]$ -range, we can straightforwardly compute their continuously varying scaling exponents:

[eqn]

and

[eqn]

respectively.

The physical meaning of $[eqn]$ and $[eqn]$ complements that of $[eqn]$ . Namely, $[eqn]$ and $[eqn]$ account for the intensification of the $[eqn]$ -th-order dipole-dipole interaction network stabilizing the subgroups of $[eqn]$ and $[eqn]$ atoms, respectively, along the radial direction.

The signs of $[eqn]$ and $[eqn]$ indicate whether the direction of interaction network intensification is inward (if $[eqn]$ ) or outward (if $[eqn]$ ).

Logarithmic composite susceptibility

2.1.4.2

The difference between $[eqn]$ and $[eqn]$ ,

[eqn]

illustrates how the interaction networks, stabilizing the subgroups of $[eqn]$ and $[eqn]$ atoms, ‘compete’ to fill space inside $[eqn]$ .

We postulate that when $[eqn]$ , the two networks are in a state of balance, where perturbations affecting the $[eqn]$ and $[eqn]$ subgroups tend to induce oppositely directed $[eqn]$ -responses, effectively canceling each other out. On the other hand, $[eqn]$ implies an interaction network imbalance, determining also the prevailing direction of a $[eqn]$ -response. Accordingly, the sign of $[eqn]$ informs about the direction of perturbation-induced $[eqn]$ -responses.

Dividing $[eqn]$ by $[eqn]$ defines the following size-normalized measure of interaction network imbalance:

[eqn]

assuming that the interaction network imbalance is uniformly distributed over the radial extent of an $[eqn]$ -cluster.

Integrating $[eqn]$ over an $[eqn]$ -range, yields the logarithm of the $[eqn]$ -th-order composite susceptibility, i.e.,:

[eqn]

with $[eqn]$ being identified as the $[eqn]$ -th-order composite susceptibility. Generally, $[eqn]$ can be interpreted as a descriptor of how internally generated hydrophobicity fields couple with internally generated hydrophilicity fields and vice versa.

Treating $[eqn]$ as an effective temperature ratio reveals that $[eqn]$ is analogous to a thermostability geodesic distance (SI S1.6.1). Simply put, $[eqn]$ quantifies how ‘far’ an $[eqn]$ -cluster’s thermostability state is from a state of minimal thermostability (or, maximal sensitivity to perturbations).

We emphasize that although both $[eqn]$ and $[eqn]$ qualify as thermostability ‘distance’ measures (since $[eqn]$ ), the geodesic property applies exclusively to $[eqn]$ . In fact, while $[eqn]$ (Eq. (23)) provides information at an envelope level, $[eqn]$ (27) reflects the actual sculpting (i.e., the radial interplay of successive peaks and valleys) of the hydropathic energy wave packet.

On the grounds of consistency, $[eqn]$ is considered to provide equivalent information to $[eqn]$ but at an interface level. Namely, $[eqn]$ measures how ‘far’ an $[eqn]$ -shell’s thermostability state is from a state of minimal thermostability. We term $[eqn]$ the ‘interfacial coupling strength’.

Renormalizability

2.1.5

The RG flow equation has the general form [61]:

[eqn]

where $[eqn]$ is a NaVCh structure log-observable (referred to as the effective coupling [61]). $[eqn]$ is the normalized hydropathic energy scale [61], representing the relative temperature change in response to PD/VSD interface fluctuations (SI Eq. (S13)). $[eqn]$ describes the dependence of $[eqn]$ on $[eqn]$ and is the equivalent of the beta function [61]. It represents a continuously varying scaling exponent attaining its critical value precisely at $[eqn]$ (SI 1.7).

The conceptual basis upon which (30) is built, is illustrated in Fig. 1(b) along with its caption.

From (30), we straightforwardly obtain the renormalizability relationship:

[eqn]

Eq. (31) establishes that changes in $[eqn]$ over a logarithmic radial range (on the left hand side of (31)) can be equivalently obtained by considering changes in $[eqn]$ driven by PD-subarchitecture deformations reflected in PD/VSD interface fluctuations (on the right hand side of (31)).

Computational procedures

2.2

Structure preparation

2.2.1

NaVCh structures are ‘cleaned’, protonated, and their orientation is fixed, following the procedures outlined in SI S1.2.

Curve fitting and model selection

2.2.2

Candidate $[eqn]$ -models were fitted to the dimensionless empirical cumulative atom number:

[eqn]

where $[eqn]$ is the total number of atoms found in a ‘clean’, protonated NaVCh structure. Algorithmic implementation details are enclosed in SI 1.11.

Results

3

NaVCh spatial organization

3.1

We compiled a structural dataset comprising 71 prokaryotic NaVCh structures (subtypes: NaChBac, NaVAb, NaVMs, NaVAe, NaVRh) and 50 eukaryotic structures (subtypes: NaVPas, NaVEe1, NaVEh, NaV1-8) of sufficient resolution (SI S1.1).

Geometry

3.1.1

inflection point universality

3.1.1.1

We report an excellent agreement between the empirical cumulative atom number, $[eqn]$ (Eq. (32)), and its theoretical counterpart, $[eqn]$ (Eq. (3)), across all 121 structures (Fig. 3(a) and (g), SI Figs. S5-S10(a), S12-S22(a)), supported by small mean absolute fitting errors (SI S2.1). P-values remained inconclusive due to the coarse-grained nature of our model and their sensitivity to noise (SI S2.1).Fig. 3Spatial organization features of the NaVCh superfamily. Statistical summary of the spatial-organization features of 71 prokaryotic (subtypes: NaChBac, NaVAb, NaVMs, NaVAe, NaVRh) and 50 eukaryotic (subtypes: NaVPas, NaVEe1, NaV1-8, NaVEh) NaVCh atomic structures (SI Tab. S1), for pore points, $[eqn]$ , and scale indices, $[eqn]$ . (a),(g), Collapsed traces of the empirical, $[eqn]$ [atom], and best-fitted theoretical, $[eqn]$ [atom], cumulative atom numbers. (b) and (h) illustrate the prototype NaVAb channel (PDB code: 3rvy) and the human NaV1.7 channel (PDB code: 7w9k), respectively. Atoms are colored according to their PD/VSD ordering score (SI S1.8.1.2), and projected onto a plane perpendicular (left side) and parallel (right side) to the membrane. $[eqn]$ is the mean value (computed along $[eqn]$ ) of the characteristic size of the pore domain (PD). VSD stands for voltage sensor domain. (c),(i), Collapsed traces of the empirical, $[eqn]$ [kcal/atom], and theoretical, $[eqn]$ [kcal/atom], absolute hydropathic energies per atom. $[eqn]$ is the hydrophilic component, and $[eqn]$ is the hydrophobic component of $[eqn]$ , respectively. (d),(j), The interplay between the normalized atom-packing entropy, $[eqn]$ , and the pair of interaction ranges, $[eqn]$ . (e),(k), Collapsed traces of ‘pointing extracellularly (ES)’ and ‘pointing intracellularly (IS)’ instances of the normalized membrane-vertical HDF amplitude, $[eqn]$ [kcal $[eqn]$ Å /atom]. $[eqn]$ is normalized by $[eqn]$ . If $[eqn]$ , $[eqn]$ is labeled as a ‘pointing ES’ ( $[eqn]$ ) instance; otherwise, ‘pointing IS’ ( $[eqn]$ ). (f),(l), Nonextensivity of $[eqn]$ [kcal/( $[eqn]$ atom)], as revealed by its weak and strong dependence on the molecular radial size, $[eqn]$ [Å], and the interaction range ratio, $[eqn]$ , respectively. Note that $[eqn]$ is normalized by $[eqn]$ (Eq. (18)). The critical inflection regime marks the interval from which all $[eqn]$ values are drawn. Trace-collapse procedures are described in SI S1.8.1.1.Fig. 3

To appreciate the statistical weight of this finding, we emphasize that our fitting algorithm had to converge approximately 121 $[eqn]$ 690 = 83,490 times, where 121 is the total number of NaVCh structures considered and 690 gives the average number of investigated pore points per NaVCh structure, separated along the pore axis by a sampling distance of 0.1 Å (SI Tab. S2).

These findings indicate that NaVCh atom-packing physics can be compressed into the set of parameters $[eqn]$ ((2), (3)), in a pore-point-specific manner. Key to our understanding is that $[eqn]$ and $[eqn]$ are indirect measures of molecular attraction and repulsion, influencing molecular compression and expansion, respectively. Accordingly, the NaVCh molecule is conceptualized as a compressible, fluid-like material, whose mechanofunctional properties vary along its principal pore axis, approximated by the pore point path $[eqn]$ . In turn, this implies that the atomic environment around each pore point admits a distinct statistical mechanical description or, equivalently, admits a local temperature linked to the hydropathic energy level of the PD levels (SI S1.4).

The formation of interfacial geometries in compressible materials requires the spontaneous cancellation of interfacial tensions, manifested as minima in the underlying interfacial free energy [62], a principle that also applies to the spatial organization of NaVChs. Structurally, it gives rise to the PD/VSD interface, denoted $[eqn]$ (Eq. (6)), mediating two qualitatively different – in terms of their thermostability character – atomic environments. This insight enables the structural annotation of atoms, determining whether they belong to the PD or the VSDs, as $[eqn]$ serves as an order parameter distinguishing between two coexisting material phases (Fig. 3(b) and (h), SI Figs. S5-S10(b), S12-S22(b)).

Under mean-field conditions (when $[eqn]$ , see below Eq. (8)), $[eqn]$ admits the standard critical exponent, $[eqn]$ (SI Eqs. (S15) and (S16)). Moreover, it can be readily shown that $[eqn]$ is related to other standard critical exponents through a generalized version of Widom’s scaling law – whose standard form can be recovered when the characteristic renormalization scale matches the correlation length (represented here by $[eqn]$ and $[eqn]$ , respectively) (SI S1.6.4).

Comparing the $[eqn]$ -distributions (with $[eqn]$ the radial size of the molecule) for prokaryotic and eukaryotic channels shows that, in eukaryotes, the distribution peaks at $[eqn]$ , whereas in prokaryotes it is broader with multiple peaks (SI Fig. S2(c) vs. (d)), consistent with the greater assembly heterogeneity of the prokaryotic dataset along the pore (e.g., arising from C-terminal extensions (SI Fig. S8)). In both cases, the $[eqn]$ -ratio remains bounded within $[eqn]$ guaranteeing that the structural transition occurs within well-defined radial bounds, as also supported by the clear sigmoid profiles exhibited by $[eqn]$ (and $[eqn]$ ) in Fig. 3(a) and (g).

Pore domain intrinsic dimensionality

3.1.1.2

Dimensionality analysis reveals that prokaryotic and eukaryotic PDs exhibit a preference for ‘flatness’, as their intrinsic dimensions (Eq. (19)) are, on average, values only slightly greater than 2 (Fig. 4(a) and (b)).Fig. 4Intrinsic dimensions, scaling exponents, and attractive-vs.-repulsive interaction range statistics. Statistical compilation of the pore domain (PD) intrinsic-dimension and interaction-range characteristics for 71 prokaryotic (subtypes: NaChBac, NaVAb, NaVMs, NaVAe, NaVRh) and 50 eukaryotic (subtypes: NaVPas, NaVEe1, NaV1-8, NaVEh) NaVCh atomic structures (SI Tab. S1). (a),(b), Histograms of the empirical and theoretical measures of the PD intrinsic dimension. The $[eqn]$ -distribution mean values of the empirical intrinsic dimension measure are $[eqn]$ and $[eqn]$ , with corresponding standard deviations of $[eqn]$ and $[eqn]$ , for prokaryotic and eukaryotic PDs, respectively. (c),(d), Histograms of the attractive and repulsive interaction ranges given by $[eqn]$ [Å] and $[eqn]$ [Å], respectively. The $[eqn]$ -distribution mean values are $[eqn]$ Å and $[eqn]$ Å for $[eqn]$ , and $[eqn]$ Å and $[eqn]$ Å for $[eqn]$ , in prokaryotes and eukaryotes, respectively. (e),(f), Histograms of the scaling exponents of the membrane-perpendicular dipole-field component, $[eqn]$ . The exponents $[eqn]$ and $[eqn]$ account for the scaling behavior of $[eqn]$ over $[eqn]$ -intervals covering the PD and the voltage sensor domains, respectively (Eq. (23)). They are computed according to procedures described in SI S1.8.3. PC stands for Pearson correlation coefficient.Fig. 4

By tuning the intrinsic dimension of the PD near 2, PD sub-architecture is effectively mapped onto consecutive membrane-parallel planes, with roughness reciprocally related to $[eqn]$ , favoring a smooth cylindrical shape. According to Eq. (20), this reflects a self-regulating mechanism that preserves PD structural and functional integrity by compensating for excess atom-packing DoF generated by a tightened atomic environment arising either from hydrophobic surpluses or attenuated repulsive forces, both of which drive $[eqn]$ : with $[eqn]$ held well above zero, decreasing $[eqn]$ reduces interatomic distances and counteracts the expansion of configuration-space volume caused by surplus atom-packing DoF. Lowering the intrinsic dimension therefore increases spatial-organization efficiency in the PD by favoring nearly water-free, planar configurations.

Eukaryotic PDs exhibit greater efficiency in spatial organization, as evidenced by their smaller mean and standard deviation values for intrinsic dimensions compared to prokaryotic PDs (see Fig. 4 caption). This finding is consistent with the general trends reported in Ref. [63], where lower intrinsic dimensions were found to correlate with higher organismal complexity.

Atom-packing energy and entropy

3.1.2

An exponentially decaying hydrophobic ‘force’ stabilizes the structure

3.1.2.1

As illustrated in Fig. 3(c) and (i), a decrease in $[eqn]$ -cluster size implies an increase in its average hydrophobicity, $[eqn]$ ((22), (24)). Specifically, $[eqn]$ converges toward $[eqn]$ kcal for $[eqn]$ (decreasing scale index), denoting the hydropathic score assigned to an individual hydrophobic atom based on the Kapcha $[eqn]$ Rossky hydropathic scale [57]. In contrast, the $[eqn]$ -cluster average hydrophilicity, $[eqn]$ ((22), (24)), tends to vanish for $[eqn]$ (Fig. 3(c) and (i)).

This trend showcases an entropically favorable arrangement in which hydrophobicities ‘hide’ inside $[eqn]$ rather than being exposed on its surface, $[eqn]$ . As hydrophobic constituents fill the available space inside $[eqn]$ , hydrophilic ones are displaced outward, preferentially settling near $[eqn]$ . This redistribution creates an initial surplus of hydrophobic energy available to pore-lining constituents, causing the experimentally observed absolute hydropathic energy per atom, $[eqn]$ , to peak as $[eqn]$ (Fig. 3(c) and (i); SI Figs. S5–S10(c), S12–S22(c)).

However, as the $[eqn]$ -cluster size grows (increasing scale index), the initial hydrophobic energy surplus is exponentially depleted at a rate governed by $[eqn]$ and converted into stabilizing interactions that bind the nested $[eqn]$ ’s together. Intriguingly, this suggests that interfacial adsorption effects do not substantially distort atom-packing conditions (Eq. (22)). In turn, this implies that NaVChs have evolved near a thermodynamic state in which they mimic the spatial organizational traits of larger, bulkier systems. This maximizes NaVCh mechanofunctional efficiency by allowing perturbations to be processed nearly adiabatically (SI S1.6.1). Our analysis supports this intuition, as shown by the good agreement between $[eqn]$ and $[eqn]$ [kcal/atom] (Fig. 3(c) and (i), SI Figs. S5–S10(c) and S12–S22(c)), where $[eqn]$ is the theoretically predicted absolute hydropathic energy per atom (Eq. (4)).

Evidently, if the repulsive interaction range $[eqn]$ grows too large, structural integrity could be compromised due to excessive interatomic spacing. To avoid this, the distribution of $[eqn]$ is located closer to zero than that of $[eqn]$ (Fig. 4(c) and (d)). Also, in contrast to $[eqn]$ , $[eqn]$ has a smaller mean value for eukaryotic NaVChs compared to prokaryotic NaVChs (Fig. 4 caption), supporting the idea that eukaryotic NaVChs, despite being more diverse – both structurally and functionally –, are spatially organized in a more efficient manner. Notably, the mean value of $[eqn]$ is approximately 8.8 Å and 12.9 Å in prokaryotic and eukaryotic NaVChs, respectively (Fig. 4(c) and (d)). These values are in good agreement with the typical $[eqn]$ 10 Å decay-length readout commonly associated with measuring the exponentially decaying amplitude of hydropathic interaction potentials between molecular surfaces [58], [59], [60], which are explicitly represented here by the interface geometry $[eqn]$ .

Atom-packing entropy variation along the pore

3.1.2.2

Moving from the IS to the ES along $[eqn]$ , we observe that the atom-packing entropy, $[eqn]$ (Eq. (17)), is smoothly up-regulated in both prokaryotes and eukaryotes (Fig. 3(d) and (j), SI Figs. S5-S10(d) and S12-S22(d)), reflecting the general tendency that $[eqn]$ becomes comparable to $[eqn]$ on the IS (see bottom left region of Fig. 3(d) and (j)). Atomic environments on the IS and ES are thus subject to qualitatively different entropic constraints, likely also experiencing asymmetric perturbation responses. To sustain such a delicate metastable equilibrium, even instantaneously, requires the contribution of external forces, since otherwise the structural integrity of the IS could be compromised. The observed nonextensivity, where the entropy is largely indifferent to changes in the radial molecular size while being governed by the interaction range ratio $[eqn]$ (Eq. (2), Fig. 3(f) and (l), and SI Figs. S5-S10(f) and S12-S22(f)), is likely the key property ensuring that the structure can quickly react to environmental changes and visit unconventional metastable states. In summary, $[eqn]$ can be intuitively understood as a regulator of the NaVCh’s entropy-driven behavior, modulating the tendency of the components to either pack tightly or unpack.

According to (17), (20), we expect $[eqn]$ to maximize in response to attenuation of repulsive effects ( $[eqn]$ ). Consistent with this expectation, we find that the maximum of $[eqn]$ typically occurs in the mid-pore region, where the equivalent of a hydrophobic core exists, namely, a prevalently hydrophobic central cavity (CC) (e.g., see Fig. 1 in [43]). In this mid-pore region, resisting structural disorder caused by excessive hydrophobicity-induced configurational freedom appears to be crucial: we observe that the PD tends to reduce its characteristic size, thereby decreasing its intrinsic dimension (SI S2.2) and, in turn, ensuring more efficient spatial organization.

The PD/VSD structural transition as an order/disorder phase transition

3.1.3

Implications for the NaVCh functional architecture arising from the cancellation of interfacial tensions in the vicinity of the PD/VSD interface are best summarized by the membrane-perpendicular first-order hydropathic dipole field (HDF) amplitude, $[eqn]$ [kcal $[eqn]$ Å] (as explained in SI 1.5.2).

The sign of $[eqn]$ indicates whether the $[eqn]$ -field induced by the PD at the current pore point is oriented extracellularly ( $[eqn]$ ) or intracellularly ( $[eqn]$ ) (Fig. 3 caption and SI 2.4.2, 2.5.2).

As shown in Fig. 3(e) and (k) and SI Figs. S5-S10(e), S12-S22(e), distinguishing between $[eqn]$ and $[eqn]$ highlights the critical [30], [31] nature of the inflection point.

Specifically, $[eqn]$ and $[eqn]$ exhibit clear negative and positive peaks, respectively, for $[eqn]$ , indicating that $[eqn]$ is globally maximized. This, in turn, implies that the rate of change of $[eqn]$ has stagnated, causing the interfacial free-energy pair $[eqn]$ [kcal] to nearly vanish. This behavior is akin to a smooth (i.e., second-order) phase transition, in which, near the inflection point of the order parameter $[eqn]$ , the associated response functions $[eqn]$ and $[eqn]$ exhibit sharp critical behavior.

Regardless of its orientation, $[eqn]$ increments in a power-law fashion in both prokaryotic and eukaryotic PDs. The corresponding scaling exponents, $[eqn]$ (Eq. (23)), are narrowly distributed with $[eqn]$ and $[eqn]$ (see Fig. 4(e) and (f), respectively). High Pearson correlation coefficients verify that these observations reflect a genuine self-similar increment behavior (e.g., see Fig. 3(c) from [34]). Microscopically, this necessitates that hydropathic dipoles connecting radial atom neighbors are nonrandomly aligned [34], since random alignment would make the self-similar incrementation of their cumulative index an extraordinary evolutionary accident.

The narrowness of the $[eqn]$ -distribution (Fig. 4(e) and (f)), further necessitates a highly specialized nature for the water-mediated interactions established between the PD and the ion/water pore mixture. The PD operates under a narrow-banded dehydration protocol, through which ion selectivity can naturally emerge, as the radial alignment of dipoles around a pore point guarantees that water reorganization energies optimally dissipate into the atomic structure. Perturbation amplification is strictly unidirectional: perturbation amplitude attenuates inwards (toward the pore-lining interface, $[eqn]$ ), while it amplifies outwards (toward the PD/VSD interface), irrespective of the current pore point. This implies the existence of a directed (or ordered) allosteric network inside the PD, where any pair of perturbed pore-lining constituents can potentially influence the same distant neighbor, unlike the other way around.

Once the characteristic PD size is surpassed (i.e., for $[eqn]$ ), $[eqn]$ bends upwards, while $[eqn]$ bends downwards, thereafter exhibiting transient behaviors that cannot be described by a single scaling rule, as inferred from the broad, zero-centered distribution of $[eqn]$ (Fig. 4(e) and (f)). $[eqn]$ may thus increment, decrement, or stagnate beyond the PD (note how percentile clouds broaden for $[eqn]$ in Fig. 3(e) and (k) and SI Figs. S5-S10(e), S12-S22(e)). Given the high Pearson correlation coefficients associated with distributional $[eqn]$ -patterns on both sides of zero (Fig. 4(e) and (f)), the observed diversification of the scaling behavior of $[eqn]$ beyond the PD, supports that allosteric pathways coupling the PD to the VSDs are bidirectional: perturbations can be either attenuated or amplified from a VSD toward the PD/VSD interface in a pore-point-specific manner (SI 2.4.3, 2.5.3). This implies the multidirected (or disordered) nature of the allosteric network coupling the PD with the VSDs.

To exemplify, we consider the NaV1.7 molecule and illustrate the scaling of $[eqn]$ (and $[eqn]$ ) alongside $[eqn]$ on the two sides of the pore: (i) the ES side of the Asp384/Glu942/Lys1422/Ala1714 SF sequence, where an ion-attracting funnel is formed, and (ii) the AG side, where the inactivation particle (Ile1742/Phe1743/Met1744 motif) is located (Fig. 5(a)). $[eqn]$ and $[eqn]$ show excellent agreement, demonstrating the importance of correctly estimating the value of $[eqn]$ along the pore (Fig. 5(b) and (c)). On the ES side, $[eqn]$ indicates that the atomic environment is packed so as to effectively attenuate repulsive interactions, whereas at the AG, $[eqn]$ indicates that repulsive interactions have recovered and become comparable to the attractive interaction range (Fig. 5(c) vs. (b)). This difference in atom packing appears to leave the intrinsic dimension of the PD largely unaffected, showcasing how the PD can readjust its characteristic size under substantial $[eqn]$ -variations to preserve structural and functional integrity as the range of repulsive interactions changes (insets in Fig. 5(b) and (c), together with Eq. (19) and Section 3.1.1). The scaling of $[eqn]$ exhibits a similar trend in both cases along the characteristic radial extent of the PD, with the corresponding scaling exponents lying close to one another and assigned high PC values, in agreement with our expectations from Fig. 4(f) (Fig. 5(d) and (e)). The key difference arises beyond the characteristic PD size, where $[eqn]$ bends downward on the ES side but continues to increase on the IS side, approximately obeying the same power law, revealing that perturbations on the two sides of the NaV1.7 pore are differentially processed (Fig. 5(d) vs. (e)). Implications of this observation for NaV1.7 mutational robustness in the context of human pain disease are examined in detail below.Fig. 5Scaling of hydropathic dipole field around the NaV1.7 pore. (a) Side view of a human NaV1.7 channel (PDB code: 7w9k). The pore domain (PD) and voltage sensor domains (VSDs) are illustrated in red and blue, respectively; the $[eqn]$ subunits are omitted for clarity, and the van der Waals surfaces of the selectivity filter (SF) sequence Asp384/Glu942/Lys1422/Ala1714 and of the inactivation particle (IP) Ile1742/Phe1743/Met1744 are highlighted. Magenta circles represent the spherical surfaces $[eqn]$ that delimit the characteristic size of the PD. (b) and (c) Example traces of the empirical cumulative atom number $[eqn]$ (Eq. (32)) and its best-fit model $[eqn]$ (Eq. (3)) on different sides of the pore, for representative pore points $[eqn]$ (intracellular side) and $[eqn]$ (extracellular side). Insets show the intrinsic PD dimension $[eqn]$ (Eq. (19)). (d) and (e) Example traces of the empirical membrane-perpendicular hydropathic dipole field $[eqn]$ (Eq. (21)) acting on different sides of the pore. The power-law exponents are computed over the radial extent of the PD and the VSDs according to the general rule outlined in Eq. (23). Associated Pearson correlation coefficients quantify the goodness of the power-law approximation and are $[eqn]$ for all cases. Molecular illustrations were generated using Yasara software [92].Fig. 5

NaV1.7 mutational robustness in the context of human pain diseases

3.2

Let us assume that the NaV1.7 molecule exhibits a uniform mutational robustness profile, meaning the molecule’s ability to tolerate mutations is approximately the same across all of its constituent parts. Given that the PD/VSD interface is the most occupied molecular surface, it then follows that this region is also a mutation hotspot, i.e., a geometric site where the likelihood of residue mutation is highest.

To scrutinize the validity of this assumption, we examine mutation clustering in the highest-resolution NaV1.7 structure currently available (PDB code: 7w9k). We consider all SCN9A-gene-related mutations found in gnomAD [23] and ClinVar [64] databases, yielding $[eqn]$ events spanning three main categories: pathogenic (6.5 %), non-pathogenic (11.3 %), and variants of uncertain significance (VUS) (82.2 %). The set of pathogenic mutations contains the subset of pain-disease-associated mutations (2.7 %) and a substantial portion of pain-disease-unrelated pathogenic mutations (3.8 %). Pain-disease-associated mutations are categorized as either gain-of-function (GoF) (2.1 %) or loss-of-function (LoF) (0.5 %) (SI S1.9). Non-pathogenic mutations comprise two subsets: benign and neutral, with the latter consisting of carefully selected negative controls for human pain disease, sourced from Refs. [43], [65].

To probe mutation clustering relative to the PD/VSD interface, we consider the dataset $[eqn]$ , where, for each of the $[eqn]$ mutations, $[eqn]$ is the Euclidean distance between a pore point and the geometric center of the residue undergoing mutation (its ‘structural location’). The following rule determines whether the mutational robustness uniformity assumption is accepted or rejected: if, at a given pore region, the mean, mode, and median of $[eqn]$ vanish simultaneously, then the likelihood of observing a mutation is maximized at $[eqn]$ , and the underlying mutation distribution is approximately bell-shaped, assigning roughly equal mutation weight (in a statistical sense) to the PD and the VSDs.

We report that the uniform mutational robustness hypothesis for the NaV1.7 molecule is rejected along $[eqn]$ , except at the AG pore region (Fig. 6(a)), as explained below.Fig. 6Distributional characteristics of mutations across the NaV1.7 structure. We summarize the statistical properties of the $[eqn]$ dataset in terms of the median, $[eqn]$ , and the mean, $[eqn]$ , for pore points $[eqn]$ . Here, $[eqn]$ denotes the Euclidean distance between $[eqn]$ and the geometric center of the residue being mutated. We break down $[eqn]$ into subsets, each corresponding to a different mutation set. Insets visualize the histogram of the $[eqn]$ dataset, i.e., the collapsed distribution incorporating contributions from all pore points. In (a), we show the $[eqn]$ -distribution of the parent set of all (i.e., Path. $[eqn]$ Benign $[eqn]$ Neutr. $[eqn]$ VUS) SCN9A-gene mutations. $[eqn]$ identifies the structural location where the coin-flipping entropy is maximized. Note that if $[eqn]$ , $[eqn]$ can be approximated by $[eqn]$ (Eq. (8)). The empirical and theoretical instances of $[eqn]$ are determined by the $[eqn]$ values for which $[eqn]$ and $[eqn]$ are satisfied, respectively. The parent dataset is broken down as follows: (b), pathogenic and non-pathogenic (Path. $[eqn]$ Benign $[eqn]$ Neutr.). (c), pathogenic (Path). (d), non-pathogenic (Benign $[eqn]$ Neutr.). (e), gain-of-function (GoF) and loss-of-function (LoF) (GoF $[eqn]$ LoF). (f), GoF. (g), LoF. Note that a negative and positive value of $[eqn]$ indicates that the residue being mutated is likely residing in the PD and the VSDs, respectively. The areas highlighted in light magenta represent the pore regions where the median is minimized.Fig. 6

While the mean and median of $[eqn]$ closely agree, and the mode oscillates around them within reasonable bounds (Fig. 6(a)), their systematic deviation from zero reflects random genetic fluctuations governed by a flipping-coin maximum entropy principle. Accordingly, the probabilities of a mutation occurring inside or outside $[eqn]$ are given by $[eqn]$ and $[eqn]$ , respectively. Maximizing the flipping-coin entropy, $[eqn]$ , requires that $[eqn]$ , with $[eqn]$ denoting the $[eqn]$ -value for which $[eqn]$ is satisfied.

As shown in Fig. 6(a), the line $[eqn]$ coincides with the $[eqn]$ -median, verifying that mutation clustering in the NaV1.7 is predominantly shaped by the flipping-coin randomness. Across an evolutionary time-scale, this results in a bell-shaped mutation distribution approximately centered at $[eqn]$ (Eq. (8)), ensuring that molecular diversity is explored inside and outside $[eqn]$ in an unbiased manner. Since the range of attractive interactions prevails over the range of repulsive interactions (SI Fig. S23), the $[eqn]$ -interface covers the PD/VSD interface (Fig. 7(a), thus serving as a buffer zone that absorbs mutation-induced perturbations and mitigates their impact on the PD.Fig. 7Inertia and conductivity profile of structural locations attracting pain-disease-associated mutations. (a), Side view of a human NaV1.7 channel (PDB code: 7w9k). The PD and VSDs are illustrated in red and blue, respectively. For clarity, the $[eqn]$ subunits are not shown. The dense arrangement of $[eqn]$ -balls along $[eqn]$ creates the smooth cylindrical surface (in green), denoted $[eqn]$ . For comparison, the characteristic size of the PD, $[eqn]$ , is also illustrated (in magenta). Note that the inequality $[eqn]$ arises from the atom-packing condition $[eqn]$ (SI Fig. S23). At the AG pore region, where $[eqn]$ ( $[eqn]$ ), $[eqn]$ implies $[eqn]$ with $[eqn]$ (see Eq. (8)). In (b) and (c), we statistically summarize the $[eqn]$ -cluster inertias, $[eqn]$ , and the $[eqn]$ -cluster conductivities, $[eqn]$ , characterizing structural locations where gain-of-function (GoF), loss-of-function (LoF), neutral (Neutr.), and benign (Benign) mutations appear, for $[eqn]$ . (d) and (e) provide analogous information at the interface level: we statistically summarize the $[eqn]$ -shell, inertias $[eqn]$ , and the $[eqn]$ -shell conductivities, $[eqn]$ . Clouds around the index traces represent the min–max range of the underlying data points. The misclassified (misclass.) subset contains pain-disease-associated mutations that are systematically misclassified by the machine-learning algorithm (see Fig. 8(b). The conductivity index derived from the first-order hydropathic dipole field is shown separately in SI Fig. S4, as the pair $[eqn]$ satisfies Eq. (24) only partially (SI S2.3.1). The construction of the statistical summary indices of $[eqn]$ and $[eqn]$ is described in SI S1.8.2. Molecular illustrations were generated using Yasara [92].Fig. 7

In contrast, this is not the case at the AG pore region, where all statistical indices considered above tend to vanish (Fig. 6(a)), consistent with a uniform mutational robustness profile as $[eqn]$ coincides with the PD/VSD interface (Fig. 7(a)). Although the PD loses its protective buffer, it gains evolutionary flexibility, allowing for greater exploration of molecular diversity. Moreover, mutations around the AG pore region weigh equally on the PD and the VSDs, preventing either from being disproportionately stressed.

Collapsing $[eqn]$ into a distribution yields a roughly symmetric bell shape, well described by a normal distribution with mean of $[eqn]$ and standard deviation of $[eqn]$ (inset in Fig. 6(a)). Excluding VUS shifts the distribution to the left, to $[eqn]$ (Fig. 6(b)). Pathogenic and non-pathogenic mutation distributions have means and standard deviations of $[eqn]$ and $[eqn]$ , respectively, indicating a slight preference for the PD and the VSDs (Fig. 6(c) and (d)). The pain-disease-associated mutation distribution is further left-shifted, with $[eqn]$ (Fig. 6(e)), indicating a strong preference for the PD.

Distinguishing between GoF and LoF mutations reveals opposite statistical trends (Fig. 6(f) and (g)). GoF mutations are concentrated near the AG (Fig. 6(f)), whereas LoF mutations converge toward the ES of the SF, where the extracellular funnel is formed (Fig. 6(g)). The molecular basis for GoF and LoF pain phenotypes can thus be rationalized in terms of qualitatively different perturbation modes. Positive $[eqn]$ exponents at the AG pore region favor GoF-triggered perturbations to amplify and propagate further into the VSDs (SI Fig. S23). By contrast, negative $[eqn]$ exponents on the ES side of the pore disfavor further amplification of LoF-triggered perturbations into the VSDs (SI Fig. S23), potentially causing them to be locally absorbed, with detrimental impact on the PD.

Violation of inertia and conductivity constraints underpins human pain disease at the molecular level

3.2.1

Let us further assume that pain-disease-associated mutations exploit imbalances in the interaction network at the interface level to perturb the underlying $[eqn]$ -cluster.

To scrutinize this assumption for NaV1.7, we utilize the Decomposition ansatz (24) (SI S2.3) to derive the logarithmic composite susceptibility $[eqn]$ (Eq. (29)) and combine it with the interfacial coupling strength $[eqn]$ (Eq. (28)) in the form of the ratio $[eqn]$ (with $[eqn]$ ). The quantity $[eqn]$ establishes an upper bound on the number of available perturbation modes encoded in a structural location (SI Eq. (S24)). Simply put, $[eqn]$ measures how many ways there are to destabilize an $[eqn]$ -cluster, thus serving as a perturbation potential (i.e., as a measure of the sandpile slope; SI Eq. (S24)) associated with the structural location under scrutiny.

A mechanical analogy rationalizing $[eqn]$ is that of a maximum torque/force principle. When tightening a bolt, applying force at a stable far-end grip, where the inertia is minimal, maximizes torque. Analogously, residues distributed over strongly coupled interfaces covering $[eqn]$ -clusters of vanishingly small inertia (probed with $[eqn]$ , $[eqn]$ ) are ideal mutagenesis sites, as they can effortlessly perturb the $[eqn]$ -cluster rotational profile. A similar mechanism governs perturbations of the $[eqn]$ -cluster conductivity profile, characterized by the transverse odd-parity moments $[eqn]$ ( $[eqn]$ ). A vanishingly small HDF amplitude indicates that the $[eqn]$ -cluster conductivity becomes highly susceptible to perturbations, such that even minor surface fluctuations can induce disproportionately large reorganizations in both the amplitude and orientation of the local dehydration forces.

In contrast to non-pathogenic mutations, pain-disease-associated mutations prefer interfaces covering $[eqn]$ -clusters whose inertia decreases when approaching the CC pore region from both sides and, additionally, at the SF and AG pore regions (Fig. 7(b)). Specifically, the inertia of the $[eqn]$ -clusters whose surfaces act as hotspots for GoF and LoF mutations vanishes twice and four times, respectively, at approximately symmetric locations along $[eqn]$ . The arrangement and number of these inertia zero-crossings may detrimentally perturb the NaV1.7 functional architecture by introducing excess rotational atom-packing degrees of freedom (DoF). Whether this leads to an increased or decreased pore-open probability and, consequently, to a GoF- or LoF-like electrophysiological signature is likely set by a threshold in the number of inertia zero-crossings, which determines the tolerable excess rotational atom-packing DoF before the gating cycle collapses. Targeting binding sites in the CC with anchoring molecules (e.g., local anesthetics) can therefore offer a rational strategy for restoring inertia and mitigating the effects of the perturbation.

Additionally, GoF and LoF mutations prefer interfaces covering $[eqn]$ -clusters whose conductivity is diminished beyond the AG pore region toward the IS (Fig. 7(c)). GoF mutations induce a conductivity zero-crossing on the ES of the SF pore region, whereas LoF mutations suppress conductivity on both sides of the SF without producing a definitive sign change exactly at the SF: the LoF-related trace in Fig. 7(c) approaches zero and fluctuates around it between the SF and the CC, as well as on the ES side of the SF. Pain-disease-associated mutations thus appear to maximally perturb HDFs in the vicinity of the SF and AG, but not exactly at them, thereby potentially altering ion/water fluxes precisely at mediator pore regions where the pore radius is rapidly changing.

At the interface level, inertia and conductivity amplitudes associated with pain-disease-related mutations fluctuate more strongly than those associated with non-pathogenic mutations (Fig. 7(d) and (e)). This suggests that GoF and LoF surface hotspots engage in substantially stronger coupling interactions with their environment compared to surfaces attracting non-pathogenic mutations. Moreover, GoF and LoF surface hotspots are characterized by oppositely signed interfacial conductivities at the ES of the CC, implying that, once perturbed, these interfaces could generate oppositely directed ion/water flux responses (Fig. 7(e)).

Verification via machine learning experimentation

3.2.2

We adopt a simple and transparent machine learning strategy based on a stratified $[eqn]$ -fold cross-validated support vector machine classifier with a radial basis function kernel, applied locally (per pore point) and then globally on features summarized via median statistics (for details, see SI S1.10). This two-stage setup is reminiscent of meta-learning, as the global model builds on information extracted by the local models, yet we intentionally avoid hyperparameter tuning or complex architectures so that $[eqn]$ -derived features – and their permutation-based importances – remain central to the analysis. To avoid divergent $[eqn]$ values, we treat $[eqn]$ and $[eqn]$ as two separate feature inputs and also investigate the significance of inertia- and conductivity-related constraints separately, ending up with four feature inputs: $[eqn]$ .

Machine learning experiment I: pain-disease-associated vs. neutrals (PDB: 7w9k)

3.2.2.1

Following previous works [35], [65], we apply our algorithm to a well-balanced dataset comprising pain-disease-associated mutations and neutral variants. The local performance of the classifier, evaluated in terms of the area under the curve (AUC) and F1 scores, remains generally stable along $[eqn]$ , indicating that the classifier can adapt well to local atom-packing conditions (SI S2.6.1).

A yes/no variant classification scheme based on a linear threshold is illustrated in Fig. 8(a). Its performance, evaluated in terms of the median AUC, reaches $[eqn]$ (Fig. 8(e) and Table 1).Fig. 8Machine-learning-assisted verification: pain-disease-associated mutation hotspots exhibit substantially distinct perturbation potential profile. (a), Summary of a machine-learning experiment evaluating the distinctiveness of the pain-disease-associated class (class_0: GoF $[eqn]$ LoF) relative to the neutral class (class_1: Neutr.). The gain-of-function (GoF) subclass comprises structural locations whose mutability is phenotypically associated with inherited erythromelalgia (IEM), small fiber neuropathy (SFN), and paroxysmal extreme pain disorder (PEPD) (SI Tab. S3). The loss-of-function (LoF) subclass comprises structural locations whose mutability is phenotypically associated with insensitivity to pain (SI Tab. S3). Neutrals are structural locations whose mutability is not likely to be phenotypically associated with human pain disease, sourced from [35], [65]. The optimal threshold at 0.462 corresponds to the median of the best thresholds obtained through bootstrapping; each bootstrap sample of class_0 probabilities yields a threshold that maximizes the F1-score. X-axis ticks indicate the structural location of a mutation event in the NaV1.7 structure. X-axis ticks of misclassified pain-disease-associated mutations (i.e., those that do not pass the linear threshold) are highlighted in black. (b), Top-view illustration of the human NaV1.7 channel (PDB code: 7w9k). For clarity, the $[eqn]$ -subunits are not shown. The van der Waals surface of misclassified structural locations is highlighted in yellow. The PD and the VSDs are shown in red and blue, respectively. In (c) and (d), we plot the permutation-based importance scores for the features $[eqn]$ and $[eqn]$ , $[eqn]$ , which account for inertia and conductivity constraints at the cluster level, respectively, obtained via 200 permutation rounds per pore point (per feature set) (see SI S1.10 for details). In (e) and (f), we illustrate the distribution of the final area-under-the-curve (AUC) scores obtained from the final classification round for two different machine-learning experiments, namely, the one summarized in (a) and another with class_0: GoF $[eqn]$ LoF vs. class_1: Neutr. $[eqn]$ Benign. Benign denotes structural locations whose mutability is not likely to be associated with disease (SI S1.9). Details concerning the machine-learning algorithm design and parameter selection can be found in SI S1.10. Molecular illustrations were generated using Yasara [92].Fig. 8. Table 1Machine learning experiments summary. Median area-under-the-curve (AUC) and F1 scores obtained during the final classification round (SI 1.10.2). The first and second numbers of each $[eqn]$ -pair are median values obtained during training and testing the final classification model, respectively. The thermostability significance of inertia (iner.) and conductivity (cond.) constraints is probed by the feature inputs $[eqn]$ and $[eqn]$ , respectively. SCN9A-gene mutation datasets I, II, and III contain the classes $[eqn]$ class_0: GoF $[eqn]$ LoF, class_1: Neutr. $[eqn]$ , $[eqn]$ class_0: GoF $[eqn]$ LoF, class_1: Neutr. $[eqn]$ Benign $[eqn]$ , and $[eqn]$ class_0: Path., class_1: Neutr. $[eqn]$ Benign $[eqn]$ , respectively. GoF and LoF stand for gain-of-function and loss-of-function, respectively, and represent mutations associated with increased or diminished pain sensation (listed in SI Tab. S3). Neutrals (neutr.) are carefully selected human pain disease negative controls, sourced from [35], [65]. Benign variants are generally not expected to be associated with a disease phenotype. Note that the pathogenic (path.) class contains both pain-disease-associated and pain-disease-unrelated pathogenic mutations.Table 1PDB: 7w9k, res.: 2.2 ÅPDB: 6j8j, res.: 3.2 ÅConstraintiner./cond.iner.cond.iner./cond.iner.cond.AUC (dataset I) $[eqn]$ $[eqn]$ $[eqn]$ $[eqn]$ $[eqn]$ $[eqn]$ F1 (dataset I) $[eqn]$ $[eqn]$ $[eqn]$ $[eqn]$ $[eqn]$ $[eqn]$ AUC (dataset II) $[eqn]$ $[eqn]$ $[eqn]$ $[eqn]$ $[eqn]$ $[eqn]$ F1 (dataset II) $[eqn]$ $[eqn]$ $[eqn]$ $[eqn]$ $[eqn]$ $[eqn]$ AUC (dataset III) $[eqn]$ $[eqn]$ $[eqn]$ $[eqn]$ $[eqn]$ $[eqn]$ F1 (dataset III) $[eqn]$ $[eqn]$ $[eqn]$ $[eqn]$ $[eqn]$ $[eqn]$

The $[eqn]$ feature-importance profile shows that maintaining higher-order interactions becomes increasingly important around the SF pore region. The importance of the second- and fourth-order inertia features exceeds that of the zeroth-order feature, suggesting that rotational and vibrational modes up to at least fourth order may influence ion selectivity (Fig. 8(c)). Corroboratively, the importance of higher-order interfacial inertia features rises sharply at the SF pore region, while, strikingly, the zeroth-order term $[eqn]$ loses significance (SI Fig. S26(a)).

Additionally, at the SF pore region, higher-order conductivity features – particularly those of third and fifth order – become increasingly dominant, even surpassing the importance of first-order features (Fig. 8(d)). This indicates that volumetric, potentially asymmetric interactions may be crucial for the dehydration of sodium ions. In the $[eqn]$ feature domain, we observe similar dependencies at the ES entry point of the SF (SI Fig. S26(b)).

To clarify whether these results depend on the sign of the features, which encode information about the direction of the perturbation response ((22), (27)), we repeat the machine learning experiment with the unsigned feature input $[eqn]$ . Classification performance becomes only marginally worse (Table 1 vs. SI Tab. S5), suggesting that the NaV1.7 functional architecture is affected by violations of the inertia and conductivity constraints shown in Fig. 7 in a largely direction-independent manner.

Machine learning experiment II: pain-disease-associated vs. non-pathogenic (PDB: 7w9k)

3.2.2.2

To demonstrate that our findings are not biased by the specific choice of neutrals, we repeat the machine learning experiment, this time focusing on the non-pathogenic mutation subset containing both neutral and benign mutations.

The resulting AUC scores have a median of $[eqn]$ (Fig. 8(f) and Table 1). Despite considerable class imbalance – non-pathogenic mutations greatly outnumber pain-disease-associated ones, with a class ratio as low as $[eqn]$ – the algorithm maintains both accuracy and stability, showing no detrimental effects (SI Figs. S24(c), S25(c)). These findings reinforce that pain-disease-associated mutations occur in molecular neighborhoods with a perturbation-potential profile that differs substantially from that of non-pathogenic mutations.

Machine learning experiment III: pathogenic vs. non-pathogenic (PDB: 7w9k)

3.2.2.3

Running our algorithm on the entire SCN9A-gene mutation dataset results in a performance drop of approximately $[eqn]$ , with the median AUC score decreasing to $[eqn]$ (Table 1). This reduction is primarily driven by the inclusion of pain-disease-unrelated pathogenic mutations. The perturbation-potential profile of structural locations whose mutability is associated with disease phenotypes distinct from painful or painless neuropathies resembles that of non-pathogenic mutations, at least up to the first derivative of $[eqn]$ . Our confidence in this finding is high, as fewer than $[eqn]$ of all pain-disease-unrelated pathogenic mutations are characterized as ‘Pathogenic/Likely pathogenic’ or ‘Likely pathogenic’.

Enhancing classifier performance would likely require incorporating higher-order derivatives of $[eqn]$ together with more refined scaling-exponent estimates, although noise becomes progressively harder to control with increasing derivative order and distance from the pore. Even with the current implementation, however, the gains from adding $[eqn]$ are very small, if present at all, which is plausibly explained by intrinsic structural noise and the noise sensitivity of derivative-based quantities (SI Tab. S5). We nevertheless report results for the combined feature set in the main text, to avoid relying solely on $[eqn]$ and to verify that including its derivative-based counterparts $[eqn]$ does not alter the conclusions, thereby reducing the risk of bias toward a single representation of the thermostability constraints. In line with these limitations, pain-disease-associated mutations affecting the VSDs are more likely to be misclassified (Fig. 8(b)). Four out of five misclassified GoF mutations are associated with an SFN phenotype and occur at VSD locations Arg185 (DI), Ile720 (DII), Ile739 (DII), and Arg1279 (DIII), while the fifth, linked to an IEM phenotype, is located at Trp1538 (DIV). Among the three misclassified LoF mutations, Arg99 (DI) and Leu172 (DI) reside in the VSD, and Cys1719 (DIV) lies in the extracellular loop connecting S5 and S6.

Repeating machine learning experiments I–III using a lower-resolution structure (PDB: 6j8j)

3.2.2.4

To assess how lower resolution impacts algorithm performance, we repeat the analysis using a widely studied NaV1.7 structure with lower resolution (PDB code: 6j8j). Despite a slight drop in performance, the results remain largely consistent (Table 1 vs. SI Tab. S5). Given that both the high- and low-resolution structures likely represent an inactivated state of the channel, the observed decrease in performance can be attributed to the limitations imposed by lower structural resolution.

Inertia-vs-conductivity: what matters more for NaV1.7 physiological functioning? (PDB: 7w9k, 6j8j)

3.2.2.5

Exchanging the full feature set $[eqn]$ with either $[eqn]$ or $[eqn]$ for the 7w9k or 6j8j structures results in a drop in algorithm performance that is always less than 10 % (Table 1, SI Tab. S5). Additionally, the AUC scores achieved with $[eqn]$ are close to those obtained with $[eqn]$ . Together, these results suggest that the feature sets $[eqn]$ and $[eqn]$ contain redundant information, implying that rotational and translational NaV1.7 degrees of freedom (DoF) are coupled. The corresponding perturbation modes are thus likely intertwined through feedback, with molecular rotations around the pore influencing the flow of ions through the pore, and vice versa.

Discussion

4

The NaVCh functional architecture represents the latest product of an evolutionary process initiated nearly three billion years ago [66]. It embodies a rich repertoire of metastable dynamics emerging from the instantaneous coordination of thousands of atoms assembled into several hundred amino acids and structurally organized into distinct domains around a central axis. Parsimonious models of the NaVCh functional architecture must therefore rely on evolutionarily conserved laws that connect the microstructure (single atom) with the macrostructure (multi-domain molecule) in a way that allows molecular functionality to emerge a priori, i.e., by virtue of the relationships established among these evolutionarily conserved laws. Since an atom is separated from a domain by orders of magnitude in length scale, our quest finds a natural place within the RG framework, which clarifies how (i.e., through which scaling operations) one can transition from one molecular scale to the next without losing the ability to reconstruct essential functional characteristics of the molecule. Because the evolutionarily conserved laws are precisely defined by the underlying scaling operations, the two concepts are interchangeable and are both encompassed by the term ‘scaling law.’ These considerations are summarized in the key analytical result of the RG flow equation (Eq. (30)), which is applied in a pore-point-specific manner, thereby establishing a connection between the porous microenvironment and the surrounding molecular macroembedding context.

Importantly, we arrived at Eq. (30) by starting from the simplest possible equation of state (Eq. (2)), which describes the number of atoms within an infinitely thin shell $[eqn]$ with a repulsive and a stabilizing term, yet is still capable of supporting a pore-forming macroenvironment in which a PD is radially succeeded by four VSDs. To identify the PD/VSD interface in an unbiased manner, we focused on NaVCh shape characteristics, specifically the sign change in the curvature of the ‘slow’ [67] state variable $[eqn]$ , which marks a prominent inflection point. We are therefore confident that the logical thread connecting Eq. (2) with Eq. (30) is well-founded, suggesting that renormalizability is a fundamental property of NaVCh protein molecules. It is thus unsurprising that our theoretical framework can recapitulate Widom’s scaling law [68] under mean-field-like conditions, and does so at a molecular scale matching the inflection point (SI S1.7.4), indicating that NaVChs belong to the same universality class as other multi-domain complex systems with some degree of fluidity, whose constituents interact via magnetic-like fields [69].

Experimental support comes from the analysis of sufficiently resolved, all-atom NaVCh structures, which adhere to Eq. (2) within reasonable structural constraints – deviating only when the PD is extended with a C-terminal domain and the VSDs are removed (SI Fig. S7). The NaVCh structure can thus be treated as an ‘extended’ [70] (or ‘transient’ [71]) self-similar object whose intrinsic dimension changes continuously with molecular scale (Eq. (11)). A statistical-mechanical description of such ‘extended’ fractals is available within the framework of nonextensive statistical mechanics [51], [52], hallmarked by the $[eqn]$ -generalized atom-packing entropy given by Eq. (17). Juxtaposing the notions of intrinsic dimension and atom-packing entropy suggests that formation of the central cavity is a structural consequence of hydrophobicities being buried inside the NaVCh core over evolutionary timescales (3.1.1 and SI S2.2). Accordingly, excess atom-packing degrees of freedom generated by increasing core hydrophobicity are compensated by a reduction in intrinsic dimension, thereby preserving the structural integrity of the PD sub-architecture, as described by Eq. (19). This phenomenon is generally interpreted within the evolution-driven dimensionality reduction framework [72], which favors more efficient spatial organization in eukaryotic NaVChs, as verified both here across the NaVCh superfamily and across thousands of proteins in Ref. [63]. The physical basis of this phenomenon lies in the long-range nature of water-mediated interactions, which allow spatially distant residues to influence one another [73], [74], [75]. Consistent with previous experimental findings reported in Refs. [58], [59], [60], [76], we find that the amplitude of the effective hydrophobic ‘force’ holding distant components together decays exponentially with increasing molecular scale, exhibiting a characteristic decay length, given by $[eqn]$ , on the order of 10 Å (Fig. 4(c) and (d)).

Self-similarity implies long-range interactions, which in turn lead to synergistic constituent functioning via allostery [77]. The NaVCh field is well positioned to begin contemplating the role of such synergies within NaVCh molecules and their significance in both physiological function and mutation-perturbed, disease-related contexts (e.g., see [78], [79], respectively). Our results suggest that the modus operandi of the PD primarily relies on synergy built upon a narrow-width hydropathic dipole field (HDF) exponent distribution (Section 3.1.3). We argue that this evolutionary trend effectively reduces the number of interaction modes required to peel water molecules from a sodium ion while simultaneously increasing the number of available pathways through which dehydration free energy can dissipate rapidly and deeply – i.e., in a nearly adiabatic manner – into the PD sub-architecture. Synergies are thus expected to arise from the unidirectional (outward-radiating) nature of allosteric interactions within the PD, enabling collective responses to perturbations, as observed in molecular dynamics simulations [80]. Beyond the PD, the directionality of allosteric interactions becomes diversified, supporting the notion that VSD constituents may engage in asymmetric interactions both among themselves and with the PD. This renders analysis of VSD information-processing capabilities challenging, calling for more detailed future investigations that examine each VSD individually. Nevertheless, even with our current coarse-grained approach, we infer that eukaryotic VSDs possess more specialized information-processing capabilities than their prokaryotic counterparts, enabling differential perturbation processing on the ES and IS (SI S2.5.3).

Our understanding of the molecular basis of human pain disease has been built upon painstaking experimental efforts, primarily relying on cell-based electrophysiology assays that screen single-amino-acid mutations in the human NaV1.7 molecule (for a list, see SI Tab. S3). Resolving the structure–function relationship for human NaV1.7 in the context of human pain disease can seem like assembling an almost impossible puzzle: each piece encodes only a tiny fraction of the relevant biophysical and neurophysiological possibilities. To demonstrate how our theoretical framework can simplify this task, we mapped all SCN9A gene mutations onto a wild-type NaV1.7 molecule and attempted to explain the observed mutation clustering patterns (Section 3.2). We identified the most frequently mutated interface and showed that it becomes indistinguishable from the PD/VSD interface around the AG, where mean-field conditions are established as $[eqn]$ . This finding indicates that the NaV1.7 mutation landscape has not emerged purely by chance but is constrained by NaV1.7 shape features that vary along the pore and are effectively captured by $[eqn]$ . A tendency toward a more globular-like shape lowers the risk of experiencing a detrimental perturbation: as the PD/VSD interface becomes the primary mutagenesis site, mutations tend to land between the PD and VSD, thereby sparing either domain from direct damage. Pain-disease-associated mutations appearing at the AG within the PD are most likely associated with a GoF phenotype, consistent with the mutation clustering patterns reported in Ref. [81]. This pattern is reversed for mutations linked to a LoF pain phenotype (Fig. 6, panel (g) vs. (f)); however, the limited number of available LoF pain mutations precludes any definitive conclusions. Nevertheless, when viewed through the RG lens, these observations suggest that the primary molecular distinction between GoF and LoF pain phenotypes lies in whether mutation-induced perturbation shocks are more likely to be distantly propagated or locally absorbed, as determined by the scaling properties of VSD-induced hydropathic fields (Section 3.2 and SI Fig. S23).

Just as adding a grain of sand to a steep slope can trigger an avalanche, mutations can exploit allosteric shortcuts [82] to augment NaVCh degrees of freedom (DoF), exerting a substantial destabilizing effect on the functional architecture. We conceive the NaVCh functional architecture as a hierarchically organized mechanical system centered around a principal pore axis, where hinges [83], [84], [85], [86] are arranged in nested levels – smaller sub-hinges exist within or regulate parts of larger hinges – thereby enabling multi-scale coordinated motion. Mechanistically, this implies a hierarchy among residues, with some more likely than others to significantly alter channel conductivity and inertia upon mutation-induced perturbation, analogous to how a force applied farther from a hinge generates a larger rotational effect about the axis, which can, in turn, induce large ion/water fluxes along the axis. An in-depth investigation of this idea was undertaken using the analytical procedures detailed in Section 2.1.4. The explanatory strength of this approach is illustrated in Fig. 7 and validated by a simple two-stage scheme reminiscent of meta-learning [87], in which a global model learns across the pore from prior, pore-point-specific (local) learning processes. Our algorithm achieved state-of-the-art AUC scores (relative to previous efforts [35], [65], [88]), demonstrating that a standard support vector machine classifier suffices to learn the biomechanical constraints that differentiate pain-disease-associated mutation hotspots from benign and/or neutral sites. However, it yields only mediocre results when applied to the entire dataset of pathogenic versus non-pathogenic SCN9A gene mutations. We argue that this is primarily due to low-resolution artifacts, which affect feature observability, and only secondarily due to limitations of the features themselves.

In summary, relaxing the inertia constraints that lock the transmembrane helical segments around the CC in a screwed-in state is a key perturbation mode through which GoF electrophysiological signatures can arise in the context of human pain disease. An analogous mechanism applies to LoF electrophysiological signatures, but in a more spatially extended form: enhancing rotational freedom at sites along the pore other than the CC can diminish open-state probability. It is therefore reasonable to regard desynchronization as the general mechanistic principle for shifting from a GoF-associated electrophysiological phenotype regime toward a LoF one. Weak desynchronization of transmembrane helical rotations augments channel activation dynamics by increasing the number of accessible PD/VSD coupling configurations, ultimately raising the probability of the open state. Strong desynchronization, however, has the opposite effect: different parts of the transmembrane helical segments begin to behave as asynchronous rotors, effectively decoupling the PD from the VSD and thereby diminishing open-state probability. Targeting NaV1.7’s CC with drug molecules that restore inertial dynamics could therefore serve as an experimental validation strategy. Dismantling orientation and amplitude constraints of HDFs along the extracellular funnel toward the SF constitutes an equally important perturbation mode. Our data suggest that whether a mutation results in gain-of-function (GoF) or loss-of-function (LoF) may depend on the orientation of dehydration forces exerted upon incoming ions (Fig. 7(c) and (d)), a hypothesis that can be tested through molecular dynamics simulations. How these predictions will generalize to larger datasets of GoF versus LoF mutations remains, however, to be determined.

Our findings suggest that evolution has fine-tuned NaVCh metastable dynamics by biasing mutation occurrence toward the outer vicinity of the PD/VSD interface (specifically, on the PD outer boundary described by Eq. (8) and illustrated in Fig. 7). Electrophysiologists can use this principle to interpret and design experiments, while pharmacologists can exploit it to modify NaVCh metastable dynamics in a desired manner. In practice, this means that structurally corresponding sites on the ‘safe-zone’ spherical surface $[eqn]$ , located on the extracellular (ES) versus intracellular (IS) side, are nonequivalent and are expected to support distinct perturbation responses. This offers a concrete experimental strategy: electrophysiological assays can systematically target matched sets of residues on $[eqn]$ (e.g., ES-facing versus IS-facing or axially symmetric positions) and compare their effects on activation, inactivation, and use-dependence. Such ‘axial symmetry scans’ effectively mimic the way evolution samples this interface and, under our framework, are predicted to differentially modulate PD/VSD coupling rather than yield uniform outcomes. The most interesting case arises around the AG, where the ‘safe zone’ coincides with the PD/VSD interface. This overlap identifies a spherical shell in which mutations can modulate PD/VSD coupling while remaining preferentially sampled – and apparently tolerated – by evolution, making it an especially attractive region for targeted electrophysiological and pharmacological interventions.

Several limitations apply to our work. First, the structural dataset considered here does not account for the full configuration repertoire that a NaVCh molecule can adopt. Most NaVChs of eukaryotic origin represent snapshots of an inactivated state. Second, we acknowledge that the high-resolution ( $[eqn]$ Å) NaV1.7 structures with PDB codes 8s9b [89], 8xmm [90], and 8thh [91] were not available at the time of structural dataset assembly and are therefore not included in the present analysis. Third, misclassification of VSD hotspots remains an inherent challenge, which could potentially be mitigated by either partitioning the flow into four subflows, each sampling a single VSD, or by initiating RG flows along axes that traverse each VSD individually. As long as Eq. (2) is satisfied, the latter option introduces an axis translation, shifting the perspective from the central pore axis to peripheral VSD-centered axes that simulate the direction along which gating charges move. Lastly, our work does not address dynamic NaVCh aspects, which, in turn, further limits our understanding of hydropathic interaction network intensification characteristics in the VSDs.

In conclusion, the RG provides a coarse-grained yet explanatory framework that efficiently describes NaVCh functional constraints. Leveraging this, machine learning approaches balance computational efficiency with interpretability, enabling clinically relevant insights into pain disorders and advancing understanding of the physicochemical principles shaping NaVCh function. Because our procedures are completely general, they could provide a foundation for analyzing various gene-protein relationships in a pathophysiological context, particularly for large globular or pore-forming systems that are hierarchically organized around a principal point set.

CRediT authorship contribution statement

Markos N. Xenakis: Writing – review & editing, Writing – original draft, Visualization, Validation, Software, Methodology, Investigation, Formal analysis, Data curation, Conceptualization. Angelika Lampert: Writing – review & editing, Writing – original draft, Validation, Supervision, Resources, Project administration, Methodology, Investigation, Funding acquisition, Data curation, Conceptualization.

Funding

This work was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) through the following grants awarded to A.L.: 363055819/GRK2415 “Mechanobiology of 3D epithelial tissues (ME3T)”, 368482240/GRK2416 “MultiSenses-MultiScales”, and LA2740/6-1.

Code availability

The code used throughout is available at: https://github.com/mnxenakis/NaVCh_Scaling

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Bibliography89

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Hille B.Ionic channels of excitable membranes 3rd ed.2001 Sinauer Associates Sunderland, MA
2Hodgkin A.L.Huxley A.F.Currents carried by sodium and potassium ions through the membrane of the giant axon of loligo J Physiol 1164195244947210.1113/jphysiol.1952.sp 00471714946713 PMC 1392213 · doi ↗ · pubmed ↗
3Catterall W.A.Voltage-gated sodium channels at 60: structure, function and pathophysiology J Physiol 5901120122577258910.1113/jphysiol.2011.22420422473783 PMC 3424717 · doi ↗ · pubmed ↗
4Ahern C.A.Payandeh J.Bosmans F.Chanda B.The hitchhiker’s guide to the voltage-gated sodium channel galaxy J Gen Physiol 1471201512410.1085/jgp.201511492 PMC 469249126712848 · doi ↗ · pubmed ↗
5Hille B.The permeability of the sodium channel to organic cations in myelinated nerve J Gen Physiol 586197159961910.1085/jgp.58.6.5995315827 PMC 2226049 · doi ↗ · pubmed ↗
6Hille B.The permeability of the sodium channel to metal cations in myelinated nerve J Gen Physiol 596197263765810.1085/jgp.59.6.6375025743 PMC 2203202 · doi ↗ · pubmed ↗
7Hille B.Ionic selectivity, saturation, and block in sodium channels. A four-barrier model J Gen Physiol 665197553556010.1085/jgp.66.5.5351194886 PMC 2226224 · doi ↗ · pubmed ↗
8Roux B.Bernèche S.Egwolf B.Lev B.Noskov S.Y.Rowley C.N.Yu H.ION selectivity in channels and transporters J Gen Physiol 1375201141542610.1085/jgp.20101057721518830 PMC 3082929 · doi ↗ · pubmed ↗