Many-body contacts in fractal polymer chains and fBm trajectories
K.E. Polovnikov, S. Nechaev, M.V. Tamm

TL;DR
This paper derives probabilities of multiple contacts in fractal polymer chains modeled by fractional Brownian motion, providing a mathematical framework to interpret complex genomic contact data from Hi-C experiments.
Contribution
It introduces a determinant-based formula for multi-body contact probabilities in fractal polymers, extending previous models to include arbitrary fractal dimensions.
Findings
Derived a determinant expression for multi-contact probabilities
Mapped fractional Brownian motion to polymer conformations
Proposed a model for interpreting genome contact data
Abstract
We calculate the probabilities that a trajectory of a fractional Brownian motion with arbitrary fractal dimension visits the same spot times, at given moments , and obtain a determinant expression for these probabilities in terms of a displacement-displacement covariance matrix. Except for the standard Brownian trajectories with , the resulting many-body contact probabilities cannot be factorized into a product of single loop contributions. Within a Gaussian network model of a self-interacting polymer chain, which we suggested recently, the probabilities we calculate here can be interpreted as probabilities of multi-body contacts in a fractal polymer conformation with the same fractal dimension . This Gaussian approach, which implies a mapping from fractional Brownian motion trajectories to polymer conformations, can be used as aβ¦
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Many-body contacts in fractal polymer chains and fBm trajectories
K.E. Polovnikov1,2, S. Nechaev3,4 and M.V. Tamm2,5
1 Skolkovo Institute of Science and Technology, 143026 Skolkovo, Russia
2 Faculty of Physics, Lomonosov Moscow State University, 119992 Moscow, Russia
3Interdisciplinary Scientific Center Poncelet (ISCP), 119002, Moscow, Russia
4 Lebedev Physical Institute RAS, 119991, Moscow, Russia
5 Department of Applied Mathematics, MIEM, National Research University Higher School of Economics, 101000, Moscow, Russia
(March 17, 2024)
Abstract
We calculate the probabilities that a trajectory of a fractional Brownian motion with arbitrary fractal dimension visits the same spot times, at given moments , and obtain a determinant expression for these probabilities in terms of a displacement-displacement covariance matrix. Except for the standard Brownian trajectories with , the resulting many-body contact probabilities cannot be factorized into a product of single loop contributions. Within a Gaussian network model of a self-interacting polymer chain, which we suggested recently, the probabilities we calculate here can be interpreted as probabilities of multi-body contacts in a fractal polymer conformation with the same fractal dimension . This Gaussian approach, which implies a mapping from fractional Brownian motion trajectories to polymer conformations, can be used as a semiquantitative model of polymer chains in topologically-stabilized conformations, e.g., in melts of unconcatenated rings or in the chromatin fiber, which is the material medium containing genetic information. The model presented here can be used, therefore, as a benchmark for interpretation of the data of many-body contacts in genomes, which we expect to be available soon in, e.g., Hi-C experiments.
What is the probability for a Brownian trajectory to visit the same space point several times? What is the number of points in space visited exactly times by a single polymer chain? These questions are of significant importance for the classical polymer physics deGennes_book ; DoiEdwards ; GrosbergKhokhlov ; Rubinstein . Indeed, it is known that conformations of ideal polymer chains and Brownian trajectories are isomorphic up to the redefinition of time: the time for a Brownian motion is replaced by the contour length for a polymer. As a result, the points in space visited times by a Brownian trajectory correspond exactly to the many-body interactions in a polymer chain. Answers to the above questions (provided that one allows for a proper coarse-graining to avoid ultraviolet divergencies), are essentially quite simple: since a Brownian motion is a Markovian process, every loop it forms is independent from the other. As a result, the probability of a βrosetteβ, , for the trajectory to return to a given point at sequential time moments is equal to , where is the two-body contact probability. Interestingly, even in this case the total number of points visited exactly times exhibits rich and interesting scaling behavior depending on and khokhlov_notes ; maj_tamm ; turban . However, this simple factorization does not hold anymore for the case of fractional Brownian motion with . The increments of a fractional Brownian motion have long-ranged slowly decaying correlations, and the probability of, say, second loop will essentially depend on the fact that the first loop is formed and on its length. In this note we study these loop correlations and explicitly calculate the resulting multiloop probabilities for fractal polymers with .
Apart from the fundamental interest of our results for the study of general properties of fractional Brownian motion and for polymer physics, the probabilities of multi-body interactions are of significant importance for biophysical applications. Recently in polovnikov_sm18 we proposed to use the fractional Brownian motion trajectories as an analytically simple semiquantitative model of collapsed polymer conformations with fractal dimension , including polymer conformations in topologically stabilized states, such as melts of nonconcatenated rings and chromosomes in living nuclei. Technically, our computation of the multiloop probabilities is based on a specific way of constructing ensembles of discretized fractional Brownian trajectories based on a Gibbs sampling with the quadratic Hamiltonian suggested in polovnikov_sm18 . In the framework of our model it is possible to calculate different experimentally accessible static polovnikov_sm18 and dynamic holcman ; polovnikov_prl18 properties of collapsed fractal chains with a given fractal dimension.
Modern experimental techniques of genome-wide chromosome conformation capture (Hi-C)dekker ; lieberman-eiden allow to describe a conformation of the chromatin (the constituting fiber of the chromosome, containing dsDNA with genetic information, as well as various proteins attached to it) by tabulating a full collection of in vivo pairwise contacts of the chromatin fiber with itself. This Hi-C data has proven hugely beneficial for our understanding of chromosome conformations under different conditions. There exist a wide range of theories used to rationalize the existing Hi-C data gns ; grosberg93 ; mirny11 ; grosberg_review ; nicodemi ; rosa14 ; shin14 ; loop_extrusion ; loop_extrusion2 , however all these models seem to agree that in a wide range of length scales the resulting equilibrium chromatin packing is approximately fractal with transient fractal dimension lying in the interval , , which corresponds to a space-filling curve, being the most obvious candidate for the true limiting fractal dimension in the limit of infinitely long chains.
Additional evidence concerning chromatin conformations should come from consideration of triple and many-body contacts, which, generally speaking, contain information on the properties of the fiber packing, which is not reducible to the information obtained from two-loci contacts. Recent advances in experimental techniques pedro16 ; oudelaar18 allow to expect that soon it will be possible to collect enough statistics on triple contacts from the Hi-C experiments. Therefore, theoretical approaches allowing to make sense of this upcoming data are urgently needed. Here we calculate the three-body and many-body contact probabilities in the simple model of a Gaussian polymer chain, which can be used as a basic benchmark to compare the experiment data with. In particular, we propose here to measure experimentally the two-loop correlation factor (the ratio of a three-body contact probability to the product of the probabilities of two independent loops of the same size) as an important characteristic of the chromatin conformation statistics, and provide concrete predictions for the value of this factor within our fractal Gaussian polymer model.
To begin with, define the trajectory of a discretized random walk, or, similarly, a conformation of a polymer chain by a set of coordinates . In the random walk terms, characterizes the position of the walker at sequential time ticks, . In polymer terms, fixing the Gaussian transitions between sequential βtimesβ in the set , we define the standard beads-on-string model GrosbergKhokhlov . In what follows we assume that , and the process is stationary in a sense that mean distance between and is -independent:
[TABLE]
There exist a unique stationary measure over the realizations of the walk , which satisfies
[TABLE]
for all and some fixed constant called fractal dimension of the walk. The variable in (2) plays a role of the time between moments and in the random walk language, while for a polymer it is the contour distance along the chain. The process described by the stationary measure is called fractional Brownian motion. There exist numerous equivalent definitions of this process mandelbrot , and it is often characterized by the so-called Hurst exponent . Clearly, the value (), which corresponds to Brownian motion, is the only case when the process described by (2) is Markovian. Indeed, (2) does not respect the Chapman-Kolmogorov relation for any . It follows immediately from (2) that the mean-square spatial distance between two points on a trajectory grows algebraically with
[TABLE]
and the covariance
[TABLE]
where we used notation .
It has been shown recently in polovnikov_sm18 that in the limit of large one can approximate for the subdiffusive () fractal Brownian motion (fBm) by a Gibbs measure with a simple quadratic Hamiltonian:
[TABLE]
with a proper choice of interactions coefficients (here and below we use lowercase and uppercase bold letters to denotes vectors in 3-dimensional and 3N-dimensional space, respectively). In particular, if the coefficients depend only on the chemical distance between monomers , so that , and if for decays algebraically:
[TABLE]
with some , then depending on there are three possible asymptotic regimes of polymer chain statistics.
If all monomers (points of the trajectory) asymptotically merge, and
[TABLE]
regardless ; 2. 2.
If the interaction is irrelevant and the large scale properties of the trajectory are indistinguishable from the standard Brownian motion with . 3. 3.
Finally, and most interestingly, if the relation (2) holds for with some renormalized and a non-trivial fractal dimension
[TABLE]
The value of is critical, giving rise to the logarithmic corrections to (2).
In the polymer context, quadratic interactions in (5) can be interpreted as a set of harmonic springs of varying rigidity connecting each pair of monomers, as shown in Fig.1A. The Hamiltonian of type (5) has appeared in the study of 3D structures of proteins bahar97 ; haliloglu97 ; min , it was used for the description of static and dynamic properties of marginally compact trees with various fractal architectures dolgushev1 ; dolgushev2 , as well as for phenomenological study of the Rouse dynamics of non-ideal polymer chains holcman ; polovnikov_prl18 . A proxy Hamiltonian of the same type has been recently used to obtain three-dimensional structures of chromosomes based on experimental Hi-C contact maps letreut18 . A related hierarchical variational approach for an account of volume interactions of swollen polymer chains had been proposed in burlatsky .
Now, we are interested in the probability of forming an -body contact, or, in random walk terms, in the probability for fBm to visit the same spatial point times. In the simplest case of , the probability for two beads separated by chemical distance to be at the same space point, is
[TABLE]
This result, up to the numerical constant, can be easily estimated from the following qualitative argument. If some monomer is fixed at a given point in space, the monomer is able to explore the volume of order
[TABLE]
If the probabilities to visit all points of this volume are the same, we expect (up to a numerical constant) for the looping probability the expression in a full agreement with (9).
Standard Brownian motion is a Markovian process. Accordingly, the probability of forming many body contacts can be constructed as a product of probabilities to form consecutive loops. In particular, the probability to form a βtwo-loop rosetteβ with loops of contour lengths is
[TABLE]
The relation (11) no longer holds in the non-Markovian case of fBm. Thus, one expects
[TABLE]
where are given by (9), and is some yet unknown correlation function. In what follows we calculate function explicitly.
Triple contacts are formed by two consecutive loops, which are, typically, correlated due to the long-range memory characteristic of fractional Brownian motion. Hence, the decomposition of the probability into the product of two probabilities of pairwise contacts (11) is not valid anymore. One can, however, write an explicit expression for by integrating over Gibbs measure with the effective Hamiltonian (5)
[TABLE]
where we assume without loss of generality that , the Gibbs distribution is given by (5), and the Green function, , reads
[TABLE]
where is the full partition function of the chain with one link fixed at a given spatial position
[TABLE]
the matrix is related to the coefficients of the Hamiltonian (5) by
[TABLE]
and we used the Fourier transform of the -function to obtain the last equality of (13). The Green function (14) is an -dimensional Gaussian integral with the linear term , where components of the vector are
[TABLE]
It equals therefore
[TABLE]
Introduce and () β the eigenvectors and, respectively, eigenvalues of the interaction matrix , ordered from the smallest eigenvalue to the largest, and scalar (inner) products
[TABLE]
The quadratic form in (18) can be then rewritten as
[TABLE]
Substituting (17) into (19) one can express βs in terms of the components of the eigenvectors , giving
[TABLE]
where denotes the -th component of the vector . The right hand side (21) is a double Gaussian integral, so it can be calculated explicitly:
[TABLE]
where elements of the matrix are
[TABLE]
and the power 3/2 in the right hand side of (22) comes from the fact that are 3-dimensional vectors. Interestingly, the elements of matrix have the meaning of the mean-square distances between corresponding monomers. Indeed, expressing these distances as sums over normal coordinates and using the equipartition theorem, which determines the average amplitudes of the normal modes at equilibrium (see polovnikov_sm18 for more details), one gets and, therefore,
[TABLE]
This correspondence allows to write down the probability of a triple contact in the form suggested in (12):
[TABLE]
where is the probability of a double contact (9), and the correlator reads
[TABLE]
This result is valid for any Gaussian polymer chain with interactions (10). In particular, for ideal chain (ordinary Brownian motion) vectors and are uncorrelated, thus , and one returns to the expression (11).
In the case when monomers are separated by a significant contour distance, i.e., when and , only the eigenvalues with small () are important. Moreover, it is possible to further distinguish between the cases of and . In the first case, only several first eigenvalues are relevant and the behavior is not universal, i.e., for example, the behavior of linear chains and cycles is different. Meanwhile, for it is not just several first eigenvalues that are important but the whole shape of the edge of the spectrum. In particular, if for the spectrum has the form , where is called the spectral dimension of the elastic graph dolgushev1 , then the resulting equilibrium conformation is self-similar with monomer-monomer distance distribution given by (2) and fractal dimension satisfying
[TABLE]
It is exactly what happens in the case of the Hamiltonian (5) with power-law decaying coefficients (6) (see polovnikov_sm18 ). This allows one to use (3) and (4) for the mean-square distance between the monomers, and distance-distance correlations, respectively, resulting in the following expression for the correlator , which, as it turns out, depends in this case on a single variable
[TABLE]
As one would expect, the correlation function is symmetric and negative for all non-zero for the subdiffusive fractional Brownian motion (). If , , as it should be for normal Brownian motion. Figure Fig.2 left shows the dependence . It is clearly seen that correlation is the strongest (the absolute value of coefficient is the largest) for the loops of equal size, the corresponding value being . If the loops lengths are very different the correlation coefficient converges to zero as a power law
[TABLE]
Summing up, the total probability of a triple contact in a case of a fractional Brownian motion trajectory reads
[TABLE]
where
[TABLE]
are contour lengths of the loops, is a characteristic microscopic length, and
[TABLE]
is a product of single loop probabilities. Note that the naive guess always underestimates the loop probability (because it neglects the effective attraction of the chain units). The largest boost in the loop probability is achieved for the loops of equal size and equals
[TABLE]
meaning, e.g., that in the case (), which is the most interesting case from the polymer physics point of view, the number of rosettes consisting of two loops of equal size should be some 7% larger than one would expect from the naive uncorrelated loop assumption. Notably, this boost increases rather rapidly with the fractal dimension (Fig.2 right), reaching, e.g., roughly 14% for , which is reported as a transient fractal dimension in some polymer systems, e.g., ideal randomly branched polymers stockmayer ; daoud_joanny ; everaers_grosberg . In turn, for the triple rosettes formed with loops of very different size one expects the boost factor
[TABLE]
We expect that with the development of many-body Hi-C techniques it will become possible to experimentally measure this excess formation of triple contacts as compared to the product of the probabilities of two independent loops. This prediction is characteristic of the Gaussian model we use here: indeed, in the system with hard-core short-range interactions one expects the formation of triple loops to be suppressed, not boosted. Thus, if it is indeed observed experimentally, it would be a nice new argument in support of the Gaussian formalism. Also, equations (33) and (34) allow new alternative ways of measuring and, therefore, the effective fractal dimension of chromatin packing, which can be further compared to the results obtained by different methods.
In conclusion, let us briefly discuss the generalization of the formalism developed above for the calculation of the probabilities of many-body contacts. The probability for a polymer interacting with itself via the potential (5) to form a -rosette made of loops anchored by a set of monomers with numbers can be written as
[TABLE]
where we introduced the notation . Proceeding in the same way as above one can express the integral on the right hand side of (35) in the following way, which is direct generalization of (22):
[TABLE]
where ia a covariance matrix of vectors connecting successive monomers in the cluster: . These expressions can be further simplified, for example for the probability of 4-body contacts one gets
[TABLE]
where the function equals
[TABLE]
Summing up, the main results of our work are the formulae (25), (26) for the triple contacts, and (36) for multiple contacts in a Gaussian polymer with Hamiltonian (5). It is known that for long polymer chains with interaction coefficients decaying as (6) the conformations of such polymers are statistically the same as trajectories of fractional Brownian motion, leading to the result (28) for the correlation coefficient, and formulae (30) and (31) for three-body contacts. We believe that these results constitute a basic benchmark for interpretation of the experimental data on many-body contacts in fractal polymer conformations with , in particular, the packing of chromosomes, which we expect to be available soon. On the other hand, from the random walk theory point of view, we expect our result to be essential for the understanding of the localization phenomena of random walks with long-ranged memory, e.g., adsorption of such a walker on a point-like potential well.
Acknowledgements
The authors are grateful to L. Mirny for stimulating discussions on the topic of this work. SN is grateful to the RFBR grant 18-29-13013mk for partial support, KP and MT acknowledge the support of the Foundation for the Support of Theoretical Physics and Mathematics βBASISβ (grant 17-12-278).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) P.-G. de Gennes, Scaling Concepts in Polymer Physics , (Cornell University Press, 1979).
- 2(2) M. Doi, S.F. Edwards, The Theory of Polymer Dynamics , (Oxford University Press, Oxford, 1986).
- 3(3) A.Y. Grosberg, A.R. Khokhlov, Statistical Physics of Macromolecules , (AIP Press, Woodbury, NY, 1994).
- 4(4) M. Rubinstein, R. Colby, Polymer Physics , (Oxford University Press, Oxford, 2003).
- 5(5) A.R. Khokhlov, Statistical physics of macromolecules. Lecture notes , (Moscow State University Press, 2004) (in Russian).
- 6(6) S.N. Majumdar, M.V. Tamm, Number of common sites visited by N random walkers, Phys. Rev. E, 86 , 021135 (2012).
- 7(7) L. Turban, Mean number of encounters of N random walkers and intersection of strongly anisotropic fractals, arxiv:1304.4106 (2013).
- 8(8) K. Polovnikov, S. Nechaev, and M. V. Tamm., Effective Hamiltonian of topologically stabilized polymer states, Soft matter, 14 , 31 (2018).
