Boltzmann machine learning and regularization methods for inferring evolutionary fields and couplings from a multiple sequence alignment
Sanzo Miyazawa

TL;DR
This paper explores regularization and learning techniques for inferring evolutionary interactions in proteins using Boltzmann machine learning, demonstrating effective methods for sparse coupling inference and analyzing the recovery of interactions from sequence data.
Contribution
It introduces a combined regularization approach and a modified Adam method for better inference of sparse interactions in Boltzmann models of protein evolution.
Findings
Group L1 regularization outperforms L2 and L1 for sparse couplings.
Regularization parameters can be tuned to match average energies, ensuring smooth convergence.
Fields and couplings are accurately recovered, but pairwise correlations are harder to infer in natural proteins.
Abstract
The inverse Potts problem to infer a Boltzmann distribution for homologous protein sequences from their single-site and pairwise amino acid frequencies recently attracts a great deal of attention in the studies of protein structure and evolution. We study regularization and learning methods and how to tune regularization parameters to correctly infer interactions in Boltzmann machine learning. Using regularization for fields, group for couplings is shown to be very effective for sparse couplings in comparison with and . Two regularization parameters are tuned to yield equal values for both the sample and ensemble averages of evolutionary energy. Both averages smoothly change and converge, but their learning profiles are very different between learning methods. The Adam method is modified to make stepsize proportional to the gradient for sparse couplings. It is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAdam
