Factorization Machine with Quadratic-Optimization Annealing for RNA Inverse Folding and Evaluation of Binary-Integer Encoding and Nucleotide Assignment
Shuta Kikuchi, Shu Tanaka

TL;DR
This paper introduces a novel factorization machine-based optimization method for RNA inverse folding, analyzing how different binary-integer encodings and nucleotide assignments impact solution quality and stability.
Contribution
It develops a new FMQA framework for RNA inverse folding and systematically evaluates encoding and assignment effects on solution performance.
Findings
One-hot and domain-wall encodings outperform binary and unary in solution quality.
Boundary integer assignments favor G and C nucleotides in stem regions.
Domain-wall encoding leads to more thermodynamically stable RNA structures.
Abstract
The RNA inverse folding problem aims to identify nucleotide sequences that preferentially adopt a given target secondary structure. While various heuristic and machine learning-based approaches have been proposed, many require a large number of sequence evaluations, which limits their applicability when experimental validation is costly. We propose a method to solve the problem using a factorization machine with quadratic-optimization annealing (FMQA). FMQA is a discrete black-box optimization method reported to obtain high-quality solutions with a limited number of evaluations. Applying FMQA to the problem requires converting nucleotides into binary variables. However, the influence of integer-to-nucleotide assignments and binary-integer encoding on the performance of FMQA has not been thoroughly investigated, even though such choices determine the structure of the surrogate model and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRNA and protein synthesis mechanisms · DNA and Nucleic Acid Chemistry · Genomics and Chromatin Dynamics
