TL;DR
Mesh-RFT introduces a fine-grained reinforcement learning framework with localized face-level refinement for 3D mesh generation, significantly improving geometric and topological quality over existing methods.
Contribution
It is the first to optimize mesh quality at the face level using a novel Masked Direct Preference Optimization (M-DPO) approach combined with topology-aware scoring.
Findings
Reduces Hausdorff Distance by 24.6%
Improves Topology Score by 3.8%
Outperforms global DPO methods in mesh quality
Abstract
Existing pretrained models for 3D mesh generation often suffer from data biases and produce low-quality results, while global reinforcement learning (RL) methods rely on object-level rewards that struggle to capture local structure details. To address these challenges, we present Mesh-RFT, a novel fine-grained reinforcement fine-tuning framework that employs Masked Direct Preference Optimization (M-DPO) to enable localized refinement via quality-aware face masking. To facilitate efficient quality evaluation, we introduce an objective topology-aware scoring system to evaluate geometric integrity and topological regularity at both object and face levels through two metrics: Boundary Edge Ratio (BER) and Topology Score (TS). By integrating these metrics into a fine-grained RL strategy, Mesh-RFT becomes the first method to optimize mesh quality at the granularity of individual faces,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
MethodsDirect Preference Optimization · Spatio-temporal stability analysis
