OphEdit: Training-Free Text-Guided Editing of Ophthalmic Surgical Videos

Ritul Jangir; Arkya Jyoti Bagchi; Aiman Farooq; Mangalton Okram; Saurabh Seetaram Korgaonkar; and Deepak Mishra

arXiv:2605.07695·cs.CV·May 11, 2026

OphEdit: Training-Free Text-Guided Editing of Ophthalmic Surgical Videos

Ritul Jangir, Arkya Jyoti Bagchi, Aiman Farooq, Mangalton Okram, Saurabh Seetaram Korgaonkar, and Deepak Mishra

PDF

1 Repo

TL;DR

OphEdit is a training-free, text-guided framework for editing ophthalmic surgical videos that preserves anatomical accuracy while enabling semantic modifications.

Contribution

It introduces a novel second-order ODE inversion pipeline and a selective tensor injection method for precise, training-free surgical video editing.

Findings

01

Effectively handles complex surgical transformations.

02

Maintains anatomical geometry and temporal consistency.

03

First application of training-free video editing in ophthalmology.

Abstract

High-fidelity surgical video generation can greatly improve medical training and the development of AI, adapting these generative models for precise video editing remains a formidable challenge. Modifying surgical attributes, such as instrument tissue interactions or procedural phases is challenging due to the strict anatomical and temporal constraints. In this paper, we propose OphEdit, a novel training-free framework for the text-guided editing of ophthalmic surgical videos. Our approach leverages a deterministic second-order ODE inversion pipeline to capture Attention Value (V) tensors from the original video. By selectively injecting these stored tensors into the conditional Classifier-Free Guidance (CFG) branch during the denoising phase, OphEdit rigorously preserves the intricate anatomical geometry of the eye while seamlessly mapping text-driven semantic modifications onto the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ophedit/OphEdit
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.