# Advances in Semantic-Preserving Text Watermarking

**Authors:** Jiale Meng, Zheming Lu

PMC · DOI: 10.3390/s26051528 · Sensors (Basel, Switzerland) · 2026-02-28

## TL;DR

This paper surveys recent semantic-preserving text watermarking methods, which embed imperceptible identifiers in text without changing its meaning, with a focus on image-based, font-based, and format-based techniques.

## Contribution

The paper proposes a unified classification framework for semantic-preserving text watermarking and uses it to compare methods by methodological principle, robustness, embedding capacity, and imperceptibility.

## Key findings

- Image-based, font-based, and format-based techniques are key approaches in semantic-preserving text watermarking.
- Current methods struggle to balance imperceptibility and robustness in high-stakes applications.
- A structured classification helps clarify the strengths and limitations of existing watermarking strategies.
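
To make the format-based category concrete, here is a minimal sketch of one possible scheme: hiding payload bits as zero-width Unicode characters placed after spaces. This is an illustrative assumption, not a method taken from the paper; the function names `embed`, `extract`, and `strip` and the choice of U+200B and U+200C as carriers are hypothetical.

```python
# Illustrative format-based watermark using zero-width characters.
# Assumption: this scheme is NOT taken from the surveyed paper; it only
# shows what "embedding without changing the wording" can look like.

ZW0 = "\u200b"  # zero-width space, encodes bit 0
ZW1 = "\u200c"  # zero-width non-joiner, encodes bit 1


def embed(text: str, bits: str) -> str:
    """Insert one invisible carrier after each of the first len(bits) spaces."""
    out = []
    used = 0
    for ch in text:
        out.append(ch)
        if ch == " " and used < len(bits):
            out.append(ZW1 if bits[used] == "1" else ZW0)
            used += 1
    if used < len(bits):
        raise ValueError("cover text has too few spaces for the payload")
    return "".join(out)


def extract(marked: str) -> str:
    """Read the payload back from the carrier characters, in order."""
    return "".join("1" if ch == ZW1 else "0" for ch in marked if ch in (ZW0, ZW1))


def strip(marked: str) -> str:
    """Remove all carriers, recovering the visible text (and erasing the mark)."""
    return marked.replace(ZW0, "").replace(ZW1, "")


cover = "the quick brown fox jumps"
marked = embed(cover, "1011")
recovered = extract(marked)
```

In this sketch the visible wording is untouched, capacity is limited to one bit per space, and a single call to `strip` erases the payload, which illustrates the kind of tension between imperceptibility, embedding capacity, and robustness that the findings above describe.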

## Abstract

Textual content faces escalating security threats regarding copyright infringement, tampering, and unauthorized distribution. Text watermarking offers a vital defense mechanism by embedding imperceptible identifiers for source tracking and anti-counterfeiting. However, unlike general image watermarking, protecting text is uniquely challenging due to its highly discrete structure and low pixel redundancy, where even minute perturbations can compromise legibility. Over the past three decades, a wide range of text watermarking techniques have been proposed to address these challenges. While recent research has heavily favored semantic-based watermarking driven by Large Language Models (LLMs), these approaches are often inapplicable to high-stakes scenarios requiring strict content integrity and visual fidelity, such as legal documentation and artistic font protection. Addressing this gap, this paper presents a comprehensive survey of semantic-preserving text watermarking methods developed in recent years, with a particular focus on image-based, font-based, and format-based techniques. We propose a unified classification framework to systematically analyze these approaches, examining their methodological principles, robustness, embedding capacity, and imperceptibility. By clarifying the core characteristics and limitations of existing techniques, this survey aims to provide a structured technical reference for researchers and practitioners, facilitating the advancement of secure, robust, and scalable text protection technologies.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12987231/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12987231/full.md

## References

58 references — full list in the complete paper: https://tomesphere.com/paper/PMC12987231/full.md

---
Source: https://tomesphere.com/paper/PMC12987231