# DNA diamond formulates a decomposable composite letter constellation model for DNA data storage

**Authors:** Qi Ge, Menghui Ren, Tingting Qi, Changcai Han, Yingjin Yuan, Weigang Chen

PMC · DOI: 10.1038/s41467-026-68861-y · Nature Communications · 2026-01-31

## TL;DR

The paper introduces a new DNA data storage model called DNA diamond that improves storage density and reliability using a two-stage detection strategy.

## Contribution

The DNA diamond model introduces a decomposable constellation and entropy-guided detection for high-density DNA data storage.

## Key findings

- The eight-letter system achieves 2.5 bits per letter with error-free recovery at 14× coverage.
- The 15-letter system enables 3.125 bits per letter with error-free recovery at 33× coverage.

## Abstract

Oligonucleotide multiplicity is an inherent property of current DNA synthesis technology. Composite letter DNA storage exploits this property to improve logical density and reduce costs. However, letter indistinguishability and high molecular diversity pose challenges for reliable recovery. Here, we formulate a composite letter constellation model, named DNA diamond, consisting of 15 decomposable points. Inspired by set partitioning in telecommunications, we propose a two-stage letter detection framework that partitions these letters into four distinguishable subsets based on their discrete entropy. Furthermore, we incorporate encoded double-end indices to eliminate crosstalk between synthesis sites and simultaneously apply length filtering to suppress error propagation during readout. We validate the eight-letter and 15-letter composite letter DNA storage under DNA diamond model, each with 10,000 composite strands. The eight-letter system achieves a payload density of 2.5 bits per letter and enables error-free recovery at 14× coverage, surpassing the storage density of prior six-letter systems while requiring lower coverage. The full 15-letter constellation enables 3.125 bits per letter for payload with error-free recovery at 33× coverage, corresponding to a density of 2.23 bits per letter for payload plus indices. The proposed decomposable DNA diamond model advances a practical and scalable framework for high-density composite DNA data storage.

Composite-letter DNA data storage promises higher density but suffers from readout challenges. Here, the authors present a decomposable diamond constellation and an entropy-guided two-stage detection strategy, enabling reliable recovery at low sequencing coverage for high-density DNA data storage.

## Full-text entities

- **Chemicals:** diamond (MESH:D018130)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12909984/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12909984/full.md

## References

11 references — full list in the complete paper: https://tomesphere.com/paper/PMC12909984/full.md

---
Source: https://tomesphere.com/paper/PMC12909984