Uchimata: a toolkit for visualization of 3D genome structures on the web and in computational notebooks

David Kouřil; Trevor Manz; Tereza Clarence; Nils Gehlenborg

PMC · DOI:10.1093/bioinformatics/btag035·January 22, 2026

Uchimata: a toolkit for visualization of 3D genome structures on the web and in computational notebooks

David Kouřil, Trevor Manz, Tereza Clarence, Nils Gehlenborg

PDF

Open Access

TL;DR

Uchimata is a toolkit that allows users to visualize 3D genome structures in web browsers and Jupyter Notebooks.

Contribution

Uchimata introduces a flexible and integrable toolkit for 3D genome visualization combining JavaScript and Python.

Findings

01

Uchimata provides a JavaScript library and Python widget for rendering 3D genome structures.

02

The toolkit supports expressive visual encodings and filtering based on genomic and spatial criteria.

03

Uchimata is open-source and integrates well with existing Python-based bioinformatics tools.

Abstract

Uchimata is a toolkit for visualization of 3D structures of genomes. It consists of two packages: a Javascript library facilitating the rendering of 3D models of genomes, and a Python widget for visualization in Jupyter Notebooks. Main features include an expressive way to specify visual encodings, and filtering of 3D genome structures based on genomic semantics and spatial aspects. Uchimata is designed to be highly integratable with biological tooling available in Python. Uchimata is released under the MIT License. The Javascript library is available on NPM, while the widget is available as a Python package hosted on PyPI. The source code for both is available publicly on Github (https://github.com/hms-dbmi/uchimata and https://github.com/hms-dbmi/uchimata-py) and Zenodo (https://doi.org/10.5281/zenodo.17831959 and https://doi.org/10.5281/zenodo.17832045). The documentation with…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species3

Mus musculus(house mouse · species)Saccharomyces cerevisiae(baker's yeast · species)Homo sapiens(human · species)

Figures1

Click any figure to enlarge with its caption.

Uchimata consists of two packages, Javascript and Python, intended to cover use cases for visualization of 3D genome models across the web and computational notebooks. Key features include expressive configuration of visual encoding (A), selections based on spatial attributes (B), and selections based on genomic coordinates (C). Two use cases illustrating situations where 3D visualization brings new insights (D): mapping bin coordinates to continuous color scale shows concentration of telomeres on one side of the structures (left), and encoding gene density to color and scale of the bin marks shows concentration of gene-rich regions within the structures, further from the boundary (right).

Funding1

—National Institutes of Health10.13039/100000002

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenomics and Chromatin Dynamics · Genomics and Phylogenetic Studies · Genome Rearrangement Algorithms

Full text

1 Introduction

Alongside sequencing-based technologies developed to probe the spatial conformation of genomes—such as the influential Hi-C method (Lieberman-Aiden et al. 2009)—there are ongoing efforts to produce concrete 3D models that depict genome folding. Although research on macromolecular structures has long benefited from sustained investment in resources such as the Protein Data Bank (Berman et al. 2000) and visualization tools like PyMOL (Schrödinger, LLC, 2015) and Mol* (Sehnal et al. 2021), comparable tools for analysis and visualization of physical genome structures remain limited.

There are two principal approaches to generating structural models of genomes (Imakaev et al. 2015): data-driven methods, which produce structures that satisfy constraints derived from input data (e.g. Hi-C), and methods that simulate structures de novo based on mechanistic principles. Li et al. (2023) review reconstruction methods and highlight how resulting 3D structures can lead to novel insights. Portillo-Ledesma et al. (2023) further emphasize that the choice of model resolution and simulation technique restricts the scale of the phenomena that can be investigated. Oluwadare et al. (2019) summarize approaches for reconstructing chromosome- and genome-level structures from Hi-C data. Notable examples of existing structures include Duan et al.’s yeast genome model (Duan et al. 2010), Stevens et al.’s mammalian (mouse) genome structures constructed from single-cell Hi-C (Stevens et al. 2017), and Tan et al.’s genome structures of single diploid human cells (Tan et al. 2018). Recent studies show promise in using machine learning techniques to infer 3D genome structures (Schuette et al. 2025). Beyond individual publications, there have been efforts to develop centralized databases of 3D genome models (Oluwadare et al. 2020), although community adoption remains limited.

Visualizing specialized 3D data can be challenging for computational biologists lacking computer graphics expertise, who depend on tools that abstract low-level rendering details. Although several tools have been developed for visualizing 3D genome structural data—such as Genome3D (Asbury et al. 2010), 3DGB (Butyaev et al. 2015), GMOL (Nowotny et al. 2016), HiC-3DViewer (Djekidel et al. 2017), and CSynth (Todd et al. 2021)—most exist as standalone desktop or web applications, making them difficult to integrate into notebook-based analytical workflows, to adapt for case-specific applications, and to customize beyond the visual choices predefined by their developers.

To address the lack of dedicated visualization tools in computational notebooks, scientists often repurpose general plotting libraries or tools originally designed for protein structures. For instance, nglutils (https://github.com/mirnylab/nglutils) builds on the molecular viewer nglview (Nguyen et al. 2018) to support interactive visualization of genome structures within notebooks. Similarly, some simulation packages for 3D genome structures recommend using general visualization libraries, such as matplotlib (Hunter 2007) or Fresnel (https://github.com/glotzerlab/fresnel). Biologists often reuse molecular file formats (e.g. PDB) to store 3D genome structures, but differing interpretations of these standards for genomic data can cause incompatibilities even within the same format.

We developed uchimata, a toolkit for visualizing 3D genome structures across the web and computational notebook environments (Fig. 1). Its unique combination of features exceeds the capabilities of existing solutions: (i) A declarative specification for visual encoding provides flexible and expressive customization of the visual depiction and allows mapping genomic data onto the 3D structure; (ii) An API for spatial and genomics filtering of 3D genome structures supports operations such as applying cutting planes or selecting genomic ranges. (iii) Dual JavaScript and Python availability enables use both during analysis—close to the modeling and analysis environment—and as an integrated component within larger web applications, such as data portals.

Uchimata consists of two packages, Javascript and Python, intended to cover use cases for visualization of 3D genome models across the web and computational notebooks. Key features include expressive configuration of visual encoding (A), selections based on spatial attributes (B), and selections based on genomic coordinates (C). Two use cases illustrating situations where 3D visualization brings new insights (D): mapping bin coordinates to continuous color scale shows concentration of telomeres on one side of the structures (left), and encoding gene density to color and scale of the bin marks shows concentration of gene-rich regions within the structures, further from the boundary (right).

At its core, uchimata operates on tabular data that link genomic regions to XYZ coordinates, making it suitable for a broad range of applications, including Hi-C-based simulations, de novo modeling, and emerging approaches such as imaging-based methods.

2 Methods

2.1 Design of the uchimata toolkit

The design of uchimata is guided by the principle of composability (Manz 2024). To support long-term maintainability and enable the reuse of visualization tools as components in novel scenarios, each tool should focus on a narrow and well-defined set of features. This contrasts with the approach of many other tools in this domain, which combine 3D data with Hi-C matrices and genomic browsers, ultimately reducing reusability. In uchimata, our focus is on visualizing 3D models, while related representations are deliberately left to other specialized tools. This modular approach allows users to select and combine components to construct interfaces and pipelines tailored to their specific requirements, avoiding the pitfalls of monolithic software that becomes progressively difficult to extend and maintain.

The second guiding principle is the separation of concerns and leveraging strengths of individual programming environments. Over time, the Javascript and Python ecosystems have developed distinct competencies. The web platform excels at building user interfaces and offers a wide range of visualization libraries. Its core technologies—HTML, CSS, and Javascript—benefit from standardization and strong backward compatibility. Python, by contrast, is well suited for data wrangling and analysis, and has become a dominant platform in computational biology. Some libraries, such as NumPy (Harris et al. 2020) and pandas (McKinney 2010), introduced concepts now considered core to the Python ecosystem, even though they exist outside the language and its standard library. Similarly, we advocate separating the tasks of building the 3D genome models (e.g. Hi-C-based simulations) and visualizing them. This separation must be facilitated by robust mechanisms for exchanging data between tools.

Therefore, uchimata embraces existing data standards, some of which reach beyond genomics or bioinformatics. This approach allows it to benefit from mature infrastructure developed by broader communities. For example, uchimata uses Apache Arrow (https://arrow.apache.org) as its in-memory representation of 3D genome structures. Rather than supporting a fixed set of file formats, the uchimata Javascript library accepts only structures as Arrow tables, whereas the Python package also ingests canonical scientific Python structures such as NumPy array and pandas DataFrame, internally converting both to Arrow. This approach addresses incompatibilities in existing file formats for genome structures. Several tools reinterpret the PDB format, originally designed for atomistic models, to store 3D genome structures, but each adopts its own conventions. One concrete issue is that genomic coordinates can span up to nine digits, exceeding the standard column widths of the text-based PDB format and forcing users to use non-standard fields. Representing 3D genome models as Arrow tables leverages a standardized binary format widely supported by analytical tools. Our adoption of Apache Arrow is inspired by other efforts to unify bioinformatics file formats, e.g. Oxbow (Abdennur et al. 2025).

The main downside is the need to convert existing structures to Arrow. Due to the subtle differences among file formats, fully automatic conversion is rarely possible. We provide examples of how to perform this conversion in the Supplemental Materials, available as supplementary data at Bioinformatics online.

2.2 Implementation

The core web uchimata library (https://github.com/hms-dbmi/uchimata) is implemented using Typescript, transformed into a standard Javascript module, and hosted on NPM. The Jupyter notebook widget (https://github.com/hms-dbmi/uchimata-py) is built using anywidget (Manz et al. 2024) and uses the Javascript library to serve the canvas within a notebook’s cell. The 3D graphics rendering is accomplished through three.js, currently using the WebGL backend. We use DuckDB (Raasveldt and Mühleisen 2019) to perform a variety of queries to filter the Apache Arrow tables representing the 3D structures and associated data.

3 Results

The development of uchimata was driven by the aim to support a broad range of end-use scenarios. We focused on core functionality that could be reused in web-based applications with case-specific interfaces, without overloading the core library with features relevant only to a narrow subset of users. We also envisioned seamless integration of uchimata into data portals.

A second major group of use cases involves visualization in exploratory stages, such as during simulations and in downstream analyses of simulated data. While the library can be used directly in Javascript-based notebook environments such as Observable Notebooks (https://observablehq.com/platform/notebooks), much computational notebook work today is carried out in Python, leveraging its extensive scientific software ecosystem.

3.1 Integration in web applications and Javascript-based notebooks

In the first scenario, uchimata functions like any other Javascript library in a web environment, enabling visualization of 3D genome structural data. To show uchimata’s ability to support novel visualizations, we use it within Observable Notebooks—a literate programming environment, similar to Jupyter Notebooks, that executes Javascript code in interactive cells. A set of examples is available at https://hms-dbmi.github.io/uchimata/, source code for these examples is in the uchimata Github repository, under the “docs” folder. At its core, a 3D genome structure is a list of XYZ coordinates, sometimes associated with genomic coordinates. How these data items are visually represented is left to the user, who specifies a view config that maps items to concrete marks, such as spheres or cubes, and their visual features such as color and scale. This manner of declaring visual encodings is inspired by the grammar-of-graphics approach (Wilkinson 2005). A scene contains an array of structures, each paired with its corresponding view config, which together define how the structures are rendered. The view config lets users define how each structure is depicted by mapping data columns to visual attributes. For example, mapping a column containing the chromosome information to the color channel assigns a different color to each chromosome. Such annotations can be applied at any level (e.g. compartment, TADS) and depend solely on the data supplied with the structure.

We envision uchimata being integrated into use case-specific web applications and genome browsers. We recently used uchimata to extend the Gosling grammar (L’Yi et al. 2022) with support for 3D genome models (Kouřil et al. 2026), thereby implementing the previously recognized “spatial” layout for genomic coordinate systems (Nusrat et al. 2019) that was missing in existing grammar-based tools. This grammar-of-graphics approach allows for both expressiveness and reproducibility, and by unifying 3D representations of genomes with conventional genomic data views, it opens new avenues for exploring efficient visual linking between distant genomic loci.

3.2 Interoperability with existing bioinformatics workflows in Python

Computational notebooks play an important role in day-to-day analytical work of genomics researchers (van den Brandt et al. 2025). In the second scenario, uchimata can be used as a widget for Python-based computational notebooks.

Python is often used for chromatin simulations, where visual inspection is typically the first step in assessing the results. To shorten the iteration loop, a visualization tool should be available directly within the simulation environment, enabling biologists to perform basic checks, adjust parameters, and rerun simulations. The uchimata widget offers multiple ways to input data which aligns with in-memory storage practices in Python-based computation notebooks. Most relevant for Jupyter environments are widely used data structures such as NumPy arrays and pandas DataFrame.

Furthermore, the availability through Python allows for integration with existing bioinformatics tools. For example, we can combine uchimata with bioframe (Open2C et al. 2024) to apply genomic range selections on 3D genome structures. Bioframe offers functionality for loading genomic data from typical formats such as BED, GFF, or GTF, and performing a variety of interval operations, representing the ranges as pandas DataFrames. Uchimata then accepts these dataframes as selection queries and outputs corresponding bins of the 3D structure. The user can thus apply the same range as selection across multiple different 3D structures, facilitating comparison across an ensemble of simulated structures. In addition to genomics-based filtering, uchimata supports spatial selections (Fig. 1B), currently implementing cutting-plane (cross-section) operations and selecting spherical neighborhoods of a specified radius.

Figure 1D highlights two examples that demonstrate how visualization of 3D models reveals insights about genome structure. First, we use Stevens et al.’s mouse cell structures (Stevens et al. 2017) and encode each chromosome’s coordinates with a continuous color scale. This results in clear identification of locations where telomeres concentrate (Fig. 1D, left). Second, we aggregate gene annotation loaded from a GTF into bins matching the resolution of the structure. We then map this gene density data to both color and scale of the marks representing the bins (Fig. 1D, right). Scaling the marks can act as a form of removing occlusion and highlighting inner structure. The biological insight in this example is that gene-rich regions concentrate in the center of the genome structure, further from the border.

4 Conclusion

While the heatmap representation (i.e. the contact matrices resulting from Hi-C) remains the most direct experimental way to observe how whole genomes fold in nuclear space, we believe that the 3D structural representation offers a complementary benefit. Its primary strength is that it contextualizes genomic data within 3D space, which can lead to hypotheses related to specific patterns observed in the actual 3D structure. With uchimata, we contribute software that makes it easier to visualize this type of genomics data. With this crucial infrastructure in place, we intend to further investigate means of linking between traditional genome browser views, dense multiscale matrix viewers, and 3D structure visualizations. On the rendering side, we plan to adopt the WebGPU API as it becomes broadly supported across major web browsers.

Supplementary Material

btag035_Supplementary_Data

Bibliography32

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Abdennur N , Bzura C, Manz T et al abdenlab/oxbow: py-oxbow@v 0.5.0. Zenodo, 2025.
2Asbury TM , Mitman M, Tang J et al Genome 3D: a viewer-model framework for integrating and visualizing multi-scale epigenomic information within a three-dimensional genome. BMC Bioinformatics 2010;11:444.20813045 10.1186/1471-2105-11-444PMC 2941692 · doi ↗ · pubmed ↗
3Berman HM , Westbrook J, Feng Z et al The protein data bank. Nucleic Acids Res 2000;28:235–42.10592235 10.1093/nar/28.1.235PMC 102472 · doi ↗ · pubmed ↗
4Butyaev A , Mavlyutov R, Blanchette M et al A low-latency, big database system and browser for storage, querying and visualization of 3D genomic data. Nucleic Acids Res 2015;43:e 103.25990738 10.1093/nar/gkv 476PMC 4652742 · doi ↗ · pubmed ↗
5Djekidel MN , Wang M, Zhang MQ et al Hi C-3D Viewer: a new tool to visualize hi-C data in 3D space. Quant Biol 2017;5:183–90.
6Duan Z , Andronescu M, Schutz K et al A three-dimensional model of the yeast genome. Nature 2010;465:363–7.20436457 10.1038/nature 08973 PMC 2874121 · doi ↗ · pubmed ↗
7Harris CR , Millman KJ, van der Walt SJ et al Array programming with Num Py. Nature 2020;585:357–62.32939066 10.1038/s 41586-020-2649-2PMC 7759461 · doi ↗ · pubmed ↗
8Hunter JD. Matplotlib: a 2D graphics environment. Comput Sci Eng 2007;9:90–5.