# Gain efficiency with streamlined and automated data processing: Examples from high-throughput monoclonal antibody production

**Authors:** Malwina Kotowicz, Magdalena Shumanska, Sven Fengler, Birgit Kurkowsky, Anja Meyer-Berhorn, Elisa Moretti, Josephine Blersch, Gisela Schmidt, Jakob Kreye, Scott van Hoof, Elisa Sánchez-Sendín, S. Momsen Reincke, Lars Krüger, Harald Prüß, Philip Denner, Eugenio Fava, Dominik Stappert, Bhanwar Lal Puniya, Bhanwar Lal Puniya, Bhanwar Lal Puniya, Bhanwar Lal Puniya

PMC · DOI: 10.1371/journal.pone.0326678 · PLOS One · 2025-07-01

## TL;DR

This paper presents a modular pipeline for efficient data processing in complex biological workflows, improving reproducibility and compliance with FAIR principles.

## Contribution

The novel contribution is a modular and automated data processing pipeline tailored for biologists with limited computational expertise.

## Key findings

- The pipeline streamlines data management and speeds up experimental cycles in monoclonal antibody production.
- It enhances reproducibility and promotes compliance with FAIR data principles.
- The approach is versatile, demonstrated through a proof-of-concept in induced pluripotent stem cell culture.

## Abstract

Data management and sample tracking in complex biological workflows are essential steps to ensure necessary documentation and guarantee reusability of data and metadata. Currently, these steps pose challenges related to correct annotation and labeling, error detection, and safeguarding the quality of documentation. With growing acquisition of biological data and the expanding automatization of laboratory workflows, manual processing of sample data is no longer favorable, as it is time- and resource-consuming, prone to biases and errors, and lacks scalability and standardization. Thus, managing heterogeneous biological data calls for efficient and tailored systems, especially in laboratories run by biologists with limited computational expertise. Here, we showcase how to meet these challenges with a modular pipeline for data processing, facilitating the complex production of monoclonal antibodies from single B-cells. We present best practices for development of data processing pipelines concerned with extensive acquisition of biological data that undergoes continuous manipulation and analysis. Moreover, we assess the versatility of proposed design principles through a proof-of-concept data processing pipeline for automated induced pluripotent stem cell culture and differentiation. We show that our approach streamlines data management operations, speeds up experimental cycles and leads to enhanced reproducibility. Finally, adhering to the presented guidelines will promote compliance with FAIR principles upon publishing.

## Full-text entities

- **Diseases:** Alzheimer (MESH:D000544), neurological disorders (MESH:D009461), ASCC (MESH:D000092423), COVID-19 (MESH:D000086382), Infectious Disease (MESH:D003141), Zoonotic diseases (MESH:D015047)
- **Chemicals:** Menadione (MESH:D024483), oxygen (MESH:D010100), Naphthoquinones (MESH:D009285), PONE-D-24-34015R2 (-), Glycerol (MESH:D005990), mABs (MESH:D000911)
- **Species:** Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049], Staphylococcus aureus (species) [taxon 1280], Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** HEK — Homo sapiens (Human), Transformed cell line (CVCL_0045)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12212921/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12212921/full.md

## References

52 references — full list in the complete paper: https://tomesphere.com/paper/PMC12212921/full.md

---
Source: https://tomesphere.com/paper/PMC12212921