# REDAC: RNA-seq expression data analysis chatbot

**Authors:** Giovanni Maria De Filippis, Pranoy Sahu, Pasqualina Ambrosio, Stefania Picascia, Matteo Lo Monte, Ilenia Agliarulo, Simone Di Paola, Cristiano Russo, Christian Tommasino, Nicola Normanno, Daniela Frezzetti, Seetharaman Parashuraman, Antonio M Rinaldi, Francesco Russo

PMC · DOI: 10.1093/bioadv/vbaf321 · Bioinformatics Advances · 2025-12-27

## TL;DR

REDAC is a chatbot tool that simplifies RNA-seq data analysis using natural language and provides reproducible results with visualizations and biological interpretations.

## Contribution

REDAC introduces a novel chatbot interface for RNA-seq analysis using LLMs and a retrieval-augmented generation module for pathway interpretation.

## Key findings

- REDAC enables rapid and transparent RNA-seq analysis through natural language queries.
- The tool integrates Gemma and LLaMA models with a PubMed-based module for pathway enrichment interpretation.
- REDAC supports reproducibility by generating automated analysis reports.

## Abstract

To date, due to the complexity of both the analytical processes and the result interpretation of RNA-seq expression data analyses, researchers often require the support of bioinformaticians expertise. Selecting appropriate statistical tests and performing essential data manipulations, such as normalization and filtering, in a rigorous and reproducible manner remains a significant challenge for many users.

We developed REDAC, a web-based R application that offers an interactive platform designed to simplify and enhance RNA-seq expression data exploration and analysis. REDAC provides a straightforward approach to perform differentially RNA-seq analysis rapidly, easily, and transparently through natural language queries from users. Moreover, it allows to run complete analyses, generate comprehensive visualizations, and obtain biological interpretation of pathway enrichment results via two popular Large Language Models: Gemma and LLaMA guided by a PubMed based Retrieval-Augmented Generation module. Finally, REDAC promotes reproducibility through the automated generation of analysis reports.

REDAC is available for local (https://github.com/franruss/REDAC) and online use (https://frusso.shinyapps.io/REDAC). User manual: https://github.com/franruss/REDAC/blob/main/docs/REDAC_user_manual.pdf

## Full-text entities

- **Diseases:** NSCLC (MESH:D002289), LLMs (MESH:D007806), inflammatory (MESH:D007249), toxicity (MESH:D064420), hallucination (MESH:D006212), REDAC (MESH:D001039)
- **Chemicals:** Gefitinib (MESH:D000077156), REDAC (-)
- **Species:** Lama glama (llama, species) [taxon 9844], Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** HCC827 — Homo sapiens (Human), Lung adenocarcinoma, Cancer cell line (CVCL_2063), PC9 — Homo sapiens (Human), Lung adenocarcinoma, Cancer cell line (CVCL_B260)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12927421/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12927421/full.md

## References

38 references — full list in the complete paper: https://tomesphere.com/paper/PMC12927421/full.md

---
Source: https://tomesphere.com/paper/PMC12927421