# Using deep neural networks and LASSO regression to predict miRNA expression changes based on mRNA data

**Authors:** Franz Leonard Böge, Helena U. Zacharias, Stefanie C. Becker, Klaus Jung

PMC · DOI: 10.3389/fbinf.2025.1566162 · Frontiers in Bioinformatics · 2025-07-08

## TL;DR

This paper explores using machine learning to predict miRNA expression from mRNA data, showing that cross-omics predictions are feasible.

## Contribution

The study demonstrates that miRNA expression can be predicted from mRNA data using deep learning and LASSO regression, with some success in cross-study evaluations.

## Key findings

- Most settings achieved strong correlations at the individual sample level.
- Correlations of log-fold changes and p-values were observed in some datasets.
- Data augmentation improved neural network performance but not LASSO models.

## Abstract

Since the rise of molecular high-throughput technologies, many diseases are now studied on multiple omics layers in parallel. Understanding the interplay between microRNAs (miRNA) and their target mRNAs is important to understand the molecular level of diseases. While much public data from mRNA experiments are available for many diseases, few paired datasets with both miRNA and mRNA expression profiles are available. This study aimed to assess the possibility of predicting miRNA expression data based on mRNA expression data, serving as a proof of principle that such cross-omics predictions are feasible. Furthermore, current research relies on target databases where information about miRNA–target relationships is provided based on experimental and computational studies.

To make use of publicly available mRNA profiles, we investigate the ability of artificial deep neural networks and linear least absolute shrinkage and selection operator (LASSO) regression to predict unknown miRNA expression profiles. We evaluate the approach using seven paired miRNA/mRNA expression datasets, four from studies on West Nile virus infection in mouse tissues and three from human immunodeficiency virus (HIV) infection in human tissues. We assessed the performance of each model first by within-data evaluations and second by cross-study evaluations. Furthermore, we investigated whether data augmentation or separate models for data from diseased and non-diseased samples can improve the prediction performance.

In general, most settings achieved strong correlations at the Level of individual samples. In some datasets and settings, correlations of log-fold changes and p-values from differential expression analysis (DEA) between true and predicted miRNA profiles can be observed. Correlation between log fold changes could also be seen in a cross-study evaluation for the HIV datasets. Data augmentation consistently improved performance in neural networks, while its impact on LASSO models was not significant.

Overall, cross-omics prediction of expression profiles appears possible, even with some correlations on the Level of the differential expression analysis.

## Linked entities

- **Species:** Mus musculus (taxon 10090), Homo sapiens (taxon 9606)

## Full-text entities

- **Diseases:** HIV) infection (MESH:D015658), West Nile virus infection (MESH:D014901)
- **Species:** Mus musculus (house mouse, species) [taxon 10090], Homo sapiens (human, species) [taxon 9606], Human immunodeficiency virus (species) [taxon 12721]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12279838/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12279838/full.md

## References

48 references — full list in the complete paper: https://tomesphere.com/paper/PMC12279838/full.md

---
Source: https://tomesphere.com/paper/PMC12279838