Using Guided Transfer Learning to Predispose AI Agent to Learn Efficiently from Small RNA-sequencing Datasets
Kevin Li, Danko Nikoli\'c, Vjekoslav Nikoli\'c, Davor Andri\'c, Lauren, M. Sanders, Sylvain V. Costes

TL;DR
This paper introduces Guided Transfer Learning (GTL) to improve AI learning efficiency from small RNA-seq datasets by pre-training on large-scale data, enabling better few-shot learning performance in high-dimensional, low-sample settings.
Contribution
The work proposes a novel GTL approach that incorporates inductive biases during pre-training, enhancing transfer learning effectiveness for HDLSS RNA-seq data.
Findings
GTL significantly improves few-shot learning accuracy on RNA-seq tasks.
Pre-training on recount3 data enables the AI to learn gene expression patterns.
GTL outperforms traditional transfer learning methods on small sample size datasets.
Abstract
Given the increasing availability of RNA-seq data and its complex and heterogeneous nature, there has been growing interest in applying AI/machine learning methodologies to work with such data modalities. However, because omics data is characterized by high dimensionality and low sample size (HDLSS), current attempts at integrating AI in this domain require significant human guidance and expertise to mitigate overfitting. In this work we look at how transfer learning can be improved to learn from small RNA-seq sample sizes without significant human interference. The strategy is to gain general prior knowledge about a particular domain of data (e.g. RNA-seq data) by pre-training on a general task with a large aggregate of data, then fine-tuning to various specific, downstream target tasks in the same domain. Because previous attempts have shown traditional transfer learning failing on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCancer-related molecular mechanisms research · Molecular Biology Techniques and Applications · Genomics and Phylogenetic Studies
