Empowering Meta-Analysis: Leveraging Large Language Models for Scientific Synthesis
Jawad Ibn Ahad, Rafeed Mohammad Sultan, Abraham Kaikobad, Fuad Rahman,, Mohammad Ruhul Amin, Nabeel Mohammed, Shafin Rahman

TL;DR
This paper presents a novel method using fine-tuned large language models with Retrieval Augmented Generation to automate and improve the accuracy of scientific meta-analyses, reducing human effort and error.
Contribution
The study introduces a fine-tuning approach with a new loss metric and prompt engineering to enhance LLM performance in automated meta-analysis tasks.
Findings
Fine-tuned LLMs generate 87.6% relevant meta-analysis content.
Relevancy irrelevance reduced from 4.56% to 1.9%.
Effective in low-resource environments.
Abstract
This study investigates the automation of meta-analysis in scientific documents using large language models (LLMs). Meta-analysis is a robust statistical method that synthesizes the findings of multiple studies support articles to provide a comprehensive understanding. We know that a meta-article provides a structured analysis of several articles. However, conducting meta-analysis by hand is labor-intensive, time-consuming, and susceptible to human error, highlighting the need for automated pipelines to streamline the process. Our research introduces a novel approach that fine-tunes the LLM on extensive scientific datasets to address challenges in big data handling and structured data extraction. We automate and optimize the meta-analysis process by integrating Retrieval Augmented Generation (RAG). Tailored through prompt engineering and a new loss metric, Inverse Cosine Distance (ICD),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management
