Empowering Meta-Analysis: Leveraging Large Language Models for   Scientific Synthesis

Jawad Ibn Ahad; Rafeed Mohammad Sultan; Abraham Kaikobad; Fuad Rahman,; Mohammad Ruhul Amin; Nabeel Mohammed; Shafin Rahman

arXiv:2411.10878·cs.CL·November 19, 2024

Empowering Meta-Analysis: Leveraging Large Language Models for Scientific Synthesis

Jawad Ibn Ahad, Rafeed Mohammad Sultan, Abraham Kaikobad, Fuad Rahman,, Mohammad Ruhul Amin, Nabeel Mohammed, Shafin Rahman

PDF

Open Access 1 Repo

TL;DR

This paper presents a novel method using fine-tuned large language models with Retrieval Augmented Generation to automate and improve the accuracy of scientific meta-analyses, reducing human effort and error.

Contribution

The study introduces a fine-tuning approach with a new loss metric and prompt engineering to enhance LLM performance in automated meta-analysis tasks.

Findings

01

Fine-tuned LLMs generate 87.6% relevant meta-analysis content.

02

Relevancy irrelevance reduced from 4.56% to 1.9%.

03

Effective in low-resource environments.

Abstract

This study investigates the automation of meta-analysis in scientific documents using large language models (LLMs). Meta-analysis is a robust statistical method that synthesizes the findings of multiple studies support articles to provide a comprehensive understanding. We know that a meta-article provides a structured analysis of several articles. However, conducting meta-analysis by hand is labor-intensive, time-consuming, and susceptible to human error, highlighting the need for automated pipelines to streamline the process. Our research introduces a novel approach that fine-tunes the LLM on extensive scientific datasets to address challenges in big data handling and structured data extraction. We automate and optimize the meta-analysis process by integrating Retrieval Augmented Generation (RAG). Tailored through prompt engineering and a new loss metric, Inverse Cosine Distance (ICD),…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

EncryptedBinary/Meta_analysis
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScientific Computing and Data Management