Can Foundational Large Language Models Assist with Conducting   Pharmaceuticals Manufacturing Investigations?

Hossein Salami (1); Brandye Smith-Goettler (2); Vijay Yadav (2) ((1); Digital Services; MMD; Merck & Co.; Inc.; Rahway; NJ; USA; (2) Digital; Services; MMD; Merck & Co.; Inc.; West Point; PA; USA)

arXiv:2404.15578·cs.CL·April 25, 2024

Can Foundational Large Language Models Assist with Conducting Pharmaceuticals Manufacturing Investigations?

Hossein Salami (1), Brandye Smith-Goettler (2), Vijay Yadav (2) ((1), Digital Services, MMD, Merck & Co., Inc., Rahway, NJ, USA, (2) Digital, Services, MMD, Merck & Co., Inc., West Point, PA, USA)

PDF

Open Access

TL;DR

This study evaluates the potential of general-purpose large language models like GPT-4 and Claude-2 to assist in pharmaceutical manufacturing investigations by extracting information and identifying similar deviations from historical records.

Contribution

The paper demonstrates how LLMs can be applied to domain-specific manufacturing data for information extraction and similarity search, highlighting their capabilities and limitations.

Findings

01

GPT-4 and Claude-2 achieve high accuracy in information extraction.

02

Semantic search effectively identifies similar manufacturing deviations.

03

LLMs exhibit reasoning and hallucination risks in complex cases.

Abstract

General purpose Large Language Models (LLM) such as the Generative Pretrained Transformer (GPT) and Large Language Model Meta AI (LLaMA) have attracted much attention in recent years. There is strong evidence that these models can perform remarkably well in various natural language processing tasks. However, how to leverage them to approach domain-specific use cases and drive value remains an open question. In this work, we focus on a specific use case, pharmaceutical manufacturing investigations, and propose that leveraging historical records of manufacturing incidents and deviations in an organization can be beneficial for addressing and closing new cases, or de-risking new manufacturing campaigns. Using a small but diverse dataset of real manufacturing deviations selected from different product lines, we evaluate and quantify the power of three general purpose LLMs (GPT-3.5, GPT-4,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical and Computational Modeling · Software Engineering Research · Machine Learning in Materials Science

MethodsAttention Is All You Need · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Absolute Position Encodings · Dropout · Dense Connections · Label Smoothing · Residual Connection · Softmax · Adam