AutoLLM-CARD: Towards a Description and Landscape of Large Language   Models

Shengwei Tian; Lifeng Han; Goran Nenadic

arXiv:2409.17011·cs.CL·November 26, 2024

AutoLLM-CARD: Towards a Description and Landscape of Large Language Models

Shengwei Tian, Lifeng Han, Goran Nenadic

PDF

Open Access 1 Repo

TL;DR

This paper introduces AutoLLM-CARD, a system that automatically extracts key information from scientific papers to generate model cards for large language models, aiding researchers in managing information overload.

Contribution

It presents a novel method combining NER and RE techniques to automatically generate LLM model cards from academic publications, which is a new approach in the field.

Findings

01

Processed 106 papers to extract key LLM features

02

Constructed a dataset linking model names, licenses, and applications

03

Shared code and data for automatic LLM card generation

Abstract

With the rapid growth of the Natural Language Processing (NLP) field, a vast variety of Large Language Models (LLMs) continue to emerge for diverse NLP tasks. As more papers are published, researchers and developers face the challenge of information overload. Thus, developing a system that can automatically extract and organise key information about LLMs from academic papers is particularly important. The standard format for documenting information about LLMs is the LLM model card (\textbf{LLM-Card}). We propose a method for automatically generating LLM model cards from scientific publications. We use Named Entity Recognition (\textbf{NER}) and Relation Extraction (\textbf{RE}) methods that automatically extract key information about LLMs from the papers, helping researchers to access information about LLMs efficiently. These features include model \textit{licence}, model \textit{name},…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shengwei-tian/dependency-parser-visualization
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques