AutoLLM-CARD: Towards a Description and Landscape of Large Language Models
Shengwei Tian, Lifeng Han, Goran Nenadic

TL;DR
This paper introduces AutoLLM-CARD, a system that automatically extracts key information from scientific papers to generate model cards for large language models, aiding researchers in managing information overload.
Contribution
It presents a novel method combining NER and RE techniques to automatically generate LLM model cards from academic publications, which is a new approach in the field.
Findings
Processed 106 papers to extract key LLM features
Constructed a dataset linking model names, licenses, and applications
Shared code and data for automatic LLM card generation
Abstract
With the rapid growth of the Natural Language Processing (NLP) field, a vast variety of Large Language Models (LLMs) continue to emerge for diverse NLP tasks. As more papers are published, researchers and developers face the challenge of information overload. Thus, developing a system that can automatically extract and organise key information about LLMs from academic papers is particularly important. The standard format for documenting information about LLMs is the LLM model card (\textbf{LLM-Card}). We propose a method for automatically generating LLM model cards from scientific publications. We use Named Entity Recognition (\textbf{NER}) and Relation Extraction (\textbf{RE}) methods that automatically extract key information about LLMs from the papers, helping researchers to access information about LLMs efficiently. These features include model \textit{licence}, model \textit{name},…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
