HyperPIE: Hyperparameter Information Extraction from Scientific Publications
Tarek Saier, Mayumi Ohta, Takuto Asakura, Michael F\"arber

TL;DR
HyperPIE introduces a novel approach for extracting hyperparameter information from scientific publications using fine-tuned BERT models and large language models, significantly improving extraction accuracy and enabling large-scale analysis across disciplines.
Contribution
The paper formalizes hyperparameter extraction as an entity and relation recognition task, creating a labeled dataset and developing models that outperform existing methods.
Findings
29% F1 improvement over baseline for relation extraction
5.5% F1 improvement in entity recognition using large language models
Enabled large-scale hyperparameter analysis across disciplines
Abstract
Automatic extraction of information from publications is key to making scientific knowledge machine readable at a large scale. The extracted information can, for example, facilitate academic search, decision making, and knowledge graph construction. An important type of information not covered by existing approaches is hyperparameters. In this paper, we formalize and tackle hyperparameter information extraction (HyperPIE) as an entity recognition and relation extraction task. We create a labeled data set covering publications from a variety of computer science disciplines. Using this data set, we train and evaluate BERT-based fine-tuned models as well as five large language models: GPT-3.5, GALACTICA, Falcon, Vicuna, and WizardLM. For fine-tuned models, we develop a relation extraction approach that achieves an improvement of 29% F1 over a state-of-the-art baseline. For large language…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Data Quality and Management · Scientific Computing and Data Management
MethodsSparse Evolutionary Training · Refunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Adam · {Dispute@FaQ-s}How to file a dispute with Expedia? · Attention Dropout · 15 Ways to Contact How can i speak to someone at Delta Airlines · Layer Normalization
