TL;DR
PlantDeBERTa is a specialized open-source language model designed for extracting structured knowledge from plant stress-response literature, combining transformer architecture with rule-based and ontology-grounded methods to improve domain-specific NLP in plant science.
Contribution
We introduce PlantDeBERTa, a novel transformer-based model tailored for plant science, integrating linguistic post-processing and ontology normalization for high-precision entity recognition.
Findings
Strong generalization across entity types
Effective domain adaptation in low-resource settings
Public release of the model promotes transparency
Abstract
The rapid advancement of transformer-based language models has catalyzed breakthroughs in biomedical and clinical natural language processing; however, plant science remains markedly underserved by such domain-adapted tools. In this work, we present PlantDeBERTa, a high-performance, open-source language model specifically tailored for extracting structured knowledge from plant stress-response literature. Built upon the DeBERTa architecture-known for its disentangled attention and robust contextual encoding-PlantDeBERTa is fine-tuned on a meticulously curated corpus of expert-annotated abstracts, with a primary focus on lentil (Lens culinaris) responses to diverse abiotic and biotic stressors. Our methodology combines transformer-based modeling with rule-enhanced linguistic post-processing and ontology-grounded entity normalization, enabling PlantDeBERTa to capture biologically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsHow do I file a dispute with Expedia?*DisputeFastService · Softmax · Attention Is All You Need · Focus · Ontology · DeBERTa
