Bottom-up Domain-specific Superintelligence: A Reliable Knowledge Graph is What We Need
Bhishma Dedhia, Yuval Kansal, Niraj K. Jha

TL;DR
This paper proposes a bottom-up approach using knowledge graphs to train language models for domain-specific superintelligence, demonstrated in medicine, leading to significant reasoning improvements over state-of-the-art models.
Contribution
It introduces a task generation pipeline from knowledge graph primitives and fine-tunes models for domain-specific reasoning, advancing towards medical superintelligence.
Findings
QwQ-Med-3 outperforms existing reasoning models on ICD-Bench.
Model utilizes primitives to improve hardest task performance.
Transfer learning enhances medical question-answering accuracy.
Abstract
Language models traditionally used for cross-domain generalization have recently demonstrated task-specific reasoning. However, their top-down training approach on general corpora is insufficient for acquiring abstractions needed for deep domain expertise. This may require a bottom-up approach that acquires expertise by learning to compose simple domain concepts into more complex ones. A knowledge graph (KG) provides this compositional structure, where domain primitives are represented as head-relation-tail edges and their paths encode higher-level concepts. We present a task generation pipeline that synthesizes tasks directly from KG primitives, enabling models to acquire and compose them for reasoning. We fine-tune language models on the resultant KG-grounded curriculum to demonstrate domain-specific superintelligence. While broadly applicable, we validate our approach in medicine,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Domain Adaptation and Few-Shot Learning · Advanced Graph Neural Networks
