Talking with the Latents -- how to convert your LLM into an astronomer
Ilay Kamai, Marc-Huertas Company, Mike J. Smith, Hagai B. Perets

TL;DR
This paper introduces a domain-agnostic framework that enhances large language models with physical reasoning capabilities by integrating latent physical features, enabling scientific reasoning and interpretability without task-specific fine-tuning.
Contribution
The authors present a novel knowledge distillation approach that fuses latent physical features with LLMs, improving scientific reasoning and interpretability across multiple models and tasks.
Findings
Models outperform strong baselines on scientific tasks
Latent physical features enable interpretability and manipulation
Framework is applicable across scientific domains
Abstract
Recent advances in Large Language Models (LLMs) offer unique opportunities for scientific tasks, yet their ability to reason over complex numerical data remains largely unexplored. We propose a simple mechanism to introduce domain-specific physical knowledge into LLMs by fusing pre-trained latent physical features with a pre-trained language model. Our method employs a teacher-student knowledge distillation framework where a large LLM (teacher) generates synthetic question-answer supervision to transfer physical reasoning to a smaller LLM (student). The student is conditioned on latent physical features and trained via a lightweight adapter and Low-Rank Adaptation (LoRA). We demonstrate that this approach, applied to models with 1B, 8B, and 32B parameters, enables effective reasoning over real scientific data. Our models substantially outperform strong baselines, such as Gemini 3 Pro,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Multimodal Machine Learning Applications · Topic Modeling
