ChipAlign: Instruction Alignment in Large Language Models for Chip Design via Geodesic Interpolation
Chenhui Deng, Yunsheng Bai, Haoxing Ren

TL;DR
ChipAlign introduces a training-free geodesic interpolation method to merge large language models, significantly improving instruction-following abilities in chip design models without sacrificing domain expertise.
Contribution
This work presents a novel geodesic interpolation technique for model merging that enhances instruction alignment in chip LLMs without additional training.
Findings
Up to 26.6% improvement on IFEval benchmark.
3.9% performance increase on OpenROAD QA.
8.25% gain on production chip QA benchmarks.
Abstract
Recent advancements in large language models (LLMs) have expanded their application across various domains, including chip design, where domain-adapted chip models like ChipNeMo have emerged. However, these models often struggle with instruction alignment, a crucial capability for LLMs that involves following explicit human directives. This limitation impedes the practical application of chip LLMs, including serving as assistant chatbots for hardware design engineers. In this work, we introduce ChipAlign, a novel approach that utilizes a training-free model merging strategy, combining the strengths of a general instruction-aligned LLM with a chip-specific LLM. By considering the underlying manifold in the weight space, ChipAlign employs geodesic interpolation to effectively fuse the weights of input LLMs, producing a merged model that inherits strong instruction alignment and chip…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel-Driven Software Engineering Techniques · Computational Geometry and Mesh Generation · Modular Robots and Swarm Intelligence
