ChipAlign: Instruction Alignment in Large Language Models for Chip Design via Geodesic Interpolation

Chenhui Deng; Yunsheng Bai; Haoxing Ren

arXiv:2412.19819·cs.AR·July 17, 2025

ChipAlign: Instruction Alignment in Large Language Models for Chip Design via Geodesic Interpolation

Chenhui Deng, Yunsheng Bai, Haoxing Ren

PDF

Open Access

TL;DR

ChipAlign introduces a training-free geodesic interpolation method to merge large language models, significantly improving instruction-following abilities in chip design models without sacrificing domain expertise.

Contribution

This work presents a novel geodesic interpolation technique for model merging that enhances instruction alignment in chip LLMs without additional training.

Findings

01

Up to 26.6% improvement on IFEval benchmark.

02

3.9% performance increase on OpenROAD QA.

03

8.25% gain on production chip QA benchmarks.

Abstract

Recent advancements in large language models (LLMs) have expanded their application across various domains, including chip design, where domain-adapted chip models like ChipNeMo have emerged. However, these models often struggle with instruction alignment, a crucial capability for LLMs that involves following explicit human directives. This limitation impedes the practical application of chip LLMs, including serving as assistant chatbots for hardware design engineers. In this work, we introduce ChipAlign, a novel approach that utilizes a training-free model merging strategy, combining the strengths of a general instruction-aligned LLM with a chip-specific LLM. By considering the underlying manifold in the weight space, ChipAlign employs geodesic interpolation to effectively fuse the weights of input LLMs, producing a merged model that inherits strong instruction alignment and chip…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel-Driven Software Engineering Techniques · Computational Geometry and Mesh Generation · Modular Robots and Swarm Intelligence