AI Assistants to Enhance and Exploit the PETSc Knowledge Base

Barry Smith; Junchao Zhang; Hong Zhang; Lois Curfman McInnes; Murat Keceli; Archit Vasan; Satish Balay; Toby Isaac; Le Chen; Venkatram Vishwanath

arXiv:2506.20608 · cs.AI · September 23, 2025

TL;DR

This paper explores integrating large language models with PETSc's extensive knowledge base to improve support, documentation, and development workflows in scientific computing.

Contribution

The paper introduces an LLM-powered system that combines retrieval-augmented generation (RAG), reranking, and chatbots so that PETSc's knowledge base can be used for user and developer support.

Findings

01

Effective system architecture for PETSc knowledge integration

02

Evaluation of LLMs and embedding models on technical PETSc content

03

Initial positive impact on software development workflows

Abstract

Generative AI, especially through large language models (LLMs), is transforming how technical knowledge can be accessed, reused, and extended. PETSc, a widely used numerical library for high-performance scientific computing, has accumulated a rich but fragmented knowledge base over its three decades of development, spanning source code, documentation, mailing lists, GitLab issues, Discord conversations, technical papers, and more. Much of this knowledge remains informal and inaccessible to users and new developers. To activate and utilize this knowledge base more effectively, the PETSc team has begun building an LLM-powered system that combines PETSc content with custom LLM tools -- including retrieval-augmented generation (RAG), reranking algorithms, and chatbots -- to assist users, support developers, and propose updates to formal documentation. This paper presents initial experiences…
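
The retrieve-then-rerank pattern named in the abstract can be illustrated with a minimal sketch. This is not the PETSc team's implementation: the tiny `DOCS` list, the bag-of-words cosine retriever, the token-coverage reranker, and the `build_prompt` helper are all hypothetical stand-ins, chosen so the example runs with the Python standard library alone. A production system would presumably use an embedding model for retrieval, a learned reranker, and an actual LLM call in place of the final prompt string.

```python
import math
import re
from collections import Counter

# Hypothetical mini knowledge base (assumption for illustration only).
# The real knowledge base spans source code, docs, mailing lists, issues, and more.
DOCS = [
    "KSPSolve solves a linear system using the Krylov method configured in the KSP object.",
    "The option -ksp_monitor prints the residual norm at each iteration of the linear solver.",
    "MatSetValues inserts or adds a block of values into a matrix.",
    "SNESSolve solves a nonlinear system of equations.",
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric/underscore tokens."""
    return re.findall(r"[a-z0-9_]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, docs, k=3):
    """Stage 1: return up to k documents with nonzero cosine similarity to the query."""
    q = Counter(tokenize(query))
    scored = [(cosine(q, Counter(tokenize(d))), i) for i, d in enumerate(docs)]
    scored.sort(key=lambda s: (-s[0], s[1]))  # best score first, stable on ties
    return [docs[i] for score, i in scored[:k] if score > 0]


def rerank(query, candidates):
    """Stage 2: reorder candidates by the fraction of distinct query tokens they cover.

    This is a stand-in for a learned reranker, not a real reranking model.
    """
    q = set(tokenize(query))

    def coverage(doc):
        return len(q & set(tokenize(doc))) / len(q) if q else 0.0

    return sorted(candidates, key=coverage, reverse=True)


def build_prompt(query, context):
    """Stage 3: assemble the text that would be sent to a chatbot LLM."""
    joined = "\n".join(f"- {c}" for c in context)
    return f"Answer using only this context:\n{joined}\n\nQuestion: {query}"


if __name__ == "__main__":
    question = "How do I print the residual norm at each iteration?"
    candidates = retrieve(question, DOCS, k=3)
    best_first = rerank(question, candidates)
    print(build_prompt(question, best_first[:1]))
```

Keeping retrieval and reranking as separate stages lets a cheap scorer narrow a large corpus before a more expensive scorer orders the short list, which is the usual motivation for pairing the two.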

Taxonomy

Topics: Scientific Computing and Data Management

Methods: Attention Dropout · Dropout · Byte Pair Encoding · Softmax · Dense Connections · Layer Normalization · Linear Warmup With Linear Decay · BERT · BART