Optimizing Retrieval-Augmented Generation for Electrical Engineering: A Case Study on ABB Circuit Breakers
Salahuddin Alawadhi, Noorhan Abbas

TL;DR
This paper explores optimizing Retrieval-Augmented Generation (RAG) systems for electrical engineering by developing domain-specific datasets, evaluating different pipelines, and assessing chunking strategies to improve accuracy and relevance in high-stakes engineering contexts.
Contribution
It introduces a domain-specific dataset for ABB circuit breakers and evaluates multiple RAG pipelines and chunking methods tailored for electrical engineering documentation.
Findings
Certain RAG configurations achieve high precision and relevance.
Chunking strategies significantly impact retrieval accuracy.
Limitations remain in ensuring factual accuracy and completeness.
Abstract
Integrating Retrieval Augmented Generation (RAG) with Large Language Models (LLMs) has shown the potential to provide precise, contextually relevant responses in knowledge intensive domains. This study investigates the ap-plication of RAG for ABB circuit breakers, focusing on accuracy, reliability, and contextual relevance in high-stakes engineering environments. By leveraging tailored datasets, advanced embedding models, and optimized chunking strategies, the research addresses challenges in data retrieval and contextual alignment unique to engineering documentation. Key contributions include the development of a domain-specific dataset for ABB circuit breakers and the evaluation of three RAG pipelines: OpenAI GPT4o, Cohere, and Anthropic Claude. Advanced chunking methods, such as paragraph-based and title-aware segmentation, are assessed for their impact on retrieval accuracy and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
