CAISSON: Concept-Augmented Inference Suite of Self-Organizing Neural Networks
Igor Halperin

TL;DR
CAISSON introduces a hierarchical, multi-view clustering framework using dual Self-Organizing Maps to enhance retrieval-augmented generation, especially for complex multi-entity queries.
Contribution
It presents a novel multi-view clustering approach with dual SOMs for improved document organization in RAG systems, along with a synthetic evaluation framework, SynFAQA.
Findings
Significant performance improvements on complex multi-entity retrieval tasks.
Enhanced document discovery by combining semantic and conceptual views.
Maintains practical response times for interactive applications.
Abstract
We present CAISSON, a novel hierarchical approach to Retrieval-Augmented Generation (RAG) that transforms traditional single-vector search into a multi-view clustering framework. At its core, CAISSON leverages dual Self-Organizing Maps (SOMs) to create complementary organizational views of the document space, where each view captures different aspects of document relationships through specialized embeddings. The first view processes combined text and metadata embeddings, while the second operates on metadata enriched with concept embeddings, enabling a comprehensive multi-view analysis that captures both fine-grained semantic relationships and high-level conceptual patterns. This dual-view approach enables more nuanced document discovery by combining evidence from different organizational perspectives. To evaluate CAISSON, we develop SynFAQA, a framework for generating synthetic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Linear Layer · Softmax · Linear Warmup With Linear Decay · Multi-Head Attention · Byte Pair Encoding · WordPiece · Dropout · Dense Connections · Layer Normalization
