ObjectGraph: From Document Injection to Knowledge Traversal -- A Native File Format for the Agentic Era
Mohit Dubey, Open Gigantic

TL;DR
OBJECTGRAPH is a new file format that represents documents as knowledge graphs, enabling more efficient and effective interaction with autonomous LLM agents by reducing token usage and preserving content fidelity.
Contribution
The paper introduces OBJECTGRAPH, a novel document format that models documents as knowledge graphs, addressing structural limitations of existing formats for agent consumption.
Findings
Up to 95.3% token reduction in agent tasks.
No significant degradation in task accuracy (p > 0.05).
98.7% content preservation in transpiler benchmark.
Abstract
Every document format in existence was designed for a human reader moving linearly through text. Autonomous LLM agents do not read - they retrieve. This fundamental mismatch forces agents to inject entire documents into their context window, wasting tokens on irrelevant content, compounding state across multi-turn loops, and broadcasting information indiscriminately across agent roles. We argue this is not a prompt engineering problem, not a retrieval problem, and not a compression problem: it is a format problem. We introduce OBJECTGRAPH (.og), a file format that reconceives the document as a typed, directed knowledge graph to be traversed rather than a string to be injected. OBJECTGRAPH is a strict superset of Markdown - every .md file is a valid .og file - requires no infrastructure beyond a two-primitive query protocol, and is readable by both humans and agents without tooling.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
