Loading paper
Let your LLM generate a few tokens and you will reduce the need for retrieval | Tomesphere