PageIndex is a reasoning-based retrieval augmented generation (RAG) framework designed to process long documents by converting them into tree-structured indexes instead of relying on vector similarity search. This approach allows large language models to perform agentic reasoning over the document's structure, simulating how human experts navigate complex documents to find relevant information.