Open-domain multimodal document retrieval must navigate a graph of paragraphs, tables, and images where meaning depends on local context and hyperlinks. Existing graph-based retrievers rely on hop-agnostic similarity scores and rigid plans, so they struggle to recover when early hops are misleading or underspecified.
Failure is Feedback (FiF) reframes traversal as a sequential decision process. An orchestrator maintains a structured memory of successes and failures, escalates from lightweight vector matching to LLM reasoning only when a hop is ambiguous, and performs history-aware backtracking that re-anchors search using failure traces. Experiments on MultimodalQA, MMCoQA, and WebQA show state-of-the-art retrieval while keeping API usage cost-aware.