How it works
Paste your sitemap (XML, one URL per line, or sitemap index) and your llms.txt. The checker extracts every URL from both, normalises paths, and cross-references them. It flags four states: aligned (in both), missing from llms.txt (in sitemap but not a primary source), contradiction (in sitemap but blocked by llms.txt), and only in llms.txt (listed as a source but not in the sitemap).
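The cross-referencing described above can be sketched roughly as follows. This is a minimal illustration, not the checker's actual code. The function names (extract_sitemap_urls, normalise, classify), the normalisation rules, and the choice to treat blocked entries as path prefixes are all assumptions. The llms.txt side is passed in as already-extracted lists, and sitemap index entries are not followed to their child sitemaps.

```python
import re
from urllib.parse import urlsplit


def extract_sitemap_urls(text):
    """Pull URLs from sitemap XML (<loc> tags) or from a one-URL-per-line list."""
    locs = re.findall(r"<loc>\s*(.*?)\s*</loc>", text, flags=re.S)
    if locs:
        return locs
    return [line.strip() for line in text.splitlines() if line.strip()]


def normalise(url):
    """Assumed rules: keep only the path, drop any trailing slash except for root."""
    path = urlsplit(url.strip()).path or "/"
    if len(path) > 1:
        path = path.rstrip("/")
    return path or "/"


def classify(sitemap_urls, primary_sources, disallowed):
    """Sort every path into one of the four states the checker reports."""
    sitemap = {normalise(u) for u in sitemap_urls}
    sources = {normalise(u) for u in primary_sources}
    blocked = [normalise(p) for p in disallowed]

    def is_blocked(path):
        # Assumption: a blocked entry covers the exact path and everything under it.
        return any(
            path == b or path.startswith(b.rstrip("/") + "/") for b in blocked
        )

    report = {
        "aligned": [],
        "missing_from_llms": [],
        "contradiction": [],
        "only_in_llms": [],
    }
    for path in sorted(sitemap):
        if is_blocked(path):
            # Assumption: a block takes precedence over a primary-source listing.
            report["contradiction"].append(path)
        elif path in sources:
            report["aligned"].append(path)
        else:
            report["missing_from_llms"].append(path)
    report["only_in_llms"] = sorted(sources - sitemap)
    return report


# Hypothetical inputs, for illustration only.
report = classify(
    sitemap_urls=[
        "https://example.com/",
        "https://example.com/docs/",
        "https://example.com/private/report",
        "https://example.com/blog",
    ],
    primary_sources=["https://example.com/docs", "https://example.com/pricing"],
    disallowed=["/private"],
)
```

With these inputs, /docs is aligned, / and /blog are missing from llms.txt, /private/report is a contradiction, and /pricing is only in llms.txt. Normalising both sides to bare paths is what lets https://example.com/docs/ in the sitemap match https://example.com/docs in llms.txt.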
How to fix each state
- Contradictions - decide if the URL should be crawlable. If yes, remove the Disallow in llms.txt. If no, remove it from the sitemap.
- Missing from llms.txt - these are pages you want indexed but haven't listed as primary sources. Add the important ones to your ## Primary sources section.
- Only in llms.txt - these paths aren't in your sitemap, so crawlers may not discover them through normal means. Either add them to the sitemap or drop them from llms.txt.
Pair with
Build a clean llms.txt with the LLMs.txt Generator, validate syntax in the LLMs.txt Validator, and preview crawler access with the AI Content Access Simulator. Strategy reading: what is llms.txt.