How it works
Paste your robots.txt and a list of paths you want to test. The simulator parses each User-agent block, then runs every path through the longest-match rule for each of the 11 AI crawlers tracked here. You see a matrix of green allows and red blocks, plus a per-bot summary of how many of your test paths are reachable.
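The longest-match evaluation described above can be sketched in a few lines. This is a minimal illustration, not the simulator's actual code: `evaluate` and the sample rules are hypothetical names, and the logic follows the RFC 9309 convention that the rule with the longest matching path prefix wins, with ties going to Allow.

```python
def evaluate(rules, path):
    """rules: list of (directive, prefix) pairs from one User-agent block.
    Returns True if the path is allowed for that agent."""
    best_len, allowed = -1, True  # no matching rule means the path is allowed
    for directive, prefix in rules:
        if not prefix or not path.startswith(prefix):
            continue
        if len(prefix) > best_len:
            # Longer prefix wins outright
            best_len, allowed = len(prefix), directive == "allow"
        elif len(prefix) == best_len and directive == "allow":
            # Equal length: Allow takes precedence over Disallow
            allowed = True
    return allowed

# Hypothetical block for a single agent
gptbot_rules = [("disallow", "/private/"), ("allow", "/private/press/")]

print(evaluate(gptbot_rules, "/private/press/kit.html"))  # True: longer Allow wins
print(evaluate(gptbot_rules, "/private/notes.txt"))       # False: Disallow matches
print(evaluate(gptbot_rules, "/blog/post"))               # True: no rule matches
```

Running each test path through this check once per tracked crawler produces exactly the allow/block matrix the simulator renders.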
Which bots matter
- GPTBot + ChatGPT-User + OAI-SearchBot - OpenAI splits training, browse and search into three separate agents. Block one and you only affect that behaviour.
- ClaudeBot + Claude-Web - Anthropic uses ClaudeBot for training and Claude-Web for real-time browsing. Block ClaudeBot if you're opting out of training.
- PerplexityBot + Perplexity-User - Two agents. Perplexity-User represents live user queries, so blocking it removes you from Perplexity answers entirely.
- Google-Extended + Applebot-Extended - Training opt-outs for Gemini and Apple Intelligence that don't affect traditional search ranking.
- CCBot, Bytespider - Common Crawl and ByteDance scrapers that most operators block outright.
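To make the distinctions above concrete, here is one hedged example of a robots.txt that opts out of training while staying visible in AI answers. The paths are placeholders; the agent tokens are the ones the list above describes.

```
# Block training crawlers entirely
User-agent: GPTBot
Disallow: /

User-agent: ClaudeBot
Disallow: /

User-agent: CCBot
Disallow: /

User-agent: Bytespider
Disallow: /

# Training opt-outs that leave search ranking untouched
User-agent: Google-Extended
Disallow: /

User-agent: Applebot-Extended
Disallow: /

# Allow live-query agents so the site still appears in AI answers
User-agent: ChatGPT-User
Allow: /

User-agent: Perplexity-User
Allow: /
```

Pasting a ruleset like this into the simulator should show the training bots fully blocked and the user-query agents fully reachable.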
Pair with
Generate a clean ruleset with the AI Crawl Rule Generator, then pair it with an LLMs.txt Generator and cross-check with the Sitemap vs LLMs.txt Consistency Checker. Strategy reading: what is llms.txt.