LLM Infrastructure · Free · runs in your browser

Robots.txt × AI crawlers

AI Bot Access Checker

Paste your robots.txt and instantly see which AI crawlers (GPTBot, ChatGPT-User, OAI-SearchBot, ClaudeBot, anthropic-ai, PerplexityBot, Google-Extended) are allowed, blocked or missing - plus the fix for each gap.

Allowed: 4 of 16 · Partial: 11 of 16 · Blocked: 1 of 16

  • GPTBot (OpenAI, critical): Trains ChatGPT models. Allowed.
  • ChatGPT-User (OpenAI, critical): On-demand fetch from ChatGPT. Partial; disallowed: /admin/, /private/.
  • OAI-SearchBot (OpenAI, critical): Indexes for ChatGPT Search. Partial; disallowed: /admin/, /private/.
  • ClaudeBot (Anthropic, critical): Trains Claude models. Allowed.
  • anthropic-ai (Anthropic): Legacy Anthropic crawler. Partial; disallowed: /admin/, /private/.
  • Claude-Web (Anthropic, critical): On-demand fetch from Claude.ai. Partial; disallowed: /admin/, /private/.
  • Google-Extended (Google, critical): Gates Gemini training. Allowed.
  • Googlebot (Google, critical): Powers Google AI Overviews. Partial; disallowed: /admin/, /private/.
  • PerplexityBot (Perplexity, critical): Indexes for Perplexity citations. Allowed.
  • Perplexity-User (Perplexity, critical): On-demand fetch from Perplexity. Partial; disallowed: /admin/, /private/.
  • CCBot (Common Crawl): Feeds many LLM training sets. Partial; disallowed: /admin/, /private/.
  • Applebot-Extended (Apple): Apple Intelligence training. Partial; disallowed: /admin/, /private/.
  • Meta-ExternalAgent (Meta): Llama training. Partial; disallowed: /admin/, /private/.
  • Bytespider (ByteDance): Often blocked for aggression. Blocked.
  • cohere-ai (Cohere): Cohere RAG crawler. Partial; disallowed: /admin/, /private/.
  • Amazonbot (Amazon): Alexa / Q training. Partial; disallowed: /admin/, /private/.

How the checker works

Paste your robots.txt file and we parse every User-agent: group, then check each of the 16 major AI crawlers against the rules. The verdict is one of: allowed (the crawler can fetch every path), partial (the crawler is allowed but some paths are disallowed), blocked (the crawler is fully locked out), or not-mentioned (the crawler has no group of its own and falls back to the wildcard group, User-agent: *). Not-mentioned is usually fine - but if your wildcard group is Disallow: /, every crawler without its own group is locked out and your AI visibility through those crawlers is zero.
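The parse-then-classify flow described above can be sketched in a few lines of Python. This is an illustration, not the checker's actual code: the function names parse_groups and classify are hypothetical, crawler names are matched by exact case-insensitive comparison (real crawlers may match more loosely), and a crawler counts as blocked only when its group has Disallow: / and no Allow rule at all.

```python
from typing import Dict, List, Tuple

Rule = Tuple[str, str]  # (directive, path), e.g. ("disallow", "/admin/")


def parse_groups(robots_txt: str) -> Dict[str, List[Rule]]:
    """Map each lowercased User-agent token to its Allow/Disallow rules."""
    groups: Dict[str, List[Rule]] = {}
    agents: List[str] = []   # user-agents named by the group being read
    in_rules = False         # True once the current group has a rule
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        if ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            if in_rules:  # a User-agent line after rules starts a new group
                agents, in_rules = [], False
            agents.append(value.lower())
            groups.setdefault(value.lower(), [])
        elif field in ("allow", "disallow") and agents:
            in_rules = True
            for agent in agents:
                groups[agent].append((field, value))
    return groups


def classify(groups: Dict[str, List[Rule]], crawler: str) -> Tuple[str, List[str]]:
    """Return (verdict, disallowed paths) for one crawler name."""
    name = crawler.lower()
    if name not in groups:
        # Assumption: report the wildcard group's paths alongside the verdict.
        wildcard = groups.get("*", [])
        return "not-mentioned", [p for d, p in wildcard if d == "disallow" and p]
    rules = groups[name]
    disallowed = [p for d, p in rules if d == "disallow" and p]
    allowed = [p for d, p in rules if d == "allow" and p]
    if "/" in disallowed and not allowed:
        return "blocked", disallowed
    if disallowed:
        return "partial", disallowed
    return "allowed", disallowed


SAMPLE = """
User-agent: *
Disallow: /admin/
Disallow: /private/

User-agent: GPTBot
Allow: /

User-agent: ChatGPT-User
Disallow: /admin/

User-agent: Bytespider
Disallow: /
"""

if __name__ == "__main__":
    parsed = parse_groups(SAMPLE)
    for bot in ("GPTBot", "ChatGPT-User", "Bytespider", "ClaudeBot"):
        print(bot, classify(parsed, bot))
```

In this sketch an empty Disallow: line is treated as "no restriction", and directives such as Sitemap or Crawl-delay are ignored. A production parser would also need longest-match precedence between Allow and Disallow paths, which is left out here.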

How to fix it

  • If a critical bot is blocked, add an explicit group of two lines: "User-agent: GPTBot" followed by "Allow: /".
  • If the wildcard group blocks everything, your site is invisible to every assistant whose crawler lacks its own named group - fix the wildcard before worrying about per-bot rules.
  • Be intentional about Bytespider and CCBot - many sites block these without losing AI visibility, because the major assistants have their own named crawlers anyway.
  • Pair robots.txt with a well-formed llms.txt file for the prose context layer that robots.txt can't express.
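As a rough sketch of the first fix in the list above, the following Python builds an explicit allow group for each critical crawler that came back blocked. The helper names explicit_allow_group and suggest_fixes, and the shape of the verdicts mapping, are assumptions made for illustration; they are not part of the checker.

```python
from typing import Dict, List


def explicit_allow_group(crawler: str) -> str:
    """Build a robots.txt group that lets one crawler fetch every path."""
    return f"User-agent: {crawler}\nAllow: /\n"


def suggest_fixes(verdicts: Dict[str, str], critical: List[str]) -> str:
    """Return robots.txt text that unblocks every blocked critical crawler.

    verdicts maps a crawler name to "allowed", "partial", "blocked" or
    "not-mentioned"; critical lists the crawlers you want to keep reachable.
    """
    blocked = [name for name in critical if verdicts.get(name) == "blocked"]
    # Groups are separated by a blank line, as is conventional in robots.txt.
    return "\n".join(explicit_allow_group(name) for name in blocked)


if __name__ == "__main__":
    sample_verdicts = {
        "GPTBot": "blocked",
        "ClaudeBot": "allowed",
        "Bytespider": "blocked",  # not critical here, so it stays blocked
    }
    print(suggest_fixes(sample_verdicts, ["GPTBot", "ClaudeBot"]))
```

Crawlers left out of the critical list, such as Bytespider in the sample, are deliberately untouched, which matches the advice to be intentional about which bots you unblock.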

Why this matters for AI search

If a crawler is blocked, the assistant that owns it can't index your content - which means you can't be cited, quoted, or summarized in answers from that assistant, no matter how good your content is. The full mechanism is covered in our guides "Do AI assistants follow links" and "Why your site isn't in ChatGPT".

Want this done for you?

Ship the full GEO playbook in 14 days

Geolify GEO packages bundle every tool on this site into a 14-day done-for-you build - llms.txt, schema, entity strength, content overhaul, citations and the measurement stack. From $499.
