Flash deal ends in--:--:--— use GEO10 for 10% off

LLM InfrastructureFree · runs in your browser

Opinionated policy → full ruleset

AICrawlRule Generator

Pick an opinionated policy and emit matching robots.txt and llms.txt rules for nine AI crawlers including GPTBot, ClaudeBot, PerplexityBot, Google-Extended and Applebot-Extended. Why this matters →

Paths blocked universally

Bot policies

OpenAI training (GPTBot)

Opts out of training. Does not affect live ChatGPT answers.

Allow

OpenAI live (ChatGPT-User, OAI-SearchBot)

Blocks live browse and SearchGPT. Usually you want these ALLOWED.

Allow

Anthropic training (ClaudeBot)

Opts out of Claude model training.

Allow

Anthropic live (Claude-Web)

Blocks live Claude browse. Usually ALLOWED.

Allow

Perplexity (PerplexityBot, Perplexity-User)

PerplexityBot is training, Perplexity-User is live queries.

Allow

Google-Extended (Gemini training)

Opts out of Gemini training without affecting Google search.

Allow

Applebot-Extended (Apple Intelligence training)

Opts out of Apple AI training without affecting Siri/Spotlight.

Allow

Common Crawl (CCBot)

Large public web archive. Used by most LLM training sets.

Block

ByteDance (Bytespider)

Aggressive scraper. Most operators block it.

Block
User-agent: *
Disallow: /admin/
Disallow: /cart/
Disallow: /checkout/
Disallow: /account/
Disallow: /api/

User-agent: GPTBot
Allow: /
Disallow: /admin/
Disallow: /cart/
Disallow: /checkout/
Disallow: /account/
Disallow: /api/

User-agent: ChatGPT-User
Allow: /
Disallow: /admin/
Disallow: /cart/
Disallow: /checkout/
Disallow: /account/
Disallow: /api/

User-agent: OAI-SearchBot
Allow: /
Disallow: /admin/
Disallow: /cart/
Disallow: /checkout/
Disallow: /account/
Disallow: /api/

User-agent: ClaudeBot
Allow: /
Disallow: /admin/
Disallow: /cart/
Disallow: /checkout/
Disallow: /account/
Disallow: /api/

User-agent: Claude-Web
Allow: /
Disallow: /admin/
Disallow: /cart/
Disallow: /checkout/
Disallow: /account/
Disallow: /api/

User-agent: PerplexityBot
Allow: /
Disallow: /admin/
Disallow: /cart/
Disallow: /checkout/
Disallow: /account/
Disallow: /api/

User-agent: Perplexity-User
Allow: /
Disallow: /admin/
Disallow: /cart/
Disallow: /checkout/
Disallow: /account/
Disallow: /api/

User-agent: Google-Extended
Allow: /
Disallow: /admin/
Disallow: /cart/
Disallow: /checkout/
Disallow: /account/
Disallow: /api/

User-agent: Applebot-Extended
Allow: /
Disallow: /admin/
Disallow: /cart/
Disallow: /checkout/
Disallow: /account/
Disallow: /api/

User-agent: CCBot
Disallow: /

User-agent: Bytespider
Disallow: /

How it works

Pick a policy for each bot group and list any paths you want blocked universally. The generator emits a robots.txt and matching llms.txt ## Bot policy block with the right User-agent rules for GPTBot, ChatGPT-User, OAI-SearchBot, ClaudeBot, Claude-Web, PerplexityBot, Perplexity-User, Google-Extended, Applebot-Extended, CCBot and Bytespider.

Sensible defaults

  • Allow ChatGPT-User and Perplexity-User. These represent real user queries and blocking them removes you from live answers.
  • Block CCBot and Bytespider by default. Common Crawl is the main training corpus for open models and Bytespider is aggressive with few benefits.
  • Universal blocks for cart, admin, account. These should be Disallowed for every user-agent regardless of policy.

Pair with

Once generated, simulate access with the AI Content Access Simulator, validate syntax with the LLMs.txt Validator, and cross-check against your sitemap with the Sitemap vs LLMs.txt Consistency Checker. Strategy reading: what is llms.txt.

Want this done for you?

Ship the full GEO playbook in 14 days

Geolify GEO packages bundle every tool on this site into a 14-day done-for-you build - llms.txt, schema, entity strength, content overhaul, citations and the measurement stack. From $499.

Explore More Packages

Combine services for maximum AI visibility.