Click any bot for a one-page reference covering UA strings, network identity, verification, robots.txt directives, and quirks.
Search
AI training
AI search-index
AI live-fetch
The training / search / live taxonomy
The most important single thing in this catalog: major AI vendors operate three separate bots — training, search index, and live retrieval — with three distinct robots.txt tokens. Most "Block AI Bots" WAF presets do not split them. A site that wants AI shopping visibility but not to be training data should block the training-tier UAs and allow the live-retrieval and search-index tiers.
| Vendor | Training | Search index | Live retrieval |
|---|---|---|---|
| OpenAI | GPTBot | OAI-SearchBot | ChatGPT-User |
| Anthropic | ClaudeBot | Claude-SearchBot | Claude-User |
| Perplexity | PerplexityBot | (combined) | Perplexity-User |
| Meta | Meta-ExternalAgent | (combined) | Meta-ExternalFetcher |
| Apple | Applebot-Extended (opt-out) | Applebot | Applebot |
Google-Extended (opt-out) | Googlebot | n/a |