Bot reference catalog

One-page summaries of 30 common search, AI/LLM, and social-preview crawlers.

← bot index · audit index

PerplexityBot

Vendor
Perplexity AI
Type
AI search-index
JavaScript rendering
no
Honors robots.txt
partial
VendorPerplexity AI
TypeSearch-index crawler
robots.txt tokenPerplexityBot
JavaScript renderingNo — HTTP-only
Honors robots.txtPartial — has been observed ignoring robots.txt directives in 2024
Vendor docsdocs.perplexity.ai/guides/bots

User-Agent strings

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; PerplexityBot/1.0; +https://perplexity.ai/perplexitybot)

Purpose

Builds the index Perplexity's search-grounded answers draw from. Pages crawled by PerplexityBot are eligible to appear as cited sources in Perplexity's answer cards.

Network identity

than OpenAI's or Anthropic's.

In our audit

Blocked at 62/100 sites as part of the AI-bot cluster. Where allowed, received the same content as the other AI bots — Perplexity's bot infrastructure does not perform any special handshake.

At amazon.co.uk, shopping.yahoo.co.jp, and the other dynamic-rendering sites, PerplexityBot is not in the trusted-crawler allowlist — it receives the small shell while Googlebot/Bingbot/Applebot receive the pre-rendered version.

How to allow / block

To allow indexing:

User-agent: PerplexityBot
Allow: /

To block:

User-agent: PerplexityBot
Disallow: /

Quirks

sites that had disallowed PerplexityBot in robots.txt, by routing requests through residential IPs with a generic Chrome UA. Perplexity has since updated its policies, but the incident left trust friction with some publishers.

(next file).

Sources