Bot reference catalog

One-page summaries of 30 common search, AI/LLM, and social-preview crawlers.


Baiduspider

Vendor: Baidu
Type: Traditional search crawler (China)
robots.txt token: Baiduspider
JavaScript rendering: Minimal; SSR or dynamic rendering strongly recommended
Honors robots.txt: Yes
Vendor docs: help.baidu.com/question?prod_id=99&class=476&id=3001

User-Agent strings

Mozilla/5.0 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/spider.html)

Mobile variant:

Mozilla/5.0 (Linux;u;Android 4.2.2;zh-cn;) AppleWebKit/534.46 (KHTML,like Gecko) Version/5.1 Mobile Safari/10600.6.3 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/spider.html)

Sibling spiders: Baiduspider-image, Baiduspider-video, Baiduspider-news, Baiduspider-favo.
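As a rough illustration of how the strings above might be recognized in server logs, the sketch below extracts the Baiduspider token (including a sibling suffix such as -image) from a User-Agent header. The function name classify_baidu_ua and the regular expression are assumptions made for this example, not something Baidu specifies, and a matching header only shows what the client claims to be.

```python
import re
from typing import Optional

# Illustrative pattern (an assumption, not Baidu's spec): the product token
# "Baiduspider" with an optional sibling suffix such as "-image" or "-video".
_BAIDU_TOKEN = re.compile(r"Baiduspider(?:-(?P<sibling>[a-z]+))?", re.IGNORECASE)


def classify_baidu_ua(user_agent: str) -> Optional[str]:
    """Return the Baiduspider token claimed by a User-Agent header, or None."""
    match = _BAIDU_TOKEN.search(user_agent)
    if match is None:
        return None
    sibling = match.group("sibling")
    # Normalize the casing so log aggregation groups variants together.
    return f"Baiduspider-{sibling.lower()}" if sibling else "Baiduspider"


if __name__ == "__main__":
    desktop = ("Mozilla/5.0 (compatible; Baiduspider/2.0; "
               "+http://www.baidu.com/search/spider.html)")
    print(classify_baidu_ua(desktop))
```

Because User-Agent headers are trivially spoofed, a result from this function is best treated as a hint to be confirmed by other means.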

Purpose

The crawler behind Baidu Search, China's dominant search engine. Also feeds Baidu's ecommerce verticals and content products.

In our audit

baidu.com itself was tested in v1 (top-30 general). Notably, Baidu serves a fully pre-rendered ~250 KB HTML document to Googlebot, Bingbot, GPTBot, ClaudeBot, and PerplexityBot, while Chrome receives a 357-character shell. At roughly 700×, this is the most extreme dynamic-rendering ratio we measured.
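The 700× figure can be reproduced approximately from the two sizes quoted above. This is a back-of-the-envelope sketch that assumes 250 KB means 250,000 bytes and that the 357-character shell is single-byte text; rendering_ratio is a hypothetical helper, not part of the audit tooling.

```python
def rendering_ratio(bot_bytes: int, browser_bytes: int) -> float:
    """Ratio of the payload served to crawlers to the payload served to browsers."""
    if browser_bytes <= 0:
        raise ValueError("browser payload size must be positive")
    return bot_bytes / browser_bytes


# Assumed sizes: ~250 KB taken as 250,000 bytes, 357 characters taken as 357 bytes.
ratio = rendering_ratio(250_000, 357)
print(round(ratio))  # 700
```

Reading KB as 1,024 bytes instead would give a slightly higher ratio, so the figure is best read as an order of magnitude.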

Dynamic rendering is the technique Baidu is best known for on the technical side: Baiduspider has notably weaker JS rendering than Googlebot or Bingbot, so a long tail of Chinese sites pre-render specifically for it. That same infrastructure now serves AI crawlers too.
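A minimal sketch of the server-side decision this kind of setup implies: pick the pre-rendered document for known crawler tokens and the client-side shell for everyone else. The token list is taken from the crawlers named on this page; the function name select_variant and the variant labels are assumptions for illustration, not a description of Baidu's actual infrastructure.

```python
# Crawler tokens named on this page, lowercased for matching. The list and the
# variant labels below are illustrative assumptions.
PRERENDER_TOKENS = (
    "baiduspider",
    "googlebot",
    "bingbot",
    "gptbot",
    "claudebot",
    "perplexitybot",
)


def select_variant(user_agent: str) -> str:
    """Return which HTML variant to serve for a given User-Agent header."""
    ua = user_agent.lower()
    if any(token in ua for token in PRERENDER_TOKENS):
        return "prerendered"   # full server-rendered HTML
    return "client-shell"      # minimal shell that hydrates in the browser


if __name__ == "__main__":
    print(select_variant("Mozilla/5.0 (compatible; Baiduspider/2.0; "
                         "+http://www.baidu.com/search/spider.html)"))
```

A real deployment would typically combine this with crawler verification, since the header alone is easy to forge.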

How to allow / block

User-agent: Baiduspider
Disallow: /private/

User-agent: Baiduspider-image
Disallow: /images/private/
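One way to sanity-check rules like these before deploying them is Python's standard-library urllib.robotparser. The sketch below only checks the Baiduspider group; the helper name baiduspider_allowed and the example.com URLs are assumptions. Sibling tokens such as Baiduspider-image are deliberately not checked here, because the standard-library parser applies the first group whose token is a substring of the agent name, which may differ from how Baidu itself resolves groups.

```python
from urllib.robotparser import RobotFileParser

# The rules from this section, embedded so the check runs without a network call.
ROBOTS_TXT = """\
User-agent: Baiduspider
Disallow: /private/

User-agent: Baiduspider-image
Disallow: /images/private/
"""


def baiduspider_allowed(url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if the Baiduspider group permits fetching the URL."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch("Baiduspider", url)


if __name__ == "__main__":
    print(baiduspider_allowed("https://example.com/private/report.html"))
    print(baiduspider_allowed("https://example.com/index.html"))
```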

Quirks

Recommends dynamic rendering or SSR; Baidu is the only major search vendor that still has it in its docs.

Reverse DNS to *.baidu.com is the standard verification.
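Assuming the verification referred to here is a reverse DNS lookup on the requesting IP, it could be sketched as follows. The forward-confirmation step (resolving the returned hostname back to the same IP) is an added assumption based on common crawler-verification practice, not something this page states. The function name is_verified_baiduspider is hypothetical, and the resolver functions are injectable so the logic can be exercised without network access.

```python
import socket
from typing import Callable, List

# Suffix taken from this page; Baidu's documentation may list additional domains.
ALLOWED_SUFFIXES = (".baidu.com",)


def default_reverse(ip: str) -> str:
    """Reverse (PTR) lookup via the system resolver."""
    return socket.gethostbyaddr(ip)[0]


def default_forward(host: str) -> List[str]:
    """Forward (A record) lookup via the system resolver."""
    return socket.gethostbyname_ex(host)[2]


def is_verified_baiduspider(
    ip: str,
    reverse: Callable[[str], str] = default_reverse,
    forward: Callable[[str], List[str]] = default_forward,
) -> bool:
    """Check that an IP reverse-resolves under *.baidu.com and resolves back."""
    try:
        host = reverse(ip).rstrip(".").lower()
    except OSError:  # socket.herror and socket.gaierror are OSError subclasses
        return False
    if not host.endswith(ALLOWED_SUFFIXES):
        return False
    try:
        # Forward confirmation (assumed step): the hostname must map back to the IP.
        return ip in forward(host)
    except OSError:
        return False
```

In practice the result would usually be cached per IP, since two DNS lookups on every request are expensive.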

Sources