Top-30 Cross-Bot HTML Parity Audit · May 2026

What do GPTBot, ClaudeBot, PerplexityBot, Googlebot, and a real Chrome browser see when they hit the front page of the 30 most-visited websites in the world?

Generated 2026-05-18 22:32 UTC · 30 sites × 10 user-agents · raw data in audit/2026-05/data/ · methodology: direct HTTP, no JS execution, public HasData proxy fallback only for Cloudflare-challenged Chrome baselines.

1 · What this audit measures

For each of the Top-30 most-visited websites worldwide (Similarweb, April 2026), we send the same plain GET / request with ten different User-Agent headers and compare the HTML each one receives back. The ten user-agents are two human browser profiles (Chrome desktop and mobile), two traditional search bots (Googlebot, Bingbot), and six AI/LLM crawlers (GPTBot, OAI-SearchBot, ChatGPT-User, ClaudeBot, PerplexityBot, Applebot).
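A minimal sketch of the request fan-out described above, using only the Python standard library. The User-Agent strings are simplified placeholders rather than the vendors' published strings. The gptbot through applebot ids follow the labels used in the per-site detail, while the Chrome and search-bot ids and the helper names (build_requests, fetch) are assumptions made for illustration.

```python
from urllib.error import HTTPError
from urllib.request import Request, urlopen

# Placeholder UA strings (assumption): each one only carries the product token.
# A real run would substitute the full strings each vendor documents.
USER_AGENTS = {
    "chrome_desktop": "Mozilla/5.0 (placeholder desktop) Chrome/0.0",
    "chrome_mobile": "Mozilla/5.0 (placeholder mobile) Chrome/0.0 Mobile",
    "googlebot": "Mozilla/5.0 (compatible; Googlebot/0.0)",
    "bingbot": "Mozilla/5.0 (compatible; bingbot/0.0)",
    "gptbot": "Mozilla/5.0 (compatible; GPTBot/0.0)",
    "oai_searchbot": "Mozilla/5.0 (compatible; OAI-SearchBot/0.0)",
    "chatgpt_user": "Mozilla/5.0 (compatible; ChatGPT-User/0.0)",
    "claudebot": "Mozilla/5.0 (compatible; ClaudeBot/0.0)",
    "perplexitybot": "Mozilla/5.0 (compatible; PerplexityBot/0.0)",
    "applebot": "Mozilla/5.0 (compatible; Applebot/0.0)",
}


def build_requests(url):
    """Return one GET request per user-agent, identical except for the UA header."""
    return {
        ua_id: Request(url, headers={"User-Agent": ua}, method="GET")
        for ua_id, ua in USER_AGENTS.items()
    }


def fetch(request, timeout=20.0):
    """Perform the request and return (status, body) without raising on 4xx/5xx."""
    try:
        with urlopen(request, timeout=timeout) as response:
            return response.status, response.read()
    except HTTPError as err:
        # A 403 or 429 is a finding, not a failure, so keep its status and body.
        return err.code, err.read()
```

Calling fetch on each value of build_requests("https://www.example.com/") would give one (status, body) pair per user-agent for a single site; no JavaScript is executed at any point.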

The Chrome desktop response is the baseline. For every other UA we record: HTTP status, byte size, the <title>, <h1>, JSON-LD blocks, visible-text length, and a SHA-256 of the normalized visible text. "100% same" means the visible-text hash matches Chrome's exactly. Lower ratios indicate an empty SPA shell, a login wall, or a divergent variant; ratios above 100% mean the UA received more visible text than Chrome did.
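The comparison above can be sketched as follows. The audit does not spell out its normalization, so this version assumes the simplest reading: drop script, style, noscript and template content, collapse whitespace, hash the result, and express the other UA's text length as a fraction of the baseline's. All function names are hypothetical.

```python
import hashlib
import re
from html.parser import HTMLParser


class _VisibleText(HTMLParser):
    """Collect text nodes, ignoring anything inside non-rendered elements."""

    SKIP = {"script", "style", "noscript", "template"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.chunks.append(data)


def visible_text(html):
    """Return whitespace-normalized text of an HTML document."""
    parser = _VisibleText()
    parser.feed(html)
    parser.close()
    return re.sub(r"\s+", " ", " ".join(parser.chunks)).strip()


def fingerprint(html):
    """Return (text length, SHA-256 hex digest) of the normalized text."""
    text = visible_text(html)
    return len(text), hashlib.sha256(text.encode("utf-8")).hexdigest()


def compare(baseline_html, other_html):
    """Compare another UA's response against the Chrome baseline."""
    base_len, base_sha = fingerprint(baseline_html)
    other_len, other_sha = fingerprint(other_html)
    ratio = other_len / base_len if base_len else None  # undefined for an empty baseline
    return {"same": base_sha == other_sha, "ratio": ratio}
```

An empty baseline yields no ratio at all, which is consistent with the yandex.ru rows in the per-site detail, where Chrome's text length is 0 and the hash prefix is that of the empty string.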

A 403 / 429 / Cloudflare challenge is recorded as a finding, not retried via a residential proxy — because the question is precisely "what does this UA see today?". All requests are HTTP-level only (no JS execution), which mirrors what GPTBot / ClaudeBot / PerplexityBot actually do today (Vercel, 2025: none of the major AI crawlers execute JavaScript).
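The block reasons that appear in the tables (http_403, http_999, rate_limited, captcha_page, empty_body) suggest a small classifier along these lines. The rules below are an assumption reconstructed from those labels, not the audit's actual code; in particular the captcha markers are illustrative, and other 4xx codes such as 402 or 404 are left unclassified because the heatmap shows them as bare status codes.

```python
from typing import Optional

# Statuses reported as "http_<code>" blocks in the tables (assumed set).
BLOCK_STATUSES = {403, 999}

# Illustrative markers only; the second one is the title of the captcha page
# recorded for yandex.ru in the per-site detail.
CAPTCHA_MARKERS = ("captcha", "вы не робот")


def classify(status: int, body: bytes) -> Optional[str]:
    """Return a block-reason label, or None when the response is not a block."""
    if status == 429:
        return "rate_limited"
    if status in BLOCK_STATUSES:
        return f"http_{status}"
    if not body:
        return "empty_body"
    text = body.decode("utf-8", errors="replace").lower()
    if any(marker in text for marker in CAPTCHA_MARKERS):
        return "captcha_page"
    return None
```

The rate-limit check runs before the empty-body check so that a 429 with a zero-length body is still reported as rate_limited.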

2 · Why dynamic rendering is back on the table in 2026

Google formally deprecated dynamic rendering (serving a pre-rendered HTML to bots and a JS app to humans) in 2024 and continues to recommend SSR / SSG / hydration instead. But the 2025–2026 reality has shifted the debate:

The audit below is therefore a sanity check on the world's biggest sites: do they actually achieve content parity across human + search + AI user-agents? Or are some bots already getting a thinner, blocked, or divergent view of the web?

3 · Headline numbers

30 sites tested
14 UA-neutral (no blocks, no divergence)
12 sites that block ≥1 AI bot
4 sites that block ALL declared AI bots
User-Agent | 200 OK | Blocked | Byte-identical to Chrome | ≥90% text parity | <20% text (thin)
Chrome Desktop (Human) | 25/30 | 5/30 (17%) | 0/30 (0%) | 22/30 (73%) | 0/30
Chrome Mobile (Human) | 26/30 | 4/30 (13%) | 10/30 (33%) | 14/30 (47%) | 4/30
Googlebot (Search) | 23/30 | 7/30 (23%) | 6/30 (20%) | 12/30 (40%) | 4/30
Bingbot (Search) | 22/30 | 8/30 (27%) | 8/30 (27%) | 16/30 (53%) | 3/30
GPTBot (AI training) | 22/30 | 8/30 (27%) | 13/30 (43%) | 15/30 (50%) | 4/30
OAI-SearchBot (AI search) | 23/30 | 7/30 (23%) | 13/30 (43%) | 16/30 (53%) | 4/30
ChatGPT-User (AI live-fetch) | 23/30 | 7/30 (23%) | 12/30 (40%) | 16/30 (53%) | 4/30
ClaudeBot (AI training) | 22/30 | 8/30 (27%) | 10/30 (33%) | 16/30 (53%) | 4/30
PerplexityBot (AI search) | 23/30 | 7/30 (23%) | 12/30 (40%) | 16/30 (53%) | 4/30
Applebot (AI search) | 23/30 | 7/30 (23%) | 10/30 (33%) | 15/30 (50%) | 3/30

4 · Key findings

4.1 · Hard AI-bot blocks (the gatekeeping sites)

Seven sites return HTTP 4xx to one or more declared AI crawlers from this server's IP while serving Chrome normally. This is the closest thing to a "no robots" policy actually enforced at the edge — and it splits cleanly by vendor:

Note: yahoo.com rate-limits (HTTP 429) five of the six AI bots (GPTBot, OAI-SearchBot, ChatGPT-User, ClaudeBot, PerplexityBot) while still answering Chrome with a 200; only Applebot gets through. linkedin.com blocks only GPTBot (their well-publicized stance against AI training data scraping). ebay.com returns an empty body to every declared AI bot UA and to the search bots, a near-total "humans-and-direct-traffic only" posture from the front page.

4.2 · Sites blocking declared Googlebot / Bingbot from a random IP

Five sites refuse the canonical Googlebot or Bingbot User-Agent when it comes from an IP outside the official bot ranges. The lesson: never trust the User-Agent on its own — these sites verify reverse-DNS or maintain an IP allowlist on top of UA matching.
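The reverse-DNS check mentioned above is the procedure Google documents for verifying Googlebot: resolve the client IP to a hostname, check the hostname's domain, then resolve the hostname forward and confirm it maps back to the same IP. The sketch below assumes IPv4 only and uses injectable resolver arguments so it can be exercised without network access; the function name is hypothetical, and whether any audited site implements exactly this is not something the audit can observe.

```python
import socket

# Hostname suffixes Google documents for its crawlers.
GOOGLE_SUFFIXES = (".googlebot.com", ".google.com", ".googleusercontent.com")


def is_verified_crawler(
    ip,
    suffixes=GOOGLE_SUFFIXES,
    reverse=socket.gethostbyaddr,
    forward=socket.gethostbyname_ex,
):
    """Forward-confirmed reverse DNS: True only if PTR and A records agree."""
    try:
        host = reverse(ip)[0]
    except OSError:  # no PTR record or resolver failure
        return False
    if not host.lower().rstrip(".").endswith(suffixes):
        return False
    try:
        addresses = forward(host)[2]
    except OSError:
        return False
    return ip in addresses
```

A request whose User-Agent says "Googlebot" but whose IP fails this check can be treated like any other anonymous client, which matches the 403 responses recorded from this audit's datacenter address.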

Wikipedia is the most public-spirited example: its anti-abuse stack rejects "Googlebot" claims from non-Google IPs with the same 403 you'd give a curl script — a healthy default that more sites should copy.

4.3 · Pure SSR — the same bytes for everyone

16 of the 30 sites are fully UA-neutral: no blocks, no challenges, every user-agent gets a response. Crucially, 14 of them serve byte-identical visible text (SHA-256 matches) to at least one AI crawler and Chrome — proof that a single SSR pipeline is feeding both browsers and bots:

This is the modern post-deprecation pattern Google asked for in 2024: one HTML response, served to everyone, no UA forking. Amazon, Naver, Netflix, Microsoft, Temu, Twitch, and Gemini all run this way.

4.4 · "Dynamic rendering by another name" — bots get MORE than humans

A handful of sites still return dramatically more content to declared bots than to a real Chrome browser. The most extreme is baidu.com: Chrome gets a 357-character shell and a script that hydrates the search UI, while Bingbot, GPTBot, OAI-SearchBot, ChatGPT-User, ClaudeBot and PerplexityBot all get a fully rendered page with roughly 248,000 characters of visible text. Googlebot receives the same 151-character variant as Chrome Mobile, and Applebot receives the Chrome shell. This is the classic pre-render-for-bots pattern that Google formally deprecated but the rest of the world quietly keeps using.

Whether this counts as "cloaking" depends on whether the content is the same. In Baidu's case the same query box and same brand are present in both responses, just rendered server-side for crawlers — which is exactly the "edge pre-rendering" pattern that PrerenderProxy implements for client sites in Finland.

4.5 · IP-level blockades (everything blocked, not bot-specific)

Four sites returned a hard 4xx to every single UA we tried — including a real Chrome. From this server's IP (a Hetzner datacenter address) they treat all traffic as untrusted regardless of User-Agent:

These are not informative findings about UA-specific behavior — they're informative about the broader trend of "block all datacenter IPs at the edge". From a residential IP all four would likely respond normally; they're included so the data set is honest about which results are constrained by our origin.
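The distinction between a UA-specific block and an IP-level blockade can be sketched as a per-site verdict over the block reasons of all ten user-agents. The labels (ip_blockade, ua_specific_block, no_blocks) and the function name are hypothetical; the AI-bot ids follow the "AI-blocked:" badges in the per-site detail.

```python
AI_BOTS = (
    "gptbot", "oai_searchbot", "chatgpt_user",
    "claudebot", "perplexitybot", "applebot",
)


def site_verdict(block_reasons):
    """block_reasons maps a UA id to its block reason, or None if not blocked."""
    blocked = [ua for ua, reason in block_reasons.items() if reason]
    ai_blocked = [ua for ua in AI_BOTS if block_reasons.get(ua)]
    if blocked and len(blocked) == len(block_reasons):
        label = "ip_blockade"  # every UA refused, including real Chrome
    elif blocked:
        label = "ua_specific_block"
    else:
        label = "no_blocks"
    return {"label": label, "blocked": len(blocked), "ai_blocked": ai_blocked}
```

Fed the wikipedia.org row (Googlebot, Bingbot, ClaudeBot and Applebot refused with http_403), this returns 4 blocked UAs with claudebot and applebot as the AI-blocked subset, the same counts as that site's badge. Note that no_blocks is weaker than the audit's "UA-neutral", which also requires no divergence.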

5 · Site × UA heatmap

Green = byte-identical to Chrome. Yellow = 50–90% text parity. Blue = 10–50% (likely a thinner variant or partial render). Red text = HTTP block (403/429/etc.).
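The legend can be read as a small bucketing rule. The sketch below is an assumption about how a cell might be colored: the legend does not say which bucket owns the 50% and 90% boundaries, nor how cells outside the listed ranges (non-identical responses at 90% or above, or below 10%) are colored, so those are returned as "unclassified". The function name is hypothetical.

```python
def heatmap_cell(ratio, identical=False, block_reason=None):
    """Map one site x UA result to a legend color.

    ratio is the text ratio against Chrome as a fraction (0.42 for 42%),
    or None when it is undefined.
    """
    if block_reason:
        return "red"
    if identical:
        return "green"
    if ratio is None:
        return "unclassified"
    pct = ratio * 100
    if 50 <= pct < 90:  # boundary ownership is an assumption
        return "yellow"
    if 10 <= pct < 50:
        return "blue"
    return "unclassified"
```

For example, a 42% cell such as baidu.com's Chrome Mobile response falls in the blue bucket, and an 86% cell such as cloud.microsoft's falls in the yellow one.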

# | Site | Chrome Desktop (human) | Chrome Mobile (human) | Googlebot (search) | Bingbot (search) | GPTBot (ai_train) | OAI-SearchBot (ai_search) | ChatGPT-User (ai_user) | ClaudeBot (ai_train) | PerplexityBot (ai_search) | Applebot (ai_search)
1 | google.com (search · SSR (search)) | 100% ≈ parity | 88% | 8% thin | 5% thin | 4% thin | 4% thin | 4% thin | 4% thin | 4% thin | 5% thin
2 | youtube.com (video · SPA + SSR shell) | 100% ≈ parity | 3% thin | 3% thin | 120% ≈ parity | 120% ≈ parity | 120% ≈ parity | 120% ≈ parity | 120% ≈ parity | 120% ≈ parity | 120% ≈ parity
3 | facebook.com (social · SPA, login wall) | 100% ≈ parity | 400 | 93% ≈ parity | 129% ≈ parity | 129% ≈ parity | 129% ≈ parity | 129% ≈ parity | 129% ≈ parity | 129% ≈ parity | 129% ≈ parity
4 | instagram.com (social · SPA, login wall) | 100% ≈ parity | 1% thin | 1% thin | 1% thin | 100% same | 100% same | 100% same | 1% thin | 100% same | 1% thin
5 | chatgpt.com (ai_chat · Next.js) | BLOCK http_403 | BLOCK http_403 | BLOCK http_403 | BLOCK http_403 | BLOCK http_403 | BLOCK http_403 | BLOCK http_403 | BLOCK http_403 | BLOCK http_403 | BLOCK http_403
6 | reddit.com (social · Next.js SSR) | BLOCK http_403 | BLOCK http_403 | BLOCK http_403 | BLOCK http_403 | BLOCK http_403 | BLOCK http_403 | BLOCK http_403 | BLOCK rate_limited | BLOCK http_403 | BLOCK http_403
7 | x.com (social · SPA, login wall) | 100% ≈ parity | 100% same | 404 | BLOCK http_403 | 402 | 402 | 402 | 100% ≈ parity | 402 | 404
8 | whatsapp.com (messaging · Marketing site, SSR) | 100% ≈ parity | 400 | 96% ≈ parity | 96% ≈ parity | 96% ≈ parity | 96% ≈ parity | 96% ≈ parity | 96% ≈ parity | 96% ≈ parity | 100% ≈ parity
9 | tiktok.com (video · SPA) | 100% ≈ parity | 100% same | BLOCK http_403 | BLOCK http_403 | 100% same | 100% same | 100% same | 100% same | 100% same | 100% same
10 | bing.com (search · SSR) | 100% ≈ parity | 9% thin | 8% thin | 15% | 15% | 15% | 15% | 15% | 15% | 15%
11 | wikipedia.org (reference · MediaWiki SSR) | 100% ≈ parity | 100% ≈ parity | BLOCK http_403 | BLOCK http_403 | 100% same | 100% same | 100% same | BLOCK http_403 | 100% same | BLOCK http_403
12 | yahoo.co.jp (portal · SSR) | BLOCK http_403 | BLOCK http_403 | 34% | 98% ≈ parity | BLOCK http_403 | BLOCK http_403 | BLOCK http_403 | BLOCK http_403 | BLOCK http_403 | 46%
13 | yahoo.com (portal · SSR) | 100% ≈ parity | 100% same | 442% ≈ parity | 441% ≈ parity | BLOCK rate_limited | BLOCK rate_limited | BLOCK rate_limited | BLOCK rate_limited | BLOCK rate_limited | 436% ≈ parity
14 | yandex.ru (search · SSR) | - | - | - | - | - | - | - | ERR captcha_page | ERR captcha_page | -
15 | gemini.google.com (ai_chat · SPA, auth) | 100% ≈ parity | 100% same | 100% same | 100% same | 100% same | 100% same | 100% same | 100% same | 100% same | 100% same
16 | amazon.com (ecommerce · SSR) | 100% ≈ parity | 100% same | 100% same | 100% same | 100% same | 100% same | 100% same | 100% same | 100% same | 100% same
17 | linkedin.com (social · SSR + SPA) | 100% ≈ parity | 97% ≈ parity | 97% ≈ parity | 98% ≈ parity | ERR http_999 | 98% ≈ parity | 98% ≈ parity | 98% ≈ parity | 98% ≈ parity | 98% ≈ parity
18 | baidu.com (search · SSR) | 100% ≈ parity | 42% | 42% | 69488% ≈ parity | 69488% ≈ parity | 69488% ≈ parity | 69488% ≈ parity | 69488% ≈ parity | 69488% ≈ parity | 100% same
19 | bet.br (betting · SPA) | ERR empty_body | ERR empty_body | ERR empty_body | ERR empty_body | ERR empty_body | ERR empty_body | ERR empty_body | ERR empty_body | ERR empty_body | ERR empty_body
20 | naver.com (portal · SSR) | 100% ≈ parity | 815% ≈ parity | 815% ≈ parity | 100% same | 100% same | 100% same | 100% same | 100% same | 100% same | 100% same
21 | cloud.microsoft (saas · SSR) | 100% ≈ parity | 86% | BLOCK http_403 | BLOCK http_403 | 95% ≈ parity | 95% ≈ parity | 95% ≈ parity | 95% ≈ parity | 87% | BLOCK http_403
22 | netflix.com (video · SSR + SPA) | 100% ≈ parity | 100% ≈ parity | 100% same | 100% same | 100% same | 100% same | 100% ≈ parity | 100% same | 100% same | 100% same
23 | pinterest.com (social · Next.js SSR) | BLOCK http_403 | 394% ≈ parity | 394% ≈ parity | 394% ≈ parity | BLOCK http_403 | BLOCK http_403 | BLOCK http_403 | 394% ≈ parity | 394% ≈ parity | 394% ≈ parity
24 | live.com (saas · Redirect to login) | 100% ≈ parity | 117% ≈ parity | 117% ≈ parity | 100% same | 100% same | 100% same | 100% same | 100% same | 100% same | BLOCK http_403
25 | ebay.com (ecommerce · SSR (substituted at rank 25)) | 100% ≈ parity | 47% | ERR empty_body | ERR empty_body | ERR empty_body | ERR empty_body | ERR empty_body | ERR empty_body | ERR empty_body | ERR empty_body
26 | bilibili.com (video · SSR + SPA) | 100% ≈ parity | 90% ≈ parity | 86% | 100% ≈ parity | 4% thin | 4% thin | 4% thin | 4% thin | 4% thin | 100% same
27 | temu.com (ecommerce · SSR) | - | - | - | - | - | - | - | - | - | -
28 | twitch.tv (video · SPA (substituted at rank 28)) | 100% ≈ parity | 100% same | 100% same | 100% same | 100% same | 100% same | 100% same | 100% same | 100% same | 100% same
29 | dzen.ru (portal · SSR) | - | - | - | - | - | - | - | - | - | -
30 | microsoft.com (saas · SSR) | 100% ≈ parity | 100% same | 100% same | 100% same | 100% same | 100% same | 100% same | 100% same | 100% same | 100% same

6 · Per-site detail

#1 · google.com

search · SSR (search) · https://www.google.com/
UA-neutral
Chrome baseline: title=Google · text=3479 chars · bytes=84306 · jsonld=0 · sha=ecee3cf423a093
User-Agent | HTTP | Bytes | Text chars | vs Chrome | SHA-256 | JSON-LD | <title> | <h1> | Block reason
Chrome Desktop (Human) | 200 | 84306 | 3479 | 100% | ecee3cf423 | 0 | Google | Bevor Sie zu Google weitergehen | -
Chrome Mobile (Human) | 200 | 78493 | 3082 | 88% | 911ca0c241 | 0 | Google | Bevor Sie zu Google weitergehen | -
Googlebot (Search) | 200 | 58781 | 308 | 8% | e407c0b240 | 0 | Google | - | -
Bingbot (Search) | 200 | 21533 | 179 | 5% | 220c89f268 | 0 | Google | - | -
GPTBot (AI training) | 200 | 10272 | 154 | 4% | 22f61e1298 | 0 | Google | - | -
OAI-SearchBot (AI search) | 200 | 10287 | 154 | 4% | 22f61e1298 | 0 | Google | - | -
ChatGPT-User (AI live-fetch) | 200 | 10289 | 154 | 4% | 22f61e1298 | 0 | Google | - | -
ClaudeBot (AI training) | 200 | 10293 | 154 | 4% | 22f61e1298 | 0 | Google | - | -
PerplexityBot (AI search) | 200 | 10291 | 154 | 4% | 22f61e1298 | 0 | Google | - | -
Applebot (AI search) | 200 | 20542 | 179 | 5% | 220c89f268 | 0 | Google | - | -

#2 · youtube.com

video · SPA + SSR shell · https://www.youtube.com/
UA-neutral
Chrome baseline: title=YouTube · text=192 chars · bytes=77828 · jsonld=0 · sha=e116b81604f5a5
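User-Agent | HTTP | Bytes | Text chars | vs Chrome | SHA-256 | JSON-LD | <title> | <h1> | Block reason
Chrome Desktop (Human) | 200 | 77828 | 192 | 100% | e116b81604 | 0 | YouTube | - | -
Chrome Mobile (Human) | 200 | 65063 | 7 | 3% | fb7accfff8 | 0 | YouTube | - | -
Googlebot (Search) | 200 | 61210 | 7 | 3% | fb7accfff8 | 0 | YouTube | - | -
Bingbot (Search) | 200 | 73918 | 232 | 120% | 0b0fb8ded2 | 0 | YouTube | - | -
GPTBot (AI training) | 200 | 76880 | 232 | 120% | 0b0fb8ded2 | 0 | YouTube | - | -
OAI-SearchBot (AI search) | 200 | 77135 | 232 | 120% | 0b0fb8ded2 | 0 | YouTube | - | -
ChatGPT-User (AI live-fetch) | 200 | 77047 | 232 | 120% | 0b0fb8ded2 | 0 | YouTube | - | -
ClaudeBot (AI training) | 200 | 77199 | 232 | 120% | 0b0fb8ded2 | 0 | YouTube | - | -
PerplexityBot (AI search) | 200 | 76949 | 232 | 120% | 0b0fb8ded2 | 0 | YouTube | - | -
Applebot (AI search) | 200 | 74457 | 232 | 120% | 0b0fb8ded2 | 0 | YouTube | - | -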

#3 · facebook.com

social · SPA, login wall · https://www.facebook.com/
1 divergent UA(s)
Chrome baseline: title=Facebook · text=485 chars · bytes=75981 · jsonld=0 · sha=6f0cd26ae2a2dd
User-Agent | HTTP | Bytes | Text chars | vs Chrome | SHA-256 | JSON-LD | <title> | <h1> | Block reason
Chrome Desktop (Human) | 200 | 75981 | 485 | 100% | 6f0cd26ae2 | 0 | Facebook | - | -
Chrome Mobile (Human) | 400 | 1381 | 156 | 32% | 6fab2b75c6 | 0 | Error Facebook | - | -
Googlebot (Search) | 200 | 12030 | 454 | 93% | f50a1f424c | 0 | Facebook — Выполните вход или зарегистрируйтесь | Facebook | -
Bingbot (Search) | 200 | 78145 | 627 | 129% | 280e920a57 | 0 | Facebook | - | -
GPTBot (AI training) | 200 | 85004 | 627 | 129% | 280e920a57 | 0 | Facebook | - | -
OAI-SearchBot (AI search) | 200 | 85072 | 627 | 129% | 280e920a57 | 0 | Facebook | - | -
ChatGPT-User (AI live-fetch) | 200 | 84979 | 627 | 129% | 280e920a57 | 0 | Facebook | - | -
ClaudeBot (AI training) | 200 | 85196 | 627 | 129% | 280e920a57 | 0 | Facebook | - | -
PerplexityBot (AI search) | 200 | 85080 | 627 | 129% | 280e920a57 | 0 | Facebook | - | -
Applebot (AI search) | 200 | 74146 | 627 | 129% | 280e920a57 | 0 | Facebook | - | -

#4 · instagram.com

social · SPA, login wall · https://www.instagram.com/
UA-neutral
Chrome baseline: title=Instagram · text=839 chars · bytes=118606 · jsonld=0 · sha=26085a6f2b8aa4
User-Agent | HTTP | Bytes | Text chars | vs Chrome | SHA-256 | JSON-LD | <title> | <h1> | Block reason
Chrome Desktop (Human) | 200 | 118606 | 839 | 100% | 26085a6f2b | 0 | Instagram | - | -
Chrome Mobile (Human) | 200 | 113553 | 9 | 1% | bad57ef783 | 0 | Instagram | - | -
Googlebot (Search) | 200 | 114433 | 9 | 1% | bad57ef783 | 0 | Instagram | - | -
Bingbot (Search) | 200 | 132837 | 9 | 1% | bad57ef783 | 0 | Instagram | - | -
GPTBot (AI training) | 200 | 118595 | 839 | 100% | byte-identical | 0 | Instagram | - | -
OAI-SearchBot (AI search) | 200 | 118864 | 839 | 100% | byte-identical | 0 | Instagram | - | -
ChatGPT-User (AI live-fetch) | 200 | 119113 | 839 | 100% | byte-identical | 0 | Instagram | - | -
ClaudeBot (AI training) | 200 | 132863 | 9 | 1% | bad57ef783 | 0 | Instagram | - | -
PerplexityBot (AI search) | 200 | 118839 | 839 | 100% | byte-identical | 0 | Instagram | - | -
Applebot (AI search) | 200 | 132932 | 9 | 1% | bad57ef783 | 0 | Instagram | - | -

#5 · chatgpt.com

ai_chat · Next.js · https://chatgpt.com/
10 UA(s) blocked · AI-blocked: gptbot, oai_searchbot, chatgpt_user, claudebot, perplexitybot, applebot
Chrome baseline: title= · text=41 chars · bytes=4723 · jsonld=0 · sha=9fea79acb501af
User-Agent | HTTP | Bytes | Text chars | vs Chrome | SHA-256 | JSON-LD | <title> | <h1> | Block reason
Chrome Desktop (Human) | 403 | 4723 | 41 | 100% | 9fea79acb5 | 0 | - | - | http_403
Chrome Mobile (Human) | 403 | 4673 | 41 | 100% | byte-identical | 0 | - | - | http_403
Googlebot (Search) | 403 | 4663 | 41 | 100% | byte-identical | 0 | - | - | http_403
Bingbot (Search) | 403 | 4583 | 41 | 100% | byte-identical | 0 | - | - | http_403
GPTBot (AI training) | 403 | 4502 | 41 | 100% | byte-identical | 0 | - | - | http_403
OAI-SearchBot (AI search) | 403 | 4473 | 41 | 100% | byte-identical | 0 | - | - | http_403
ChatGPT-User (AI live-fetch) | 403 | 4482 | 41 | 100% | byte-identical | 0 | - | - | http_403
ClaudeBot (AI training) | 403 | 4445 | 41 | 100% | byte-identical | 0 | - | - | http_403
PerplexityBot (AI search) | 403 | 4487 | 41 | 100% | byte-identical | 0 | - | - | http_403
Applebot (AI search) | 403 | 4567 | 41 | 100% | byte-identical | 0 | - | - | http_403

#6 · reddit.com

social · Next.js SSR · https://www.reddit.com/
10 UA(s) blocked · AI-blocked: gptbot, oai_searchbot, chatgpt_user, claudebot, perplexitybot, applebot
Chrome baseline: title= · text=221 chars · bytes=190240 · jsonld=0 · sha=199c03b0e41f0f
User-Agent | HTTP | Bytes | Text chars | vs Chrome | SHA-256 | JSON-LD | <title> | <h1> | Block reason
Chrome Desktop (Human) | 403 | 190240 | 221 | 100% | 199c03b0e4 | 0 | - | - | http_403
Chrome Mobile (Human) | 403 | 190240 | 221 | 100% | byte-identical | 0 | - | - | http_403
Googlebot (Search) | 403 | 1486 | 764 | 345% | 711843b213 | 0 | Blocked | whoa there, pardner! | http_403
Bingbot (Search) | 403 | 1522 | 801 | 362% | 2b65e06aba | 0 | Blocked | whoa there, pardner! | http_403
GPTBot (AI training) | 403 | 1522 | 801 | 362% | 06b0ae3b7d | 0 | Blocked | whoa there, pardner! | http_403
OAI-SearchBot (AI search) | 403 | 190240 | 221 | 100% | byte-identical | 0 | - | - | http_403
ChatGPT-User (AI live-fetch) | 403 | 1522 | 801 | 362% | 3fe1ed4de2 | 0 | Blocked | whoa there, pardner! | http_403
ClaudeBot (AI training) | 429 | 0 | - | 0% | - | - | - | - | rate_limited
PerplexityBot (AI search) | 403 | 190240 | 221 | 100% | byte-identical | 0 | - | - | http_403
Applebot (AI search) | 403 | 190240 | 221 | 100% | byte-identical | 0 | - | - | http_403

#7 · x.com

social · SPA, login wall · https://x.com/
1 UA(s) blocked · 3 divergent UA(s)
Chrome baseline: title= · text=493 chars · bytes=61699 · jsonld=0 · sha=a9213f547242ae
User-Agent | HTTP | Bytes | Text chars | vs Chrome | SHA-256 | JSON-LD | <title> | <h1> | Block reason
Chrome Desktop (Human) | 200 | 61699 | 493 | 100% | a9213f5472 | 0 | - | JavaScript is not available. | -
Chrome Mobile (Human) | 200 | 62601 | 493 | 100% | byte-identical | 0 | - | JavaScript is not available. | -
Googlebot (Search) | 404 | 1498 | 260 | 52% | 452e9cf02b | 0 | X / ? | Nothing to see here | -
Bingbot (Search) | 403 | 1740 | 763 | 154% | 820df8795d | 0 | "Attention Required! | Cloudflare" | Sorry, you have been blocked | http_403
GPTBot (AI training) | 402 | 55 | 55 | 11% | 35e642f0e1 | 0 | - | - | -
OAI-SearchBot (AI search) | 402 | 55 | 55 | 11% | 35e642f0e1 | 0 | - | - | -
ChatGPT-User (AI live-fetch) | 402 | 55 | 55 | 11% | 35e642f0e1 | 0 | - | - | -
ClaudeBot (AI training) | 200 | 27979 | 496 | 100% | 742d653d2a | 1 | X. It’s what’s happening / X | <div dir="ltr" class="css-146c3p1 r-qvutc0 r-37j5jr r-q4m81j r-a023e6 r-rjixqe r-b88u0q r-1awozwy | -
PerplexityBot (AI search) | 402 | 55 | 55 | 11% | 35e642f0e1 | 0 | - | - | -
Applebot (AI search) | 404 | 1498 | 260 | 52% | 452e9cf02b | 0 | X / ? | Nothing to see here | -

#8 · whatsapp.com

messaging · Marketing site, SSR · https://www.whatsapp.com/
UA-neutral
Chrome baseline: title=WhatsApp | Secure and Reliable Free Private Messaging and Calling · text=4515 chars · bytes=46518 · jsonld=0 · sha=5bedce08758c03
User-Agent | HTTP | Bytes | Text chars | vs Chrome | SHA-256 | JSON-LD | <title> | <h1> | Block reason
Chrome Desktop (Human) | 200 | 46518 | 4515 | 100% | 5bedce0875 | 0 | "WhatsApp | Secure and Reliable Free Private Messaging and Calling" | Message privately | -
Chrome Mobile (Human) | 400 | 2608 | 130 | 2% | b50eff329b | 0 | Error | Sorry, something went wrong. | -
Googlebot (Search) | 200 | 44789 | 4355 | 96% | a68c190d5c | 0 | "WhatsApp | Secure and Reliable Free Private Messaging and Calling" | Message privately | -
Bingbot (Search) | 200 | 44723 | 4371 | 96% | bdd3d90dbb | 0 | "WhatsApp | Secure and Reliable Free Private Messaging and Calling" | Message privately | -
GPTBot (AI training) | 200 | 45805 | 4371 | 96% | bdd3d90dbb | 0 | "WhatsApp | Secure and Reliable Free Private Messaging and Calling" | Message privately | -
OAI-SearchBot (AI search) | 200 | 45828 | 4371 | 96% | bdd3d90dbb | 0 | "WhatsApp | Secure and Reliable Free Private Messaging and Calling" | Message privately | -
ChatGPT-User (AI live-fetch) | 200 | 45854 | 4371 | 96% | bdd3d90dbb | 0 | "WhatsApp | Secure and Reliable Free Private Messaging and Calling" | Message privately | -
ClaudeBot (AI training) | 200 | 44716 | 4371 | 96% | bdd3d90dbb | 0 | "WhatsApp | Secure and Reliable Free Private Messaging and Calling" | Message privately | -
PerplexityBot (AI search) | 200 | 45818 | 4371 | 96% | bdd3d90dbb | 0 | "WhatsApp | Secure and Reliable Free Private Messaging and Calling" | Message privately | -
Applebot (AI search) | 200 | 46049 | 4543 | 100% | a2af04161b | 0 | "WhatsApp | Secure and Reliable Free Private Messaging and Calling" | Message privately | -

#9 · tiktok.com

video · SPA · https://www.tiktok.com/
2 UA(s) blocked
Chrome baseline: title=TikTok - Make Your Day · text=22 chars · bytes=76535 · jsonld=0 · sha=9a1dadd3120566
User-Agent | HTTP | Bytes | Text chars | vs Chrome | SHA-256 | JSON-LD | <title> | <h1> | Block reason
Chrome Desktop (Human) | 200 | 76535 | 22 | 100% | 9a1dadd312 | 0 | TikTok - Make Your Day | - | -
Chrome Mobile (Human) | 200 | 40975 | 22 | 100% | byte-identical | 0 | TikTok - Make Your Day | - | -
Googlebot (Search) | 403 | 9 | 9 | 40% | 78342a0905 | 0 | - | - | http_403
Bingbot (Search) | 403 | 9 | 9 | 40% | 78342a0905 | 0 | - | - | http_403
GPTBot (AI training) | 200 | 76037 | 22 | 100% | byte-identical | 0 | TikTok - Make Your Day | - | -
OAI-SearchBot (AI search) | 200 | 76220 | 22 | 100% | byte-identical | 0 | TikTok - Make Your Day | - | -
ChatGPT-User (AI live-fetch) | 200 | 75959 | 22 | 100% | byte-identical | 0 | TikTok - Make Your Day | - | -
ClaudeBot (AI training) | 200 | 76228 | 22 | 100% | byte-identical | 0 | TikTok - Make Your Day | - | -
PerplexityBot (AI search) | 200 | 69005 | 22 | 100% | byte-identical | 0 | TikTok - Make Your Day | - | -
Applebot (AI search) | 200 | 69050 | 22 | 100% | byte-identical | 0 | TikTok - Make Your Day | - | -

#10 · bing.com

search · SSR · https://www.bing.com/
UA-neutral
Chrome baseline: title=Search - Microsoft Bing · text=307 chars · bytes=49536 · jsonld=0 · sha=7826d291ca4a2c
User-Agent | HTTP | Bytes | Text chars | vs Chrome | SHA-256 | JSON-LD | <title> | <h1> | Block reason
Chrome Desktop (Human) | 200 | 49536 | 307 | 100% | 7826d291ca | 0 | Search - Microsoft Bing | - | -
Chrome Mobile (Human) | 200 | 53963 | 28 | 9% | e63e22fb59 | 0 | Search - Microsoft Bing | - | -
Googlebot (Search) | 200 | 54487 | 26 | 8% | 5345384135 | 0 | Haku – Microsoft Bing | - | -
Bingbot (Search) | 200 | 23790 | 47 | 15% | e154afbd8b | 0 | Haku – Microsoft Bing | - | -
GPTBot (AI training) | 200 | 23531 | 47 | 15% | e154afbd8b | 0 | Haku – Microsoft Bing | - | -
OAI-SearchBot (AI search) | 200 | 23581 | 47 | 15% | e154afbd8b | 0 | Haku – Microsoft Bing | - | -
ChatGPT-User (AI live-fetch) | 200 | 23646 | 47 | 15% | e154afbd8b | 0 | Haku – Microsoft Bing | - | -
ClaudeBot (AI training) | 200 | 23493 | 47 | 15% | e154afbd8b | 0 | Haku – Microsoft Bing | - | -
PerplexityBot (AI search) | 200 | 23595 | 47 | 15% | e154afbd8b | 0 | Haku – Microsoft Bing | - | -
Applebot (AI search) | 200 | 26687 | 47 | 15% | e154afbd8b | 0 | Haku – Microsoft Bing | - | -

#11 · wikipedia.org

reference · MediaWiki SSR · https://en.wikipedia.org/wiki/Main_Page
4 UA(s) blocked · AI-blocked: claudebot, applebot
Chrome baseline: title=Wikipedia, the free encyclopedia · text=13309 chars · bytes=49223 · jsonld=1 · sha=afe74582bd4a6e
User-Agent | HTTP | Bytes | Text chars | vs Chrome | SHA-256 | JSON-LD | <title> | <h1> | Block reason
Chrome Desktop (Human) | 200 | 49223 | 13309 | 100% | afe74582bd | 1 | Wikipedia, the free encyclopedia | Main Page | -
Chrome Mobile (Human) | 200 | 49332 | 13411 | 100% | 40e7e11693 | 1 | Wikipedia, the free encyclopedia | Welcome to Wikipedia | -
Googlebot (Search) | 403 | 105 | 105 | 0% | 4ccd03b5e7 | 0 | - | - | http_403
Bingbot (Search) | 403 | 105 | 105 | 0% | 99856652f5 | 0 | - | - | http_403
GPTBot (AI training) | 200 | 49223 | 13309 | 100% | byte-identical | 1 | Wikipedia, the free encyclopedia | Main Page | -
OAI-SearchBot (AI search) | 200 | 49223 | 13309 | 100% | byte-identical | 1 | Wikipedia, the free encyclopedia | Main Page | -
ChatGPT-User (AI live-fetch) | 200 | 49223 | 13309 | 100% | byte-identical | 1 | Wikipedia, the free encyclopedia | Main Page | -
ClaudeBot (AI training) | 403 | 82 | 82 | 0% | 41609a6ede | 0 | - | - | http_403
PerplexityBot (AI search) | 200 | 49223 | 13309 | 100% | byte-identical | 1 | Wikipedia, the free encyclopedia | Main Page | -
Applebot (AI search) | 403 | 105 | 105 | 0% | a3984fbc9a | 0 | - | - | http_403

#12 · yahoo.co.jp

portal · SSR · https://www.yahoo.co.jp/
7 UA(s) blocked · 2 divergent UA(s) · AI-blocked: gptbot, oai_searchbot, chatgpt_user, claudebot, perplexitybot
Chrome baseline: title=【お知らせ】欧州経済領域(EEA)およびイギリスからご利用のお客様へ - Yahoo! JAPAN · text=1876 chars · bytes=10051 · jsonld=0 · sha=893c480de5c714
User-Agent | HTTP | Bytes | Text chars | vs Chrome | SHA-256 | JSON-LD | <title> | <h1> | Block reason
Chrome Desktop (Human) | 403 | 10051 | 1876 | 100% | 893c480de5 | 0 | 【お知らせ】欧州経済領域(EEA)およびイギリスからご利用のお客様へ - Yahoo! JAPAN | - | http_403
Chrome Mobile (Human) | 403 | 10051 | 1876 | 100% | byte-identical | 0 | 【お知らせ】欧州経済領域(EEA)およびイギリスからご利用のお客様へ - Yahoo! JAPAN | - | http_403
Googlebot (Search) | 200 | 23147 | 644 | 34% | 3eae0b8bec | 0 | Yahoo! JAPAN | Yahoo! JAPAN | -
Bingbot (Search) | 200 | 33152 | 1846 | 98% | 12cbceb645 | 0 | Yahoo! JAPAN | Yahoo! JAPAN | -
GPTBot (AI training) | 403 | 10051 | 1876 | 100% | byte-identical | 0 | 【お知らせ】欧州経済領域(EEA)およびイギリスからご利用のお客様へ - Yahoo! JAPAN | - | http_403
OAI-SearchBot (AI search) | 403 | 10051 | 1876 | 100% | byte-identical | 0 | 【お知らせ】欧州経済領域(EEA)およびイギリスからご利用のお客様へ - Yahoo! JAPAN | - | http_403
ChatGPT-User (AI live-fetch) | 403 | 10051 | 1876 | 100% | byte-identical | 0 | 【お知らせ】欧州経済領域(EEA)およびイギリスからご利用のお客様へ - Yahoo! JAPAN | - | http_403
ClaudeBot (AI training) | 403 | 10051 | 1876 | 100% | byte-identical | 0 | 【お知らせ】欧州経済領域(EEA)およびイギリスからご利用のお客様へ - Yahoo! JAPAN | - | http_403
PerplexityBot (AI search) | 403 | 10051 | 1876 | 100% | byte-identical | 0 | 【お知らせ】欧州経済領域(EEA)およびイギリスからご利用のお客様へ - Yahoo! JAPAN | - | http_403
Applebot (AI search) | 200 | 8572 | 879 | 46% | c793ac23de | 0 | Yahoo! JAPAN | Yahoo! JAPAN | -

#13 · yahoo.com

portal · SSR · https://www.yahoo.com/
5 UA(s) blocked · 3 divergent UA(s) · AI-blocked: gptbot, oai_searchbot, chatgpt_user, claudebot, perplexitybot
Chrome baseline: title=Tietosuojavalintasi · text=2521 chars · bytes=14916 · jsonld=0 · sha=9dac5ffe5a2254
User-Agent | HTTP | Bytes | Text chars | vs Chrome | SHA-256 | JSON-LD | <title> | <h1> | Block reason
Chrome Desktop (Human) | 200 | 14916 | 2521 | 100% | 9dac5ffe5a | 0 | Tietosuojavalintasi | guce | -
Chrome Mobile (Human) | 200 | 14918 | 2521 | 100% | byte-identical | 0 | Tietosuojavalintasi | guce | -
Googlebot (Search) | 200 | 289835 | 11163 | 442% | 40f5a850ca | 0 | "Yahoo | Mail, Weather, Search, Politics, News, Finance, Sports & Videos" | - | -
Bingbot (Search) | 200 | 324032 | 11133 | 441% | 0a1f074983 | 1 | "Yahoo | Mail, Weather, Search, Politics, News, Finance, Sports & Videos" | - | -
GPTBot (AI training) | 429 | 23 | 23 | 0% | 0d24c98db9 | 0 | - | - | rate_limited
OAI-SearchBot (AI search) | 429 | 23 | 23 | 0% | 0d24c98db9 | 0 | - | - | rate_limited
ChatGPT-User (AI live-fetch) | 429 | 23 | 23 | 0% | 0d24c98db9 | 0 | - | - | rate_limited
ClaudeBot (AI training) | 429 | 23 | 23 | 0% | 0d24c98db9 | 0 | - | - | rate_limited
PerplexityBot (AI search) | 429 | 23 | 23 | 0% | 0d24c98db9 | 0 | - | - | rate_limited
Applebot (AI search) | 200 | 321154 | 10996 | 436% | ed701d4d1b | 1 | "Yahoo | Mail, Weather, Search, Politics, News, Finance, Sports & Videos" | - | -

#14 · yandex.ru

search · SSR · https://yandex.ru/
2 UA(s) blocked · AI-blocked: claudebot, perplexitybot
Chrome baseline: title= · text=0 chars · bytes=2002 · jsonld=0 · sha=e3b0c44298fc1c
User-AgentHTTPBytesText charsvs ChromeSHA-256JSON-LD<title><h1>block reason
Chrome Desktop
Human
200 20020e3b0c442980
Chrome Mobile
Human
200 20080byte-identical0
Googlebot
Search
200 269489140561bd46b1e750Дзен — главная новостная информационная платформа, которая помогает миллионам людей узнавать, что происходит в мире.
Bingbot
Search
200 36851015257e67a3a9e030Дзен — главная новостная информационная платформа, которая помогает миллионам людей узнавать, что происходит в мире.
GPTBot
AI training
200 18640byte-identical0
OAI-SearchBot
AI search
200 18530byte-identical0
ChatGPT-User
AI live-fetch
200 18570byte-identical0
ClaudeBot
AI training
200 139984668cac1ecdd40Вы не робот?Подтвердите, что запросы отправляли вы, а не роботcaptcha_page
PerplexityBot
AI search
200 140024667dfbcf0f550Вы не робот?Подтвердите, что запросы отправляли вы, а не роботcaptcha_page
Applebot
AI search
200 34137310fb679f7fd50Яндекс — быстрый поиск в интернете

#15 · gemini.google.com

ai_chat · SPA, auth · https://gemini.google.com/
UA-neutral
Chrome baseline: title=‎Google Gemini · text=22 chars · bytes=113291 · jsonld=1 · sha=8f3a3901758f9e
User-Agent | HTTP | Bytes | Text chars | vs Chrome | SHA-256 | JSON-LD | <title> | <h1> | Block reason
Chrome Desktop (Human) | 200 | 113291 | 22 | 100% | 8f3a390175 | 1 | Google Gemini | - | -
Chrome Mobile (Human) | 200 | 113052 | 22 | 100% | byte-identical | 1 | Google Gemini | - | -
Googlebot (Search) | 200 | 112979 | 22 | 100% | byte-identical | 1 | Google Gemini | - | -
Bingbot (Search) | 200 | 110606 | 22 | 100% | byte-identical | 1 | Google Gemini | - | -
GPTBot (AI training) | 200 | 116752 | 22 | 100% | byte-identical | 1 | Google Gemini | - | -
OAI-SearchBot (AI search) | 200 | 116727 | 22 | 100% | byte-identical | 1 | Google Gemini | - | -
ChatGPT-User (AI live-fetch) | 200 | 116797 | 22 | 100% | byte-identical | 1 | Google Gemini | - | -
ClaudeBot (AI training) | 200 | 116845 | 22 | 100% | byte-identical | 1 | Google Gemini | - | -
PerplexityBot (AI search) | 200 | 116823 | 22 | 100% | byte-identical | 1 | Google Gemini | - | -
Applebot (AI search) | 200 | 113304 | 22 | 100% | byte-identical | 1 | Google Gemini | - | -

#16 · amazon.com

ecommerce · SSR · https://www.amazon.com/
UA-neutral
Chrome baseline: title= · text=157 chars · bytes=2007 · jsonld=0 · sha=9711627e613137
User-Agent | HTTP | Bytes | Text chars | vs Chrome | SHA-256 | JSON-LD | <title> | <h1> | Block reason
Chrome Desktop (Human) | 202 | 2007 | 157 | 100% | 9711627e61 | 0 | - | JavaScript is disabled | -
Chrome Mobile (Human) | 202 | 2007 | 157 | 100% | byte-identical | 0 | - | JavaScript is disabled | -
Googlebot (Search) | 202 | 2007 | 157 | 100% | byte-identical | 0 | - | JavaScript is disabled | -
Bingbot (Search) | 202 | 2007 | 157 | 100% | byte-identical | 0 | - | JavaScript is disabled | -
GPTBot (AI training) | 202 | 2007 | 157 | 100% | byte-identical | 0 | - | JavaScript is disabled | -
OAI-SearchBot (AI search) | 202 | 2007 | 157 | 100% | byte-identical | 0 | - | JavaScript is disabled | -
ChatGPT-User (AI live-fetch) | 202 | 2007 | 157 | 100% | byte-identical | 0 | - | JavaScript is disabled | -
ClaudeBot (AI training) | 202 | 2007 | 157 | 100% | byte-identical | 0 | - | JavaScript is disabled | -
PerplexityBot (AI search) | 202 | 2007 | 157 | 100% | byte-identical | 0 | - | JavaScript is disabled | -
Applebot (AI search) | 202 | 2007 | 157 | 100% | byte-identical | 0 | - | JavaScript is disabled | -

#17 · linkedin.com

social · SSR + SPA · https://www.linkedin.com/
1 UA(s) blocked · AI-blocked: gptbot
Chrome baseline: title=LinkedIn: Log In or Sign Up · text=5720 chars · bytes=17260 · jsonld=0 · sha=0a760b42bd52c7
User-Agent | HTTP | Bytes | Text chars | vs Chrome | SHA-256 | JSON-LD | <title> | <h1> | Block reason
Chrome Desktop (Human) | 200 | 17260 | 5720 | 100% | 0a760b42bd | 0 | LinkedIn: Log In or Sign Up | Welcome to your professional community | -
Chrome Mobile (Human) | 200 | 16450 | 5552 | 97% | e06c0b90cf | 0 | LinkedIn: Log In or Sign Up | Welcome to your professional community | -
Googlebot (Search) | 200 | 15952 | 5552 | 97% | c2b4f6d2a0 | 0 | LinkedIn: Log In or Sign Up | Welcome to your professional community | -
Bingbot (Search) | 200 | 16070 | 5605 | 98% | 4007e2c386 | 0 | LinkedIn: Log In or Sign Up | Welcome to your professional community | -
GPTBot (AI training) | 999 | 2109 | 257 | 4% | d8667fb246 | 0 | 999: request failed | Request denied | http_999
OAI-SearchBot (AI search) | 200 | 16406 | 5605 | 98% | 4007e2c386 | 0 | LinkedIn: Log In or Sign Up | Welcome to your professional community | -
ChatGPT-User (AI live-fetch) | 200 | 16405 | 5605 | 98% | 4007e2c386 | 0 | LinkedIn: Log In or Sign Up | Welcome to your professional community | -
ClaudeBot (AI training) | 200 | 16405 | 5605 | 98% | 4007e2c386 | 0 | LinkedIn: Log In or Sign Up | Welcome to your professional community | -
PerplexityBot (AI search) | 200 | 16407 | 5605 | 98% | 4007e2c386 | 0 | LinkedIn: Log In or Sign Up | Welcome to your professional community | -
Applebot (AI search) | 200 | 16404 | 5605 | 98% | 4007e2c386 | 0 | LinkedIn: Log In or Sign Up | Welcome to your professional community | -

#18 · baidu.com

search · SSR · https://www.baidu.com/
2 divergent UA(s)
Chrome baseline: title=百度一下,你就知道 · text=357 chars · bytes=6456 · jsonld=0 · sha=1f9c648d351c0d
User-Agent | HTTP | Bytes | Text chars | vs Chrome | SHA-256 | JSON-LD | <title> | <h1> | Block reason
Chrome Desktop (Human) | 200 | 6456 | 357 | 100% | 1f9c648d35 | 0 | 百度一下,你就知道 | - | -
Chrome Mobile (Human) | 200 | 28260 | 151 | 42% | 3d50f3e2e4 | 0 | 百度一下 | - | -
Googlebot (Search) | 200 | 28206 | 151 | 42% | 3d50f3e2e4 | 0 | 百度一下 | - | -
Bingbot (Search) | 200 | 153079 | 248072 | 69488% | ce715bdf0f | 0 | 百度一下,你就知道 | - | -
GPTBot (AI training) | 200 | 153223 | 248072 | 69488% | ce715bdf0f | 0 | 百度一下,你就知道 | - | -
OAI-SearchBot (AI search) | 200 | 153186 | 248072 | 69488% | ce715bdf0f | 0 | 百度一下,你就知道 | - | -
ChatGPT-User (AI live-fetch) | 200 | 153039 | 248072 | 69488% | ce715bdf0f | 0 | 百度一下,你就知道 | - | -
ClaudeBot (AI training) | 200 | 153211 | 248072 | 69488% | ce715bdf0f | 0 | 百度一下,你就知道 | - | -
PerplexityBot (AI search) | 200 | 152932 | 248072 | 69488% | 1bde2e291f | 0 | 百度一下,你就知道 | - | -
Applebot (AI search) | 200 | 6456 | 357 | 100% | byte-identical | 0 | 百度一下,你就知道 | - | -

#19 · bet.br

betting · SPA · https://bet.br/
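10 UA(s) blocked · AI-blocked: gptbot, oai_searchbot, chatgpt_user, claudebot, perplexitybot, applebot
Chrome baseline: title= · text=0 chars · bytes=0 · jsonld= · sha=
User-Agent | HTTP | Bytes | Text chars | vs Chrome | SHA-256 | JSON-LD | <title> | <h1> | Block reason
Chrome Desktop (Human) | 0 | 0 | - | - | - | - | - | - | empty_body
Chrome Mobile (Human) | 0 | 0 | - | - | - | - | - | - | empty_body
Googlebot (Search) | 0 | 0 | - | - | - | - | - | - | empty_body
Bingbot (Search) | 0 | 0 | - | - | - | - | - | - | empty_body
GPTBot (AI training) | 0 | 0 | - | - | - | - | - | - | empty_body
OAI-SearchBot (AI search) | 0 | 0 | - | - | - | - | - | - | empty_body
ChatGPT-User (AI live-fetch) | 0 | 0 | - | - | - | - | - | - | empty_body
ClaudeBot (AI training) | 0 | 0 | - | - | - | - | - | - | empty_body
PerplexityBot (AI search) | 0 | 0 | - | - | - | - | - | - | empty_body
Applebot (AI search) | 0 | 0 | - | - | - | - | - | - | empty_body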

#21 · cloud.microsoft

saas · SSR · https://cloud.microsoft/
3 UA(s) blocked · AI-blocked: applebot
Chrome baseline: title=Microsoft 365 Copilot - Sign in · text=18312 chars · bytes=47295 · jsonld=1 · sha=2d99e9a72e62e5
User-Agent | HTTP | Bytes | Text chars | vs Chrome | SHA-256 | JSON-LD | <title> | <h1> | Block reason
Chrome Desktop (Human) | 200 | 47295 | 18312 | 100% | 2d99e9a72e | 1 | Microsoft 365 Copilot - Sign in | Microsoft 365 Copilot | -
Chrome Mobile (Human) | 200 | 29629 | 15795 | 86% | e7b7ffa325 | 1 | Microsoft 365 Copilot - Sign in | Microsoft 365 Copilot | -
Googlebot (Search) | 403 | 693 | 240 | 1% | 32de7abf78 | 0 | Error 403 | Error 403 | http_403
Bingbot (Search) | 403 | 693 | 240 | 1% | d62b1153b0 | 0 | Error 403 | Error 403 | http_403
GPTBot (AI training) | 200 | 38441 | 17426 | 95% | d21a888ff8 | 1 | Microsoft 365 Copilot - Sign in | Meet the Microsoft 365 Copilot app | -
OAI-SearchBot (AI search) | 200 | 38432 | 17409 | 95% | a3a7929d9b | 1 | Microsoft 365 Copilot - Sign in | Meet the Microsoft 365 Copilot app | -
ChatGPT-User (AI live-fetch) | 200 | 38486 | 17417 | 95% | 642eb9fd2a | 1 | Microsoft 365 Copilot - Sign in | Meet the Microsoft 365 Copilot app | -
ClaudeBot (AI training) | 200 | 38481 | 17486 | 95% | 138def6b2d | 1 | Microsoft 365 Copilot - Sign in | Meet the Microsoft 365 Copilot app | -
PerplexityBot (AI search) | 200 | 38126 | 15974 | 87% | 9aff26b5f1 | 1 | Microsoft 365 Copilot - Sign in | Meet the Microsoft 365 Copilot app | -
Applebot (AI search) | 403 | 693 | 240 | 1% | 3188a68ddd | 0 | Error 403 | Error 403 | http_403

#22 · netflix.com

video · SSR + SPA · https://www.netflix.com/
UA-neutral
Chrome baseline: title=Netflix Finland – Watch series online, watch films online · text=3315 chars · bytes=79796 · jsonld=0 · sha=deaacab196d926
User-Agent · HTTP · Bytes · Text chars · vs Chrome · SHA-256 · JSON-LD · <title> · <h1> · block reason
Chrome Desktop (Human) · 200 · 79796 · 3315 · 100% · deaacab196 · 0 · Netflix Finland – Watch series online, watch films online · Unlimited films, series and more
Chrome Mobile (Human) · 200 · 80392 · 3321 · 100% · 1c6b952584 · 0 · Netflix Finland – Watch series online, watch films online · Unlimited films, series and more
Googlebot (Search) · 200 · 80334 · 3315 · 100% · byte-identical · 0 · Netflix Finland – Watch series online, watch films online · Unlimited films, series and more
Bingbot (Search) · 200 · 559580 · 3315 · 100% · byte-identical · 0 · Netflix Finland – Watch series online, watch films online · Unlimited films, series and more
GPTBot (AI training) · 200 · 559558 · 3315 · 100% · byte-identical · 0 · Netflix Finland – Watch series online, watch films online · Unlimited films, series and more
OAI-SearchBot (AI search) · 200 · 559580 · 3315 · 100% · byte-identical · 0 · Netflix Finland – Watch series online, watch films online · Unlimited films, series and more
ChatGPT-User (AI live-fetch) · 200 · 559560 · 3321 · 100% · 1c6b952584 · 0 · Netflix Finland – Watch series online, watch films online · Unlimited films, series and more
ClaudeBot (AI training) · 200 · 559621 · 3315 · 100% · byte-identical · 0 · Netflix Finland – Watch series online, watch films online · Unlimited films, series and more
PerplexityBot (AI search) · 200 · 559556 · 3315 · 100% · byte-identical · 0 · Netflix Finland – Watch series online, watch films online · Unlimited films, series and more
Applebot (AI search) · 200 · 559621 · 3315 · 100% · byte-identical · 0 · Netflix Finland – Watch series online, watch films online · Unlimited films, series and more

#23 · pinterest.com

social · Next.js SSR · https://www.pinterest.com/
4 UA(s) blocked · 6 divergent UA(s) · AI-blocked: gptbot, oai_searchbot, chatgpt_user
Chrome baseline: title=Forbidden · text=19 chars · bytes=81 · jsonld=0 · sha=10703995ab9c4a
User-Agent · HTTP · Bytes · Text chars · vs Chrome · SHA-256 · JSON-LD · <title> · <h1> · block reason
Chrome Desktop (Human) · 403 · 81 · 19 · 100% · 10703995ab · 0 · Forbidden · Forbidden · http_403
Chrome Mobile (Human) · 200 · 98078 · 75 · 394% · c8d2bd0b93 · 1 · Pinterest
Googlebot (Search) · 200 · 98762 · 75 · 394% · c8d2bd0b93 · 1 · Pinterest
Bingbot (Search) · 200 · 98130 · 75 · 394% · c8d2bd0b93 · 1 · Pinterest
GPTBot (AI training) · 403 · 2987 · 21 · 110% · e8f5d85dae · 0 · Pinterest - Forbidden ·  · http_403
OAI-SearchBot (AI search) · 403 · 2987 · 21 · 110% · e8f5d85dae · 0 · Pinterest - Forbidden ·  · http_403
ChatGPT-User (AI live-fetch) · 403 · 2987 · 21 · 110% · e8f5d85dae · 0 · Pinterest - Forbidden ·  · http_403
ClaudeBot (AI training) · 200 · 100272 · 75 · 394% · c8d2bd0b93 · 1 · Pinterest
PerplexityBot (AI search) · 200 · 100358 · 75 · 394% · c8d2bd0b93 · 1 · Pinterest
Applebot (AI search) · 200 · 100916 · 75 · 394% · c8d2bd0b93 · 1 · Pinterest

#24 · live.com

saas · Redirect to login · https://www.live.com/
1 UA(s) blocked · AI-blocked: applebot
Chrome baseline: title=Outlook · text=35 chars · bytes=4811 · jsonld=0 · sha=5d13b33f004e6b
User-Agent · HTTP · Bytes · Text chars · vs Chrome · SHA-256 · JSON-LD · <title> · <h1> · block reason
Chrome Desktop (Human) · 200 · 4811 · 35 · 100% · 5d13b33f00 · 0 · Outlook
Chrome Mobile (Human) · 200 · 4294 · 41 · 117% · 9c7b088f1a · 0 · Outlook
Googlebot (Search) · 200 · 4292 · 41 · 117% · 9c7b088f1a · 0 · Outlook
Bingbot (Search) · 200 · 4812 · 35 · 100% · byte-identical · 0 · Outlook
GPTBot (AI training) · 200 · 4811 · 35 · 100% · byte-identical · 0 · Outlook
OAI-SearchBot (AI search) · 200 · 4813 · 35 · 100% · byte-identical · 0 · Outlook
ChatGPT-User (AI live-fetch) · 200 · 4814 · 35 · 100% · byte-identical · 0 · Outlook
ClaudeBot (AI training) · 200 · 4810 · 35 · 100% · byte-identical · 0 · Outlook
PerplexityBot (AI search) · 200 · 4812 · 35 · 100% · byte-identical · 0 · Outlook
Applebot (AI search) · 403 · 0 ·  · 0% ·  ·  ·  ·  · http_403

#25 · ebay.com

ecommerce · SSR (substituted at rank 25) · https://www.ebay.com/
8 UA(s) blocked · AI-blocked: gptbot, oai_searchbot, chatgpt_user, claudebot, perplexitybot, applebot
Chrome baseline: title=Electronics, Cars, Fashion, Collectibles & More | eBay · text=7166 chars · bytes=67058 · jsonld=0 · sha=3f4c2770bd4d39
User-Agent · HTTP · Bytes · Text chars · vs Chrome · SHA-256 · JSON-LD · <title> · <h1> · block reason
Chrome Desktop (Human) · 200 · 67058 · 7166 · 100% · 3f4c2770bd · 0 · Electronics, Cars, Fashion, Collectibles & More | eBay · Electronics, Cars, Fashion, Collectibles & More | eBay
Chrome Mobile (Human) · 200 · 56805 · 3398 · 47% · cc608fe3d3 · 0 · Electronics, Cars, Fashion, Collectibles & More | eBay · Electronics, Cars, Fashion, Collectibles & More | eBay
Googlebot (Search) · 0 · 0 ·  · 0% ·  ·  ·  ·  · empty_body
Bingbot (Search) · 0 · 0 ·  · 0% ·  ·  ·  ·  · empty_body
GPTBot (AI training) · 0 · 0 ·  · 0% ·  ·  ·  ·  · empty_body
OAI-SearchBot (AI search) · 0 · 0 ·  · 0% ·  ·  ·  ·  · empty_body
ChatGPT-User (AI live-fetch) · 0 · 0 ·  · 0% ·  ·  ·  ·  · empty_body
ClaudeBot (AI training) · 0 · 0 ·  · 0% ·  ·  ·  ·  · empty_body
PerplexityBot (AI search) · 0 · 0 ·  · 0% ·  ·  ·  ·  · empty_body
Applebot (AI search) · 0 · 0 ·  · 0% ·  ·  ·  ·  · empty_body

#26 · bilibili.com

video · SSR + SPA · https://www.bilibili.com/
UA-neutral
Chrome baseline: title=哔哩哔哩 (゜-゜)つロ 干杯~-bilibili · text=815 chars · bytes=26658 · jsonld=0 · sha=e70e987eee3330
User-Agent · HTTP · Bytes · Text chars · vs Chrome · SHA-256 · JSON-LD · <title> · <h1> · block reason
Chrome Desktop (Human) · 200 · 26658 · 815 · 100% · e70e987eee · 0 · 哔哩哔哩 (゜-゜)つロ 干杯~-bilibili
Chrome Mobile (Human) · 200 · 34588 · 737 · 90% · 8f7391d14f · 1 · 哔哩哔哩 (゜-゜)つロ 干杯~-bilibili
Googlebot (Search) · 200 · 34786 · 708 · 86% · e864a7d797 · 1 · 哔哩哔哩 (゜-゜)つロ 干杯~-bilibili
Bingbot (Search) · 200 · 26511 · 820 · 100% · 8469036dcb · 0 · 哔哩哔哩 (゜-゜)つロ 干杯~-bilibili
GPTBot (AI training) · 200 · 2318 · 40 · 4% · a67e79824d · 0 · 哔哩哔哩 (゜-゜)つロ 干杯~-bilibili
OAI-SearchBot (AI search) · 200 · 2318 · 40 · 4% · a67e79824d · 0 · 哔哩哔哩 (゜-゜)つロ 干杯~-bilibili
ChatGPT-User (AI live-fetch) · 200 · 2318 · 40 · 4% · a67e79824d · 0 · 哔哩哔哩 (゜-゜)つロ 干杯~-bilibili
ClaudeBot (AI training) · 200 · 2318 · 40 · 4% · a67e79824d · 0 · 哔哩哔哩 (゜-゜)つロ 干杯~-bilibili
PerplexityBot (AI search) · 200 · 2318 · 40 · 4% · a67e79824d · 0 · 哔哩哔哩 (゜-゜)つロ 干杯~-bilibili
Applebot (AI search) · 200 · 27374 · 815 · 100% · byte-identical · 0 · 哔哩哔哩 (゜-゜)つロ 干杯~-bilibili

#27 · temu.com

ecommerce · SSR · https://www.temu.com/
UA-neutral
Chrome baseline: title= · text=0 chars · bytes=1348 · jsonld=0 · sha=e3b0c44298fc1c
User-Agent · HTTP · Bytes · Text chars · vs Chrome · SHA-256 · JSON-LD · <title> · <h1> · block reason
Chrome Desktop (Human) · 200 · 1348 · 0 ·  · e3b0c44298 · 0
Chrome Mobile (Human) · 200 · 1349 · 0 ·  · byte-identical · 0
Googlebot (Search) · 200 · 1355 · 0 ·  · byte-identical · 0
Bingbot (Search) · 200 · 1354 · 0 ·  · byte-identical · 0
GPTBot (AI training) · 200 · 1360 · 0 ·  · byte-identical · 0
OAI-SearchBot (AI search) · 200 · 1354 · 0 ·  · byte-identical · 0
ChatGPT-User (AI live-fetch) · 200 · 1353 · 0 ·  · byte-identical · 0
ClaudeBot (AI training) · 200 · 1354 · 0 ·  · byte-identical · 0
PerplexityBot (AI search) · 200 · 1354 · 0 ·  · byte-identical · 0
Applebot (AI search) · 200 · 1351 · 0 ·  · byte-identical · 0

#28 · twitch.tv

video · SPA (substituted at rank 28) · https://www.twitch.tv/
UA-neutral
Chrome baseline: title=Twitch · text=6 chars · bytes=67847 · jsonld=1 · sha=a731a58c4cf35c
User-Agent · HTTP · Bytes · Text chars · vs Chrome · SHA-256 · JSON-LD · <title> · <h1> · block reason
Chrome Desktop (Human) · 200 · 67847 · 6 · 100% · a731a58c4c · 1 · Twitch
Chrome Mobile (Human) · 200 · 69623 · 6 · 100% · byte-identical · 1 · Twitch
Googlebot (Search) · 200 · 69623 · 6 · 100% · byte-identical · 1 · Twitch
Bingbot (Search) · 200 · 67847 · 6 · 100% · byte-identical · 1 · Twitch
GPTBot (AI training) · 200 · 67847 · 6 · 100% · byte-identical · 1 · Twitch
OAI-SearchBot (AI search) · 200 · 67847 · 6 · 100% · byte-identical · 1 · Twitch
ChatGPT-User (AI live-fetch) · 200 · 67847 · 6 · 100% · byte-identical · 1 · Twitch
ClaudeBot (AI training) · 200 · 67847 · 6 · 100% · byte-identical · 1 · Twitch
PerplexityBot (AI search) · 200 · 67847 · 6 · 100% · byte-identical · 1 · Twitch
Applebot (AI search) · 200 · 67847 · 6 · 100% · byte-identical · 1 · Twitch

#29 · dzen.ru

portal · SSR · https://dzen.ru/
UA-neutral
Chrome baseline: title= · text=0 chars · bytes=1988 · jsonld=0 · sha=e3b0c44298fc1c
User-Agent · HTTP · Bytes · Text chars · vs Chrome · SHA-256 · JSON-LD · <title> · <h1> · block reason
Chrome Desktop (Human) · 200 · 1988 · 0 ·  · e3b0c44298 · 0
Chrome Mobile (Human) · 200 · 1991 · 0 ·  · byte-identical · 0
Googlebot (Search) · 200 · 327367 · 13052 ·  · 63e5265740 · 0 · Дзен — главная новостная информационная платформа, которая помогает миллионам людей узнавать, что происходит в мире.
Bingbot (Search) · 200 · 371846 · 15386 ·  · a1574b7717 · 0 · Дзен — главная новостная информационная платформа, которая помогает миллионам людей узнавать, что происходит в мире.
GPTBot (AI training) · 200 · 1848 · 0 ·  · byte-identical · 0
OAI-SearchBot (AI search) · 200 · 1858 · 0 ·  · byte-identical · 0
ChatGPT-User (AI live-fetch) · 200 · 1848 · 0 ·  · byte-identical · 0
ClaudeBot (AI training) · 200 · 1853 · 0 ·  · byte-identical · 0
PerplexityBot (AI search) · 200 · 1857 · 0 ·  · byte-identical · 0
Applebot (AI search) · 200 · 321017 · 4060 ·  · 16dcccb0b3 · 0 · Дзен — главная новостная информационная платформа, которая помогает миллионам людей узнавать, что происходит в мире.

#30 · microsoft.com

UA-neutral
Chrome baseline: title=Microsoft – tekoäly, pilvi, tuottavuus, tietojenkäsittely, pelaaminen ja sovellukset · text=4150 chars · bytes=23559 · jsonld=0 · sha=7835da38688435
User-Agent · HTTP · Bytes · Text chars · vs Chrome · SHA-256 · JSON-LD · <title> · <h1> · block reason
Chrome Desktop (Human) · 200 · 23559 · 4150 · 100% · 7835da3868 · 0 · Microsoft – tekoäly, pilvi, tuottavuus, tietojenkäsittely, pelaaminen ja sovellukset · Pelaa Forza Horizon 6 Xbox Series X|S:llä
Chrome Mobile (Human) · 200 · 23558 · 4150 · 100% · byte-identical · 0 · Microsoft – tekoäly, pilvi, tuottavuus, tietojenkäsittely, pelaaminen ja sovellukset · Pelaa Forza Horizon 6 Xbox Series X|S:llä
Googlebot (Search) · 200 · 23558 · 4150 · 100% · byte-identical · 0 · Microsoft – tekoäly, pilvi, tuottavuus, tietojenkäsittely, pelaaminen ja sovellukset · Pelaa Forza Horizon 6 Xbox Series X|S:llä
Bingbot (Search) · 200 · 23557 · 4150 · 100% · byte-identical · 0 · Microsoft – tekoäly, pilvi, tuottavuus, tietojenkäsittely, pelaaminen ja sovellukset · Pelaa Forza Horizon 6 Xbox Series X|S:llä
GPTBot (AI training) · 200 · 23557 · 4150 · 100% · byte-identical · 0 · Microsoft – tekoäly, pilvi, tuottavuus, tietojenkäsittely, pelaaminen ja sovellukset · Pelaa Forza Horizon 6 Xbox Series X|S:llä
OAI-SearchBot (AI search) · 200 · 23559 · 4150 · 100% · byte-identical · 0 · Microsoft – tekoäly, pilvi, tuottavuus, tietojenkäsittely, pelaaminen ja sovellukset · Pelaa Forza Horizon 6 Xbox Series X|S:llä
ChatGPT-User (AI live-fetch) · 200 · 23558 · 4150 · 100% · byte-identical · 0 · Microsoft – tekoäly, pilvi, tuottavuus, tietojenkäsittely, pelaaminen ja sovellukset · Pelaa Forza Horizon 6 Xbox Series X|S:llä
ClaudeBot (AI training) · 200 · 23556 · 4150 · 100% · byte-identical · 0 · Microsoft – tekoäly, pilvi, tuottavuus, tietojenkäsittely, pelaaminen ja sovellukset · Pelaa Forza Horizon 6 Xbox Series X|S:llä
PerplexityBot (AI search) · 200 · 23558 · 4150 · 100% · byte-identical · 0 · Microsoft – tekoäly, pilvi, tuottavuus, tietojenkäsittely, pelaaminen ja sovellukset · Pelaa Forza Horizon 6 Xbox Series X|S:llä
Applebot (AI search) · 200 · 23558 · 4150 · 100% · byte-identical · 0 · Microsoft – tekoäly, pilvi, tuottavuus, tietojenkäsittely, pelaaminen ja sovellukset · Pelaa Forza Horizon 6 Xbox Series X|S:llä

7 · How to read the SHA column

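Each cell shows the first 10 characters of the SHA-256 of the normalized visible text (HTML stripped, whitespace collapsed). A cell reading byte-identical means the hash equals the Chrome Desktop baseline. If two UAs share a hash, they received identical visible text, even when the raw HTML byte counts differ. If the texts differ at all, even by a single user-specific token, the hashes diverge, which is useful for spotting personalization, A/B-test buckets, or country-specific bodies served to one UA family but not another. A hash starting e3b0c44298 (as for temu.com and dzen.ru) is the SHA-256 of the empty string: the response carried no visible text at all.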
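The hashing step can be sketched as follows. This is a minimal illustration, not the audit's actual code: the function names `visible_text` and `short_sha`, the set of skipped tags, and the use of Python's standard `html.parser` are all assumptions made for the example, and the audit's own normalizer may treat edge cases differently.

```python
import hashlib
import re
from html.parser import HTMLParser


class _VisibleText(HTMLParser):
    """Collects text nodes, ignoring content inside non-visible elements."""

    # Assumed skip list; the audit's real normalizer is not documented here.
    SKIP = {"script", "style", "noscript", "template"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.parts.append(data)


def visible_text(html: str) -> str:
    """Strip tags, then collapse every whitespace run to a single space."""
    parser = _VisibleText()
    parser.feed(html)
    parser.close()
    return re.sub(r"\s+", " ", " ".join(parser.parts)).strip()


def short_sha(html: str, length: int = 10) -> str:
    """First `length` hex characters of the SHA-256 of the visible text."""
    digest = hashlib.sha256(visible_text(html).encode("utf-8")).hexdigest()
    return digest[:length]
```

Under this sketch, two responses whose markup differs but whose visible text matches produce the same prefix, and a body with no visible text produces e3b0c44298, the same value the temu.com and dzen.ru baselines show.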

8 · About PrerenderProxy

PrerenderProxy is the open custom-built pre-rendering layer behind a number of Finnish high-traffic sites (Elisa, Yritystele, …). It sits at the Fastly edge, detects crawler user-agents, and serves Puppeteer-rendered HTML snapshots to bots while regular users get the live SPA. The conceptual model is the same as what the Top 30 sites above do internally with Next.js / Nuxt / Angular Universal SSR. The difference is that PrerenderProxy retrofits the same bot-readable HTML onto sites that can't easily migrate to a framework-native SSR path. See / for the dashboard (basic-auth required).
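The user-agent routing described above can be illustrated with a small sketch. It is not PrerenderProxy's implementation, whose detection rules are not given here: the token list is simply the eight bot names from this audit, and `is_crawler` and `choose_variant` are hypothetical names introduced for the example.

```python
# Assumed token list: the eight bot user-agents covered by this audit.
CRAWLER_TOKENS = (
    "googlebot",
    "bingbot",
    "gptbot",
    "oai-searchbot",
    "chatgpt-user",
    "claudebot",
    "perplexitybot",
    "applebot",
)


def is_crawler(user_agent: str) -> bool:
    """Case-insensitive substring match against known crawler tokens."""
    ua = user_agent.lower()
    return any(token in ua for token in CRAWLER_TOKENS)


def choose_variant(user_agent: str) -> str:
    """Return 'snapshot' for crawlers and 'spa' for everyone else."""
    return "snapshot" if is_crawler(user_agent) else "spa"
```

A User-Agent header is trivially spoofable, so a production layer would likely pair a match like this with reverse-DNS or published IP-range verification; that step is left out of the sketch.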