Applebot-Extended

Vendor: Apple
Type: AI training
JavaScript rendering: no
Honors robots.txt: yes


Vendor	Apple
Type	AI training opt-out token (not a separate crawler)
robots.txt token	`Applebot-Extended`
JavaScript rendering	n/a — control directive only
Honors robots.txt	Yes
Vendor docs	support.apple.com/HT204683

What it actually is

Applebot-Extended is not a separate bot. It is a robots.txt token that Apple introduced in 2024 to let site owners opt out of having Applebot's collected content used to train Apple Foundation Models (the on-device LLMs powering Apple Intelligence), while still allowing Applebot to crawl for Siri / Spotlight / Safari search.

The HTTP request from Applebot remains a single fetch with a single UA (Applebot/0.1). The robots.txt directive determines what Apple does with the data on its end.

How to use

To opt out of Apple Intelligence training but keep Siri/Spotlight visibility:

User-agent: Applebot
Allow: /

User-agent: Applebot-Extended
Disallow: /

To opt out of both (rare — also removes Siri visibility):

User-agent: Applebot
Disallow: /

Why it exists

When Apple introduced Apple Intelligence in 2024 it needed a way for publishers to opt out of model training without losing search visibility. Rather than a second crawler, Apple used the robots.txt mechanism to express the policy. Google later adopted the same pattern with Google-Extended.

Quirks

Not an active crawler. You will never see an Applebot-Extended/1.0

UA in your logs.

Only meaningful in combination with Applebot — blocking only

Applebot-Extended while allowing Applebot produces the intended outcome (search yes, training no).

Sources

Apple · Applebot-Extended documentation