
How to Block Applebot-Extended

Applebot-Extended is Apple's AI training crawler — separate from the Applebot that powers Siri and Spotlight. Here's how to opt out of Apple Intelligence training without losing your search presence.

Block time: 60 sec
Siri / Spotlight affected: No
Respects robots.txt: Yes

What is Applebot-Extended?

Apple launched Applebot-Extended alongside Apple Intelligence, the suite of on-device and cloud AI features introduced in iOS 18 and macOS Sequoia. The original Applebot has been crawling the web since 2015 for Siri, Spotlight, and App Store indexing. Applebot-Extended has a single purpose: controlling whether your content becomes training data for Apple's AI models. Apple's documentation describes it as a secondary user agent that does not fetch pages itself; it governs how content already crawled by Applebot may be used.

The key difference: regular Applebot helps your content get found (Siri suggestions, Spotlight results, Safari Reader). Applebot-Extended does something different — it consumes your content as model training data, which means your writing, product descriptions, and original content could end up shaping Apple's AI outputs without any attribution or compensation.

User agent string

Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/17.0 Safari/605.1.15 (Applebot-Extended/1.0)

The token Applebot-Extended is what robots.txt and server rules match against.

Applebot vs. Applebot-Extended

These are two distinct user agents with separate robots.txt rules. Blocking Applebot-Extended does not block Applebot. The reverse does not hold: if you block the parent (Applebot), Applebot-Extended inherits that block.

| Property | Applebot | Applebot-Extended |
| --- | --- | --- |
| Purpose | Siri, Spotlight, App Store, Safari Reader | Apple Intelligence AI model training |
| Active since | 2015 | 2024 (Apple Intelligence launch) |
| User agent token | Applebot | Applebot-Extended |
| Respects robots.txt | Yes | Yes (inherits from Applebot rules) |
| Blocking affects Siri/Spotlight | Yes | No |
| Blocking affects AI training | Yes (parent) | Yes (direct) |

How to Block Applebot-Extended

1. Block Apple Intelligence Training Only (keep Siri/Spotlight)

Add a dedicated block for Applebot-Extended while leaving Applebot allowed. Your Siri and Spotlight presence is unaffected.

# robots.txt
User-agent: Applebot-Extended
Disallow: /

Place this in robots.txt at your domain root (e.g. https://yoursite.com/robots.txt).
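One way to sanity-check this rule before deploying it is to parse it with Python's standard-library `urllib.robotparser`. This is a minimal sketch: the `ROBOTS_TXT` string, the `check` helper, and the yoursite.com URL are placeholders, and Python's parser only approximates how Apple evaluates the file.

```python
from urllib.robotparser import RobotFileParser

# Placeholder robots.txt content matching the rule above.
ROBOTS_TXT = """\
User-agent: Applebot-Extended
Disallow: /
"""


def check(robots_txt: str, agent: str, url: str) -> bool:
    """Return True if `agent` may fetch `url` under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    for agent in ("Applebot", "Applebot-Extended"):
        allowed = check(ROBOTS_TXT, agent, "https://yoursite.com/")
        print(f"{agent}: {'allowed' if allowed else 'blocked'}")
```

Keep the `User-agent` and `Disallow` lines adjacent in the real file. Python's parser, for one, discards a group when a blank line separates the two, and other parsers may be similarly strict.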

2. Block All Apple Crawlers (training + Siri/Spotlight)

If you want to fully exclude Apple's crawlers, block both. Note: this will remove your site from Siri suggestions and Spotlight indexing.

# robots.txt: block all Apple crawlers
User-agent: Applebot
Disallow: /

User-agent: Applebot-Extended
Disallow: /

3. Partial Block: Protect Specific Content

Protect original writing and premium content from AI training while allowing Apple to crawl marketing pages.

# robots.txt: protect specific paths
User-agent: Applebot-Extended
Disallow: /blog/
Disallow: /articles/
Disallow: /members/
Allow: /
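The trailing `Allow: /` does not cancel the `Disallow` lines, because under the Robots Exclusion Protocol (RFC 9309) the longest matching rule wins and `Allow` wins only on a tie. The sketch below illustrates that precedence for the rules above. It assumes the crawler follows RFC 9309 here, it ignores wildcards (`*`, `$`), and `is_allowed` and `RULES` are names made up for the example.

```python
def is_allowed(rules: list[tuple[str, str]], path: str) -> bool:
    """Longest matching prefix wins; on equal length, Allow beats Disallow."""
    best_len = -1
    allowed = True  # a path matched by no rule is allowed
    for directive, prefix in rules:
        # An empty prefix matches nothing, and non-matching rules are skipped.
        if not prefix or not path.startswith(prefix):
            continue
        is_allow = directive.lower() == "allow"
        if len(prefix) > best_len or (len(prefix) == best_len and is_allow):
            best_len = len(prefix)
            allowed = is_allow
    return allowed


# The Applebot-Extended group from the robots.txt above, in file order.
RULES = [
    ("Disallow", "/blog/"),
    ("Disallow", "/articles/"),
    ("Disallow", "/members/"),
    ("Allow", "/"),
]

if __name__ == "__main__":
    for path in ("/blog/my-post", "/members/", "/pricing", "/"):
        print(path, "allowed" if is_allowed(RULES, path) else "blocked")
```

Because precedence depends on match length and not on line order, moving `Allow: /` to the top of the group should give the same result with an RFC 9309 parser.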

4. Server-Level Block (nginx / Apache)

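Use a server-level block for stronger enforcement: robots.txt relies on the crawler honoring it, while a server rule returns 403 to any request whose user agent contains the token. Apple's documentation says Applebot-Extended does not fetch pages itself, so treat these rules as a backstop and keep the robots.txt rule as the primary opt-out.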

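# nginx: block Applebot-Extended (place inside a server or location block)
if ($http_user_agent ~* "Applebot-Extended") {
    return 403;
}

# Apache .htaccess: block Applebot-Extended
# Order/Allow/Deny is Apache 2.2 syntax; Apache 2.4 needs mod_access_compat for it
BrowserMatchNoCase "Applebot-Extended" bad_bot
Order Allow,Deny
Allow from all
Deny from env=bad_bot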
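If you cannot edit the nginx or Apache configuration, the same check can be done in application code. The following is a rough sketch of a WSGI middleware that mirrors the rules above. `block_applebot_extended` is a name invented for this example, and it assumes your application is served through a WSGI-compatible server.

```python
import re
from typing import Callable, Iterable

# Case-insensitive match on the token, like the nginx "~*" operator.
BLOCKED_AGENT = re.compile(r"Applebot-Extended", re.IGNORECASE)


def block_applebot_extended(app: Callable) -> Callable:
    """Wrap a WSGI app so requests carrying the token receive 403."""

    def middleware(environ: dict, start_response: Callable) -> Iterable[bytes]:
        user_agent = environ.get("HTTP_USER_AGENT", "")
        if BLOCKED_AGENT.search(user_agent):
            start_response("403 Forbidden", [("Content-Type", "text/plain")])
            return [b"Forbidden\n"]
        # Every other request passes through to the wrapped app unchanged.
        return app(environ, start_response)

    return middleware
```

Wrap the application object once at startup, for example `app = block_applebot_extended(app)`. A plain `Applebot` user agent does not contain the token, so regular Applebot requests are not affected.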

The Inheritance Rule (Critical)

⚠️ Applebot-Extended inherits from Applebot

If you block Applebot from a path, Applebot-Extended is also blocked from that path, even if you have no explicit Applebot-Extended rule. This means that if you already run a blanket Disallow: / for Applebot, Applebot-Extended cannot access your site at all.

The reverse is not true: blocking Applebot-Extended does not affect Applebot. The inheritance is one-way — child inherits from parent, not the other way around.
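The one-way inheritance can be modeled in a few lines. This is an illustrative sketch, not Apple's implementation: the function names and the prefix-list representation are invented for the example, and path matching is reduced to simple prefix checks.

```python
def is_blocked(path: str, disallowed_prefixes: list[str]) -> bool:
    """True if `path` falls under any disallowed prefix."""
    return any(path.startswith(prefix) for prefix in disallowed_prefixes)


def applebot_allowed(path: str, applebot_disallow: list[str]) -> bool:
    """Applebot (the parent) only looks at its own rules."""
    return not is_blocked(path, applebot_disallow)


def extended_allowed(
    path: str, applebot_disallow: list[str], extended_disallow: list[str]
) -> bool:
    """Applebot-Extended is blocked by its own rules or by an inherited Applebot rule."""
    return applebot_allowed(path, applebot_disallow) and not is_blocked(
        path, extended_disallow
    )


if __name__ == "__main__":
    # Hypothetical site: Applebot is kept out of /private/, Applebot-Extended out of /blog/.
    applebot_rules = ["/private/"]
    extended_rules = ["/blog/"]
    for path in ("/private/report", "/blog/post", "/pricing"):
        print(
            path,
            "Applebot:", applebot_allowed(path, applebot_rules),
            "Applebot-Extended:", extended_allowed(path, applebot_rules, extended_rules),
        )
```

In this model `/private/report` is off limits to both agents, while `/blog/post` is off limits only to Applebot-Extended, which is the asymmetry described above.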

Verify Your Block is Working

After updating robots.txt, verify it's correct before Applebot-Extended's next crawl:

  1. Check your robots.txt is live

    Visit https://yoursite.com/robots.txt in a browser. You should see the Applebot-Extended Disallow rule you added.

  2. Use a robots.txt tester

    Any parser that follows the Robots Exclusion Protocol will do. Google has retired the standalone robots.txt tester in Search Console, so use a third-party tester or a local parser instead. Test the path "/" with user agent "Applebot-Extended" and confirm it shows as blocked.

  3. Check server logs

    Search your access logs for "Applebot-Extended". With a server-level block in place, any matching request should show a 403 response. With robots.txt alone there is no status code to look for, and since Apple's documentation says Applebot-Extended does not fetch pages itself, seeing no entries at all is expected.

  4. Use Open Shadow's bot checker

    Run your domain through Open Shadow's free bot check at /check. It scans your robots.txt and reports which AI bots are allowed, disallowed, or unaddressed.
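The log check in step 3 can be scripted. Below is a minimal sketch that counts response codes for requests whose log line contains the token. It assumes the common/combined access log format, in which the status code directly follows the quoted request. `status_counts` is a made-up helper name, and the sample lines are invented for illustration, not real traffic.

```python
import re
from collections import Counter

# In common/combined log format the status code follows the quoted request line.
STATUS_RE = re.compile(r'"\s(\d{3})\s')

# Invented sample lines (documentation IP addresses, not real traffic).
SAMPLE_LOG = [
    '203.0.113.7 - - [10/Oct/2024:13:55:36 +0000] "GET /blog/ HTTP/1.1" 403 153 "-" "Mozilla/5.0 (compatible; Applebot-Extended/1.0)"',
    '203.0.113.7 - - [10/Oct/2024:13:55:40 +0000] "GET /articles/a HTTP/1.1" 403 153 "-" "Mozilla/5.0 (compatible; Applebot-Extended/1.0)"',
    '203.0.113.9 - - [10/Oct/2024:13:56:02 +0000] "GET / HTTP/1.1" 200 5120 "-" "Mozilla/5.0 (compatible; Applebot/0.1)"',
]


def status_counts(log_lines, token="Applebot-Extended"):
    """Count HTTP status codes for log lines that mention `token`."""
    counts = Counter()
    needle = token.lower()
    for line in log_lines:
        if needle not in line.lower():
            continue
        match = STATUS_RE.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts


if __name__ == "__main__":
    for status, count in sorted(status_counts(SAMPLE_LOG).items()):
        print(status, count)
```

To run it against a real log, pass an open file object instead of `SAMPLE_LOG`, for example `status_counts(open("access.log"))`, where the path is whatever your server uses. An empty result means no request carried the token.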

Should You Block Applebot-Extended?

There's a legitimate tradeoff. Apple Intelligence generates answers, summaries, and suggestions from web content — blocking Applebot-Extended means your brand, products, and content are less likely to influence those outputs. For some publishers that's a feature; for others it's a cost.

Block if you…

  • Create original writing, journalism, or research
  • Sell proprietary data or reports
  • Have a paywall or subscription model
  • Object to your content training AI without compensation
  • Already block other AI training crawlers for consistency

Allow if you…

  • Want your brand mentioned in Apple Intelligence answers
  • Run a marketing or product discovery site
  • Have content that benefits from AI amplification
  • Want to be cited when Apple AI answers questions in your niche
  • Believe AI discoverability is a growth channel

Frequently Asked Questions

What is the difference between Applebot and Applebot-Extended?
Applebot is Apple's general-purpose crawler used for Siri suggestions, Spotlight search, Safari Reader, and App Store search. It has been active since 2015. Applebot-Extended is a separate user agent introduced with Apple Intelligence, and it covers the use of web content to train Apple's AI models. They use different user agent tokens and have different purposes. You can block Applebot-Extended without affecting Applebot, but not the reverse: blocking Applebot also blocks Applebot-Extended.
Will blocking Applebot-Extended affect my Siri or Spotlight presence?
No. Blocking Applebot-Extended only prevents Apple's AI training crawler. The regular Applebot — which powers Siri web results, Spotlight suggestions, and App Store indexing — uses a separate user agent and is unaffected by Applebot-Extended rules in robots.txt. You can safely block Applebot-Extended while keeping Applebot allowed.
Does Apple respect robots.txt for Applebot-Extended?
Yes. Apple has publicly documented that Applebot-Extended respects robots.txt Disallow directives. It also respects the meta tag 'noindex' and will not crawl content disallowed for its parent bot (Applebot). A robots.txt block is the recommended and sufficient opt-out method.
If I already block Applebot, does that also block Applebot-Extended?
Yes, for every path that Applebot is disallowed from. A blanket 'Disallow: /' under 'User-agent: Applebot' blocks both across the whole site. Applebot-Extended inherits permission rules from Applebot: if Applebot is disallowed from a path, Applebot-Extended is also disallowed from that path. If you want to block Applebot-Extended while allowing regular Applebot, you must add a separate 'User-agent: Applebot-Extended' block.
Can I block Apple Intelligence training on specific pages but allow it elsewhere?
Yes. robots.txt supports path-level granularity. You can disallow Applebot-Extended from your premium content, original writing, or paywalled sections while allowing it on marketing pages or documentation. Use specific Disallow paths instead of Disallow: / for partial opt-out.
What user agent string does Applebot-Extended use?
The user agent string is: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/17.0 Safari/605.1.15 (Applebot-Extended/1.0). The key identifier for robots.txt and server-side blocking is the token 'Applebot-Extended'.
