Applebot-Extended
In one line
Applebot-Extended is the identifier Apple introduced to let site owners opt out of AI training separately from regular Applebot, which still powers Siri and Spotlight indexing.
Going deeper
Applebot-Extended is Apple's identifier for the AI-training use of web data. Regular Applebot still powers Siri and Spotlight indexing; the 'extended' version was carved out so site owners can opt out of AI training separately.
The design is essentially the same as Google-Extended. Block one without the other, and you can keep search visibility while opting out of AI training. Control it with an 'Applebot-Extended' rule in robots.txt.
The decision is straightforward: allow if you want eligibility in Apple AI features, block if you want to explicitly opt out. The exposure impact in the Korean market is currently smaller than in global markets, which is worth weighing in.
Sources
Related terms
Google-Extended
Google-Extended is the separate user agent Google uses for training Gemini and Vertex AI, letting site owners control AI training access independently from regular search indexing.
GEO·AEOGPTBot
GPTBot is OpenAI's official web crawler used for ChatGPT training and search indexing — controllable via robots.txt.
GEO·AEOClaudeBot
ClaudeBot is Anthropic's web crawler used for training Claude and grounding its answers — manageable via robots.txt.
GEO·AEOCCBot
CCBot is the crawler operated by the nonprofit Common Crawl — and the dataset it produces is the starting point for the training data of many LLMs.
GEO·AEOllms.txt
llms.txt is a proposed text file placed at the site root that tells large language models where the most important content lives — think 'sitemap, but written for LLMs'.
How does your brand show up in AI answers?
Villion measures how your brand appears across ChatGPT, Perplexity and AI Overviews, then automates the work that lifts citation rate and share of voice.
Get a free audit