@@ -5,5 +5,98 @@ eleventyComputed:
 ---
 Sitemap: {{ site.url }}/sitemap.xml
 
-User-agent: *
-Disallow:
+# Block all known AI crawlers and assistants
+# from using content for training AI models.
+# Source: https://robotstxt.com/ai
+User-Agent: GPTBot
+User-Agent: ClaudeBot
+User-Agent: Claude-User
+User-Agent: Claude-SearchBot
+User-Agent: CCBot
+User-Agent: Google-Extended
+User-Agent: Applebot-Extended
+User-Agent: Facebookbot
+User-Agent: Meta-ExternalAgent
+User-Agent: Meta-ExternalFetcher
+User-Agent: diffbot
+User-Agent: PerplexityBot
+User-Agent: Perplexity-User
+User-Agent: Omgili
+User-Agent: Omgilibot
+User-Agent: webzio-extended
+User-Agent: ImagesiftBot
+User-Agent: Bytespider
+User-Agent: TikTokSpider
+User-Agent: Amazonbot
+User-Agent: Youbot
+User-Agent: SemrushBot-OCOB
+User-Agent: Petalbot
+User-Agent: VelenPublicWebCrawler
+User-Agent: TurnitinBot
+User-Agent: Timpibot
+User-Agent: OAI-SearchBot
+User-Agent: ICC-Crawler
+User-Agent: AI2Bot
+User-Agent: AI2Bot-Dolma
+User-Agent: DataForSeoBot
+User-Agent: AwarioBot
+User-Agent: AwarioSmartBot
+User-Agent: AwarioRssBot
+User-Agent: Google-CloudVertexBot
+User-Agent: PanguBot
+User-Agent: Kangaroo Bot
+User-Agent: Sentibot
+User-Agent: img2dataset
+User-Agent: Meltwater
+User-Agent: Seekr
+User-Agent: peer39_crawler
+User-Agent: cohere-ai
+User-Agent: cohere-training-data-crawler
+User-Agent: DuckAssistBot
+User-Agent: Scrapy
+User-Agent: Cotoyogi
+User-Agent: aiHitBot
+User-Agent: Factset_spyderbot
+User-Agent: FirecrawlAgent
+User-Agent: bedrockbot
+User-Agent: DeepSeekBot
+User-Agent: GoogleAgent-Mariner
+User-Agent: Gemini-Deep-Research
+User-Agent: Google-NotebookLM
+User-Agent: Google-Agent
+User-Agent: GoogleAgent-URLContext
+User-Agent: Google-Firebase
+User-Agent: MistralAI-User
+User-Agent: SemrushBot-FT
+User-Agent: SemrushBot-ESI
+User-Agent: AddSearchBot
+User-Agent: bigsur.ai
+User-Agent: Brightbot
+User-Agent: Crawlspace
+User-Agent: EchoboxBot
+User-Agent: FriendlyCrawler
+User-Agent: LinerBot
+User-Agent: Panscient
+User-Agent: Panscient.com
+User-Agent: Poseidon Research Crawler
+User-Agent: SBIntuitionsBot
+User-Agent: TerraCotta
+User-Agent: Thinkbot
+User-Agent: Yak
+User-Agent: YandexAdditional
+User-Agent: YandexAdditionalBot
+
+Disallow: /
+DisallowAITraining: /
+
+# Block any non-specified AI crawlers (e.g., new
+# or unknown bots) from using content for training
+# AI models, while allowing the website to be
+# indexed and accessed by bots. These directives
+# are still experimental and may not be supported
+# by all AI crawlers.
+User-Agent: *
+DisallowAITraining: /
+Content-Usage: ai=n
+Content-Signal: search=yes, ai-input=no, ai-train=no
+Allow: /
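
One way to sanity-check the intent of this change is a small script that groups the rules and asks whether a given crawler may fetch a path. The following is only a minimal sketch and not part of the commit: `parse_groups`, `can_fetch`, the shortened `SAMPLE` text, and the crawler name `ExampleBot` are all hypothetical. It assumes RFC 9309-style grouping (blank lines do not end a group), exact case-insensitive matching of the user-agent token, and plain prefix matching of paths; real crawlers may parse the file differently, especially around the blank line before `Disallow: /`.

```python
# Hypothetical, shortened copy of the rendered robots.txt from this change.
SAMPLE = """\
User-Agent: GPTBot
User-Agent: ClaudeBot

Disallow: /
DisallowAITraining: /

User-Agent: *
DisallowAITraining: /
Content-Usage: ai=n
Content-Signal: search=yes, ai-input=no, ai-train=no
Allow: /
"""


def parse_groups(text):
    """Split robots.txt text into a list of (user_agents, rules) groups."""
    groups = []
    agents, rules = [], []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line or ":" not in line:
            continue  # assumption: blank lines do not terminate a group
        key, value = (part.strip() for part in line.split(":", 1))
        key = key.lower()
        if key == "user-agent":
            if rules:  # a User-Agent line after rules starts a new group
                groups.append((agents, rules))
                agents, rules = [], []
            agents.append(value.lower())
        elif agents:
            # Unknown directives are stored too; can_fetch ignores them.
            rules.append((key, value))
    if agents:
        groups.append((agents, rules))
    return groups


def can_fetch(groups, user_agent, path):
    """Return True if user_agent may fetch path (longest matching rule wins)."""
    ua = user_agent.lower()
    matched = [rules for agents, rules in groups if ua in agents]
    if not matched:  # fall back to the wildcard group
        matched = [rules for agents, rules in groups if "*" in agents]
    best = ("allow", "")  # nothing matched means the path is allowed
    for rules in matched:
        for key, value in rules:
            if key not in ("allow", "disallow") or not value:
                continue
            if not path.startswith(value):
                continue
            longer = len(value) > len(best[1])
            tie_allow = len(value) == len(best[1]) and key == "allow"
            if longer or tie_allow:
                best = (key, value)
    return best[0] == "allow"


if __name__ == "__main__":
    groups = parse_groups(SAMPLE)
    for bot in ("GPTBot", "ClaudeBot", "ExampleBot"):
        print(bot, can_fetch(groups, bot, "/about/"))
```

Under these assumptions the listed crawlers are denied every path, while an unlisted crawler falls through to the `*` group and is allowed. The experimental directives (`DisallowAITraining`, `Content-Usage`, `Content-Signal`) are parsed but have no effect on the fetch decision in this sketch.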