AI Training Audit
Is your website opted in to AI training?
An informational scan of your /robots.txt, /ai.txt and /llms.txt. Reports which AI crawlers you currently allow or block — no compliance verdict, just clarity.
Begin scan
Coverage
What we
read
Three small text files at the root of a domain control how AI crawlers treat your content. None are required. Most websites have none of them.
- № 01
/robots.txt
The standard. We check whether you're allowing or blocking the major AI crawlers — GPTBot, ClaudeBot, Google-Extended, PerplexityBot, CCBot, Applebot-Extended, Bytespider, Meta-ExternalAgent, and around a dozen more.
- № 02
/ai.txt
Spawning.ai's proposed opt-out file for AI training datasets. Uncommon, but if you have one we'll surface it so you know it's still being served.
- № 03
/llms.txt
An emerging standard for sites that want to be readable by LLMs — think of it as a sitemap for AI assistants. Presence is a positive signal that you've considered AI consumption of your content.
Why this matters
No "right" answer. Just clarity.
Some businesses want to be cited by ChatGPT, Claude and Perplexity — visibility in AI answers is the new SEO. Others want to keep their content out of training corpora — IP, licensing, brand control. Both are legitimate.
The audit doesn't pick a side. It tells you what your robots.txt actually says, so you can decide whether that matches your intent.
Last word
Run the scan. See your stance.
Free, instant, and without registration.
Run a free audit