№ 04 / SolutionsFree

  AI Training Audit

Is your website opted in to AI training?

An informational scan of your /robots.txt, /ai.txt and /llms.txt. Reports which AI crawlers you currently allow or block — no compliance verdict, just clarity.

Begin scan

Reads three text files at the root of your domain. Takes a few seconds. No data stored beyond the public report.

Coverage

What we
read

Three small text files at the root of a domain control how AI crawlers treat your content. None are required. Most websites have none of them.

  • № 01

    /robots.txt

    The standard. We check whether you're allowing or blocking the major AI crawlers — GPTBot, ClaudeBot, Google-Extended, PerplexityBot, CCBot, Applebot-Extended, Bytespider, Meta-ExternalAgent, and around a dozen more.

  • № 02

    /ai.txt

    Spawning.ai's proposed opt-out file for AI training datasets. Uncommon, but if you have one we'll surface it so you know it's still being served.

  • № 03

    /llms.txt

    An emerging standard for sites that want to be readable by LLMs — think of it as a sitemap for AI assistants. Presence is a positive signal that you've considered AI consumption of your content.

Why this matters

No "right" answer. Just clarity.

Some businesses want to be cited by ChatGPT, Claude and Perplexity — visibility in AI answers is the new SEO. Others want to keep their content out of training corpora — IP, licensing, brand control. Both are legitimate.

The audit doesn't pick a side. It tells you what your robots.txt actually says, so you can decide whether that matches your intent.

Last word

Run the scan. See your stance.

Free, instant, and without registration.

Run a free audit