Free llms.txt Checker for AI Crawlers

BLUF: Check whether ChatGPT, Claude, Perplexity, and other AI crawlers can access your homepage, robots.txt, llms.txt, the optional llms-full.txt, and the important URLs listed inside llms.txt.

Check if AI crawlers can access your site. Paste a domain below to test live crawler access and quickly spot whether the issue is in robots.txt, llms.txt, llms-full.txt, linked URLs, or edge rules such as Cloudflare.

Looking for more utilities like this? Visit the WebDesy SEO Tools page.

What the Checker Reviews

Whether /llms.txt returns 200 and is served as plain text.
Whether /robots.txt includes an Llms-txt: discovery line that points to the live /llms.txt URL.
Whether tested AI user agents, including GPTBot, OAI-SearchBot, ChatGPT-User, ClaudeBot, Claude-User, and PerplexityBot, can reach the homepage and /llms.txt.
Whether /llms-full.txt is healthy when it exists or is referenced.
Whether the public links inside llms.txt return successful responses.
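
The first and last checks in the list above can be sketched in a few lines of Python. This is a minimal illustration, not the checker's actual code: the function names are hypothetical, and it assumes llms.txt lists its URLs as Markdown links, which is the common convention but not something every file follows.

```python
import re


def is_plain_text_ok(status: int, content_type: str) -> bool:
    """Return True when a response is 200 and its media type is text/plain."""
    # Drop parameters such as "; charset=utf-8" before comparing.
    media_type = content_type.split(";", 1)[0].strip().lower()
    return status == 200 and media_type == "text/plain"


# Assumption: links appear as Markdown, e.g. [Docs](https://example.com/docs).
MARKDOWN_LINK = re.compile(r"\[[^\]]*\]\((https?://[^)\s]+)\)")


def extract_links(llms_text: str) -> list[str]:
    """Return absolute http(s) URLs from Markdown links, in order, without duplicates."""
    seen: list[str] = []
    for url in MARKDOWN_LINK.findall(llms_text):
        if url not in seen:
            seen.append(url)
    return seen
```

Each extracted URL can then be requested in turn to confirm it returns a successful response.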

Example Results

Healthy setup

/robots.txt returns 200 and points to /llms.txt.
/llms.txt returns 200 as plain text.
Major AI crawler user agents can reach the homepage and /llms.txt.
Important URLs listed inside llms.txt return successful responses.

Needs attention

A normal browser gets 200, but AI crawler user agents get 403.
Cloudflare or another edge layer blocks crawler requests.
/llms-full.txt is referenced but returns 404.
URLs listed in llms.txt redirect, 404, or time out.
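
The first two symptoms above come down to comparing a baseline status code with the status codes each crawler user agent receives. The sketch below shows that comparison under stated assumptions: `diagnose` is a hypothetical helper, and it treats only 403 as a block because that is the case described here.

```python
def diagnose(browser_status: int, crawler_statuses: dict[str, int]) -> list[str]:
    """Compare a browser-like baseline against per-crawler status codes."""
    findings: list[str] = []
    # Assumption: 403 is the block signal; other codes are not classified here.
    blocked = sorted(ua for ua, status in crawler_statuses.items() if status == 403)
    if browser_status != 200:
        findings.append(f"Baseline request failed with {browser_status}; fix this first")
    elif blocked:
        findings.append("Likely user-agent or edge block for: " + ", ".join(blocked))
    return findings
```

A finding of the second kind usually points at the edge layer rather than at the site files.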

What to Do If It Fails

If AI crawler user agents get 403, check Cloudflare AI Crawl Control, WAF rules, bot rules, and user-agent blocks.
If robots.txt is missing the discovery line, add the correct Llms-txt: URL and keep private paths blocked.
If /llms-full.txt returns 404, publish it as plain text or remove every reference to it.
If linked URLs fail, fix the listed URLs, replace them in llms.txt, or redirect them to relevant live pages.
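
To confirm the discovery line is present, robots.txt can be scanned for it directly. The sketch below assumes the line has the form `Llms-txt: <URL>`, as used on this page; as far as I know this field is a convention rather than part of the Robots Exclusion Protocol, so standard robots.txt parsers will generally skip it. The function name is hypothetical.

```python
def find_llms_txt_urls(robots_text: str) -> list[str]:
    """Return the values of every Llms-txt: line found in a robots.txt body."""
    urls: list[str] = []
    for raw_line in robots_text.splitlines():
        # Strip trailing comments, then split the field name from its value.
        line = raw_line.split("#", 1)[0].strip()
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "llms-txt" and value.strip():
            urls.append(value.strip())
    return urls
```

An empty result means the discovery line is missing; more than one result is worth reviewing so that only the intended URL is advertised.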

Need help fixing a failed check? Contact WebDesy and send the copied report. I can review the website files, robots policy, Cloudflare AI Crawl Control, WAF rules, and llms.txt setup.

Sample Report

A healthy result should show PASS for homepage access, llms.txt access, plain-text content type, the Llms-txt: discovery line, AI crawler user-agent access, and linked URL checks.

AI crawler and llms.txt check
Result: PASS (100%)
Main checks:
- PASS: llms.txt access
- PASS: robots.txt Llms-txt line
- PASS: AI crawler user-agent access
- PASS: Linked URL checks
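
For readers who want to reproduce a report like this in their own scripts, here is a rough sketch. It assumes the percentage is simply the share of checks that passed and that the overall result is PASS only when every check passes; the checker's real scoring rule is not documented here, and `render_report` is a hypothetical name.

```python
def render_report(checks: dict[str, bool]) -> str:
    """Format named pass/fail checks in the style of the sample report."""
    passed = sum(checks.values())
    # Assumption: score = passed checks / total checks, rounded to a whole percent.
    percent = round(100 * passed / len(checks)) if checks else 0
    overall = "PASS" if checks and passed == len(checks) else "FAIL"
    lines = [
        "AI crawler and llms.txt check",
        f"Result: {overall} ({percent}%)",
        "Main checks:",
    ]
    lines += [f"- {'PASS' if ok else 'FAIL'}: {name}" for name, ok in checks.items()]
    return "\n".join(lines)
```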

Related Guides

If the checker finds a problem, the fix is usually in one of three places: the website files, the robots policy, or Cloudflare bot/security rules. For the full walkthrough, see how I fixed Cloudflare AI crawler blocking and llms.txt issues for a client. If you are checking the robots layer first, start with how to use the robots.txt file with WordPress.

Alternative tool: You can also use HTTP Status to check HTTP status codes.

FAQ

What does the AI Crawler & llms.txt Checker test?

It checks whether the homepage, robots.txt, llms.txt, and the optional llms-full.txt return healthy public responses, whether major AI crawler user agents can reach them, and whether the important links inside llms.txt resolve.

What is `llms.txt`?

llms.txt is a plain-text file, usually written in Markdown, that helps AI systems discover important website content, preferred URLs, and crawler-facing context.

Does `llms-full.txt` have to exist?

No, it is optional. If llms-full.txt is referenced from robots.txt or llms.txt, however, it should return 200 as text/plain. If the site does not publish a full file, remove every reference to it.

Why can `llms.txt` work in a browser but fail for AI crawlers?

A normal browser can receive a 200 response while Cloudflare, a WAF rule, or bot protection blocks AI crawler user agents such as GPTBot, ClaudeBot, OAI-SearchBot, or PerplexityBot.
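
A quick way to reproduce this yourself is to request the same URL with different User-Agent headers and compare the status codes. The sketch below uses only the Python standard library. The user-agent values are simplified product tokens rather than the full strings real crawlers send, the helper names are hypothetical, and a spoofed header only exercises user-agent rules: an edge layer may also verify crawler IP addresses, so results can differ from what the real crawler sees.

```python
import urllib.error
import urllib.request

# Simplified tokens for illustration; real crawlers send longer User-Agent strings.
AI_USER_AGENTS = [
    "GPTBot", "OAI-SearchBot", "ChatGPT-User",
    "ClaudeBot", "Claude-User", "PerplexityBot",
]


def build_request(url: str, user_agent: str) -> urllib.request.Request:
    """Create a GET request that identifies itself with the given User-Agent."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


def fetch_status(url: str, user_agent: str, timeout: float = 10.0) -> int:
    """Return the final HTTP status code (urllib follows redirects automatically)."""
    try:
        with urllib.request.urlopen(build_request(url, user_agent), timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        # urllib raises for 4xx/5xx responses; the code is still what we want.
        return err.code


def compare_user_agents(url: str) -> dict[str, int]:
    """Fetch one URL once per AI user agent and collect the status codes."""
    return {ua: fetch_status(url, ua) for ua in AI_USER_AGENTS}


# Example (performs live requests; replace the placeholder domain with your own):
# print(compare_user_agents("https://example.com/llms.txt"))
```

Timeouts and DNS failures surface as `urllib.error.URLError` or `TimeoutError` and are left unhandled here so they stay visible.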