BLUF: Check whether ChatGPT, Claude, Perplexity, and other AI crawlers can access your homepage, robots.txt, llms.txt, optional llms-full.txt, and the important URLs listed inside the file.
Check if AI crawlers can access your site. Paste a domain below to test live crawler access and quickly spot whether the issue is in robots.txt, llms.txt, llms-full.txt, linked URLs, or edge rules such as Cloudflare.
AI crawler diagnostics
AI Crawler & llms.txt Checker
Paste a domain to check whether llms.txt, llms-full.txt, robots.txt, linked resources, and major AI crawler user agents are accessible.
Looking for more utilities like this? Visit the WebDesy SEO Tools page.
What the Checker Reviews
- Whether /llms.txt returns 200 and is served as plain text.
- Whether /robots.txt includes the right Llms-txt: discovery line.
- Whether tested AI user agents, including GPTBot, OAI-SearchBot, ChatGPT-User, ClaudeBot, Claude-User, and PerplexityBot, can reach the homepage and /llms.txt.
- Whether /llms-full.txt is healthy when it exists or is referenced.
- Whether the public links inside llms.txt return successful responses.
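To make the file-level checks above concrete, here is a minimal Python sketch of three of them. It is not the checker's actual implementation. The function names are hypothetical, and the link extractor assumes llms.txt lists its URLs as Markdown-style links, which may not hold for every site.

```python
import re

# Assumption: links inside llms.txt look like [Title](https://example.com/page).
LINK_RE = re.compile(r"\[[^\]]*\]\((https?://[^)\s]+)\)")


def is_plain_text(content_type: str) -> bool:
    """True when a Content-Type header value denotes text/plain."""
    media_type = content_type.split(";", 1)[0].strip().lower()
    return media_type == "text/plain"


def find_discovery_urls(robots_text: str) -> list[str]:
    """Return the value of every Llms-txt: line found in a robots.txt body."""
    urls = []
    for line in robots_text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        key, sep, value = line.partition(":")
        if sep and key.strip().lower() == "llms-txt" and value.strip():
            urls.append(value.strip())
    return urls


def extract_links(llms_text: str) -> list[str]:
    """Return the http(s) URLs of Markdown-style links in an llms.txt body."""
    return LINK_RE.findall(llms_text)
```

Each helper takes text that has already been downloaded, so the same logic can be applied to a live response or to a saved copy of the file.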
Example Results
Healthy setup
- /robots.txt returns 200 and points to /llms.txt.
- /llms.txt returns 200 as plain text.
- Major AI crawler user agents can reach the homepage and /llms.txt.
- Important URLs listed inside llms.txt return successful responses.
Needs attention
- A normal browser gets 200, but AI crawler user agents get 403.
- Cloudflare or another edge layer blocks crawler requests.
- /llms-full.txt is referenced but returns 404.
- URLs listed in llms.txt redirect, 404, or time out.
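The first symptom in this list, a 200 for a browser but a 403 for crawlers, can be approximated by requesting the same URL with different User-Agent headers. The sketch below uses only the Python standard library. The browser string and the bare crawler tokens are simplifying assumptions: real crawlers send longer User-Agent strings, and edge rules may also match on IP address, so a pass here does not guarantee that the real crawler gets through.

```python
import urllib.error
import urllib.request

# Assumption: a generic browser-like string and bare crawler tokens stand in
# for the full User-Agent headers that real clients send.
BROWSER_UA = "Mozilla/5.0 (compatible; ExampleBrowser/1.0)"
CRAWLER_TOKENS = [
    "GPTBot", "OAI-SearchBot", "ChatGPT-User",
    "ClaudeBot", "Claude-User", "PerplexityBot",
]


def fetch_status(url, user_agent, timeout=10.0):
    """Return the final HTTP status code, or None if no response arrived."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        return err.code  # 4xx and 5xx responses are raised as HTTPError
    except OSError:
        return None  # timeout, DNS failure, refused connection, and similar


def blocked_agents(browser_status, crawler_statuses):
    """List the agents that failed while the browser request succeeded."""
    if browser_status is None or not 200 <= browser_status < 300:
        return []  # the URL fails for everyone, so it is not crawler-specific
    return [
        agent for agent, status in crawler_statuses.items()
        if status is None or not 200 <= status < 300
    ]
```

A typical run would call fetch_status once with BROWSER_UA and once per entry in CRAWLER_TOKENS, then pass the results to blocked_agents. Note that urllib follows redirects, so the status returned is that of the final response.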
What to Do If It Fails
- If AI crawler user agents get 403, check Cloudflare AI Crawl Control, WAF rules, bot rules, and user-agent blocks.
- If robots.txt is missing the discovery line, add the correct Llms-txt: URL and keep private paths blocked.
- If /llms-full.txt returns 404, publish it as plain text or remove every reference to it.
- If linked URLs fail, fix the listed URLs, replace them in llms.txt, or redirect them to relevant live pages.
Need help fixing a failed check? Contact WebDesy and send the copied report. I can review the website files, robots policy, Cloudflare AI Crawl Control, WAF rules, and llms.txt setup.
Sample Report
A healthy result should show PASS for homepage access, llms.txt access, plain-text content type, the Llms-txt: discovery line, AI crawler user-agent access, and linked URL checks.
AI crawler and llms.txt check
Result: PASS (100%)
Main checks:
- PASS: llms.txt access
- PASS: robots.txt Llms-txt line
- PASS: AI crawler user-agent access
- PASS: Linked URL checks
Related Guides
If the checker finds a problem, the fix is usually in one of three places: the website files, the robots policy, or Cloudflare bot/security rules. For the full walkthrough, see how I fixed Cloudflare AI crawler blocking and llms.txt issues for a client. If you are checking the robots layer first, start with how to use the robots.txt file with WordPress.
Alternative tool: You can also use HTTP Status to check HTTP status codes.
FAQ
What does the AI Crawler & llms.txt Checker test?
It checks whether the homepage, robots.txt, llms.txt, optional llms-full.txt, AI crawler user agents, and important links inside llms.txt return healthy public responses.
What is llms.txt?
llms.txt is a plain text file that helps AI systems discover important website content, preferred URLs, and crawler-facing context.
Does llms-full.txt have to exist?
No, the file is optional. However, if llms-full.txt is referenced from robots.txt or llms.txt, it should return 200 as text/plain. If the site does not publish a full file, references to it should be removed.
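The rule in this answer can be written as a small decision function. This is a sketch under stated assumptions rather than the checker's own logic: the function name and the SKIP and FAIL labels are hypothetical (only PASS appears in the sample report), and treating an unreferenced file that exists but is not text/plain as a failure is one possible reading of "healthy when it exists".

```python
from typing import Optional


def llms_full_verdict(referenced: bool, status: Optional[int],
                      content_type: str = "") -> str:
    """Classify /llms-full.txt as PASS, SKIP, or FAIL.

    referenced   -- True if robots.txt or llms.txt points to the file
    status       -- HTTP status of /llms-full.txt, or None if not fetched
    content_type -- value of the Content-Type response header
    """
    media_type = content_type.split(";", 1)[0].strip().lower()
    if status == 200 and media_type == "text/plain":
        return "PASS"  # the file is published and served as plain text
    if not referenced and status in (404, None):
        return "SKIP"  # the optional file is absent and nothing points to it
    return "FAIL"  # referenced but broken, or present but not text/plain
```

For example, a site that never mentions the file and returns 404 for it gets SKIP, while a site whose llms.txt links to a missing /llms-full.txt gets FAIL.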
Why can llms.txt work in a browser but fail for AI crawlers?
A normal browser can receive a 200 response while Cloudflare, a WAF rule, or bot protection blocks AI crawler user agents such as GPTBot, ClaudeBot, OAI-SearchBot, or PerplexityBot.