What does your robots.txt actually allow?

Parse user-agent blocks, allow/disallow patterns, crawl-delay and sitemap directives

Fetches /robots.txt directly. No account required.
What gets parsed: User-agent blocks, Allow / Disallow, Crawl-delay, Sitemap refs, syntax errors

Why robots.txt matters

It’s the first file a well-behaved crawler fetches. A stray Disallow: / tells every compliant crawler to stay off the whole site, and your pages can start dropping out of Google’s search results until you notice and ship a fix.

What gets parsed?

Every User-agent block with its Allow/Disallow rules, Crawl-delay directives and Sitemap references. Syntax errors are flagged separately.
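As a rough illustration of the parsing described above, here is a minimal Python sketch. It is not this tool's actual implementation: the `Group` class and `parse_robots` function are hypothetical names, and the grouping logic is simplified (consecutive User-agent lines share one block, and any unrecognized or colon-less line is reported as a syntax error).

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Group:
    """One User-agent block (hypothetical structure, for illustration only)."""
    user_agents: list = field(default_factory=list)
    rules: list = field(default_factory=list)  # ("allow" | "disallow", path)
    crawl_delay: Optional[float] = None


def parse_robots(text):
    """Return (groups, sitemaps, errors) for a robots.txt body."""
    groups, sitemaps, errors = [], [], []
    current = None
    last_was_agent = False
    for lineno, raw in enumerate(text.splitlines(), start=1):
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue
        if ":" not in line:
            errors.append((lineno, raw))  # e.g. a directive missing its colon
            continue
        key, value = (part.strip() for part in line.split(":", 1))
        key = key.lower()
        if key == "user-agent":
            # Consecutive User-agent lines belong to the same block.
            if current is None or not last_was_agent:
                current = Group()
                groups.append(current)
            current.user_agents.append(value)
            last_was_agent = True
        elif key == "sitemap":
            sitemaps.append(value)  # Sitemap lines are not tied to a block
        elif key in ("allow", "disallow", "crawl-delay"):
            last_was_agent = False
            if current is None:
                errors.append((lineno, raw))  # rule before any User-agent
            elif key == "crawl-delay":
                try:
                    current.crawl_delay = float(value)
                except ValueError:
                    errors.append((lineno, raw))
            else:
                current.rules.append((key, value))
        else:
            errors.append((lineno, raw))  # unknown directive
    return groups, sitemaps, errors
```

Calling `parse_robots` on a file's text gives back the blocks, the sitemap URLs and a list of `(line number, raw line)` pairs for anything that could not be parsed, mirroring the "flagged separately" behaviour.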


For staging environments

Make sure your staging or pre-prod robots.txt blocks crawlers, and that your production version does not accidentally inherit those rules.
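One way to guard against that is a small check in a deploy pipeline. The sketch below uses Python's standard `urllib.robotparser`. The helper name `blocks_everything` and the example file contents are assumptions for illustration, and testing only the root URL is a simplification rather than a full audit.

```python
from urllib.robotparser import RobotFileParser


def blocks_everything(robots_txt, user_agent="*"):
    """Return True if the given robots.txt text disallows the site root.

    Hypothetical helper: it only tests "/", which catches a blanket
    "Disallow: /" but not narrower rules.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(user_agent, "/")


# Example contents (assumed, not taken from a real site).
staging = "User-agent: *\nDisallow: /\n"
production = "User-agent: *\nDisallow: /admin/\n"

staging_blocked = blocks_everything(staging)
production_blocked = blocks_everything(production)
```

In a pipeline you might fail the production deploy when `production_blocked` is true, and fail the staging deploy when `staging_blocked` is false.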
