Crawl rules
Review user-agent groups, allow rules, disallow paths, crawl-delay values, and invalid directives in one readable summary.
A robots.txt checker fetches a site's robots.txt file and reviews crawler rules, user-agent groups, sitemap directives, and SEO issues that can accidentally block Googlebot or other crawlers.
Review user-agent groups, allow rules, disallow paths, crawl-delay values, and invalid directives in one readable summary.
Confirm that the file points search engines to XML sitemaps so canonical URLs can be discovered quickly after releases.
Flag missing files, sitewide blocks, blank user agents, oversized files, and directives crawlers may ignore.
A robots.txt checker fetches a site's robots.txt file and reviews crawler rules, user-agent groups, sitemap directives, and common SEO issues that affect search engine access.
Robots.txt can block crawling, but it is not a guaranteed noindex control. To remove a page from search results, use noindex on a crawlable page or remove the page from the site.
Most public sites should have a robots.txt file because it gives crawlers a predictable place to find rules and sitemap URLs, even when the rules are simple.
`Disallow: /` under `User-agent: *` tells most crawlers not to crawl any public path on the site. That is useful for staging sites, but dangerous on production sites.
The checker only accepts public http and https URLs. It blocks localhost, private network, and reserved address ranges before fetching robots.txt.