Robots.txt Generator
Create a robots.txt file that gives crawlers clear crawl rules, protected paths, and sitemap discovery hints.
How to use the robots.txt generator
Use this tool when you need a clean starting point for crawler directives before publishing or updating site-wide crawl controls.
Fill in the fields that match your crawling policy:
- User-agent: define which crawler the rules apply to. Use
*for all bots. - Disallow: list private folders or pages you do not want crawled.
- Allow: add exceptions for paths that should remain crawlable.
- Crawl-delay: slow down aggressive bots if your server needs more breathing room.
- Sitemap URL: point crawlers to the XML sitemap you want them to discover.
Example A common setup blocks a private folder while keeping a public resource available.
User-agent: *
Disallow: /private
Allow: /public
After generation, review the output, test it with the Robots.txt Tester, and deploy it to the site root as <code>/robots.txt</code>.
Robots.txt Generator Use Cases
A robots.txt file is often the first control point for search crawlers. Use this generator when you need a clean starting file for a new site, a staging environment, an ecommerce faceted navigation cleanup, or a crawl-budget pass before a technical SEO audit.
Common rules to generate
- Allow all crawlers: publish a simple default policy with a sitemap location.
- Block staging paths: keep test folders, admin routes, and internal search pages out of crawl queues.
- Protect crawl budget: disallow duplicate filter URLs, sort parameters, and temporary campaign paths.
- Separate bot behavior: create user-agent specific sections for crawlers that need stricter rules.
After generating the file
- Upload it to
/robots.txtat the root of the canonical host. - Confirm the sitemap URL resolves and returns XML.
- Test important pages before blocking a whole directory.
- Review rules after migrations, CMS changes, and new subfolder launches.
Useful next checks: Robots.txt Tester, Sitemap Generator, and Page Indexability Checker.