Robots.txt is path-specific
A rule that looks harmless at the domain level can block a product page, blog post, or documentation path.
Robots policy
Parse robots.txt for major AI user agents and see which rules apply to the URL path you care about.
A rule that looks harmless at the domain level can block a product page, blog post, or documentation path.
Training crawlers, search/retrieval crawlers, and classic search bots should be reviewed separately.
AI crawler user-agent names change over time, so the bot registry should have a visible update date.