OpenAI crawler policy

GPTBot Checker

Run a focused GPTBot access report for the exact public URL path you care about. The checker separates GPTBot from OAI-SearchBot and ChatGPT-User so training, search retrieval, and user-triggered access do not get blurred together.

Real checks

What this checker actually tests

The scan uses the same live crawler-access engine as the homepage, with the focused bot result lifted to the top of the report.

GPTBot is the training crawler

GPTBot is treated as an OpenAI training/model-improvement crawler in this report, not as the same role as search retrieval or user-triggered fetches.

Path-specific evidence

The result evaluates robots.txt against the submitted URL path and surfaces the matching rule line when a rule applies.

OpenAI roles compared

The report shows OAI-SearchBot and ChatGPT-User next to GPTBot so you can allow search retrieval while making a separate training-policy decision.

Methodology

How the bot decision is made

Each recommendation ties back to a public robots.txt rule, header, meta tag, file, or fetched page signal.

Fetch public robots.txt

The scanner fetches same-origin /robots.txt with bounded redirects and evaluates the exact submitted path.

Prioritize GPTBot

The focused report lifts GPTBot above the full bot table and includes related OpenAI agents for policy contrast.

Check page blockers

The scan also reviews HTTP status, meta robots, X-Robots-Tag, sitemap, and llms.txt because access is only useful when the page can be discovered and read.

Limitations

What this report does not claim

Bot access is a technical readiness signal, not a promise of training, ranking, citation, or AI visibility.

Boundary

This checker does not prove whether OpenAI has used, will use, or will cite the submitted page.

Boundary

robots.txt is a public policy signal; individual crawler behavior can change and should be verified against official documentation.

Boundary

The scan does not log in, bypass bot defenses, fetch private URLs, or store public scan reports by default.

Tool matrix

Move from one bot decision into broader robots, visibility, and crawler-readiness checks.

FAQ

Common questions

Short answers for site owners deciding how to handle crawler-specific robots.txt policies.

What does the GPTBot checker test?

It checks whether GPTBot is allowed or blocked by robots.txt for the exact submitted path, then adds page-level blockers such as noindex headers, sitemap availability, and llms.txt context.

Is GPTBot the same as OAI-SearchBot or ChatGPT-User?

No. The page treats GPTBot as a training crawler, OAI-SearchBot as search/retrieval, and ChatGPT-User as user-triggered context so site owners can make separate policy choices.

Can I block GPTBot but allow OpenAI search retrieval?

Yes, robots.txt can express different rules for different user agents. The focused report helps confirm whether the intended split applies to the tested path.

Do you store GPTBot scan results?

No. Scans are processed for the response and are not saved as public reports by default.