Question 1

What’s the difference between robots.txt and sitemap.xml?

Accepted Answer

robots.txt controls crawl rules and can point to sitemaps via Sitemap: directives. sitemap.xml lists URLs you want search engines to discover. They solve different problems, and you usually want both.

Question 2

Is it a problem if robots.txt is missing?

Accepted Answer

Not always. Many sites work fine without robots.txt. But if you do have one, a bad rule can block crawling, so checking robots.txt is still important.

Question 3

What does “global block” mean?

Accepted Answer

It usually means you have User-agent: * plus Disallow: /, which blocks crawlers from the whole site. A sitemap can still exist, but bots won’t crawl URLs if they’re disallowed.

Question 4

My robots has no “Sitemap:” lines. Is that bad?

Accepted Answer

Not necessarily. Some sites don’t declare it. This tool probes common locations like /sitemap.xml and /sitemap_index.xml. Adding Sitemap: in robots.txt is still a good practice.

Question 5

Why do you show “odd content-type” for sitemap?

Accepted Answer

Sitemaps should usually be XML (or sometimes text). If the server returns HTML, it can be a soft error page or a WAF interstitial.

Question 6

Why can sitemap be OK but indexing still slow?

Accepted Answer

A sitemap helps discovery, but indexing depends on quality, internal linking, canonical tags, server performance, duplicates, and crawl budget. This tool checks availability, not indexing outcomes.

Question 7

Why do I see 403 / 429 / status 0?

Accepted Answer

403 and 429 are commonly caused by WAF/rate limits. Status 0 often means connection/TLS/DNS problems or the server closed the request.

Question 8

Can a sitemap be blocked by robots.txt?

Accepted Answer

The sitemap file itself can be blocked by server rules and URLs inside the sitemap can be disallowed. If crawlers are globally blocked, the sitemap won’t help much for crawling.

Question 9

Does this tool check all sitemaps listed in robots.txt?

Accepted Answer

In this version, it checks the first candidate sitemap (from robots, or a common guess if none are declared).

Question 10

What does the CSV export include?

Accepted Answer

It exports robots URL/final URL/status, key robots signals, the tested sitemap URL/final URL/status, content-type/size, and the final summary issues.

Sitemap vs Robots Checker

Results

Quick interpretation

Sitemap vs Robots Checker: spot indexing blockers fast

What we flag

FAQ