Crawl Analyzer
Paths
URLs
Dead-ends
Frequency
Orphans
Budget
Verify
Tools
Session
WordPress-Pingback · AutomaticBarrier · Apr 15 07:33 · 17 hits
Scrape pages
Crawl path
⊕ Fit
2xx OK
3xx Redirect
4xx Error
5xx Server error
— edge = referer · dashed = no referer
? what do these mean
2xx OK
Page was successfully fetched. Bot read the content and may have followed links from it.
200
Standard success — full content returned.
304
Not Modified — bot already has this page cached, server skipped sending content. Saves bandwidth but counts as a crawl hit.
Graph edges & nodes
— solid edge
— URL was reached via a referer link from another page in this session.
- - dashed edge
— No referer. Bot fetched this URL from its own crawl queue, not via a link.
dashed border node
— Entry point. No other page in this session linked to it.
grey node
— External referer. Not part of this session — it's the page that triggered the crawl.
3xx Redirect
Bot was redirected to another URL. Each redirect costs one crawl budget hit.
301
Permanent — passes link equity to destination. Use for page moves.
302
Temporary — does not reliably pass link equity. Avoid for permanent moves.
307
Temporary (method-safe) — same SEO implications as 302.
308
Permanent (method-safe) — like 301 but preserves POST method.
4xx Error
Bot hit a dead end — no content returned, no links followed. Wastes crawl budget.
400
Bad Request — malformed URL or request.
401
Unauthorised — login required, bot can't access.
403
Forbidden — server blocked the bot (firewall, IP, robots).
404
Not Found — page doesn't exist. High volume = broken internal links.
410
Gone — page permanently removed. Tells Google to deindex faster than 404.
429
Too Many Requests — bot rate-limited. May reduce crawl frequency.
5xx Server error
Server failed to respond. Bot may retry but will deprioritise the site if errors persist.
500
Internal Server Error — unhandled exception on the server.
502
Bad Gateway — upstream server (e.g. PHP-FPM) returned an invalid response.
503
Service Unavailable — server overloaded or in maintenance mode.
504
Gateway Timeout — upstream server took too long to respond.
Crawl Sequence
[500]
/wp-cron.php?doing_wp_cron=1776267210.1629951000213623046875
+0s
[500]
/wp-cron.php?doing_wp_cron=1776267652.2209300994873046875000
+442s
[500]
/wp-cron.php?doing_wp_cron=1776267981.7281088829040527343750
+771s
[500]
/wp-cron.php?doing_wp_cron=1776268042.4928119182586669921875
+832s
[500]
/wp-cron.php?doing_wp_cron=1776268346.0635550022125244140625
+1136s
[500]
/wp-cron.php?doing_wp_cron=1776268670.0408198833465576171875
+1460s
[500]
/wp-cron.php?doing_wp_cron=1776269368.4466729164123535156250
+2158s
[500]
/wp-cron.php?doing_wp_cron=1776269697.8282830715179443359375
+2487s
[500]
/wp-cron.php?doing_wp_cron=1776270085.7720239162445068359375
+2875s
[500]
/wp-cron.php?doing_wp_cron=1776270232.6455740928649902343750
+3022s
[500]
/wp-cron.php?doing_wp_cron=1776270453.5221259593963623046875
+3243s
[500]
/wp-cron.php?doing_wp_cron=1776270544.6608428955078125000000
+3334s
[500]
/wp-cron.php?doing_wp_cron=1776270760.3875029087066650390625
+3550s
[200]
/?wordfence_syncAttackData=1776270935.3618
+3725s
[500]
/wp-cron.php?doing_wp_cron=1776270951.8208000659942626953125
+3741s
[500]
/wp-cron.php?doing_wp_cron=1776271016.5770599842071533203125
+3806s
[500]
/wp-cron.php?doing_wp_cron=1776271222.4737820625305175781250
+4012s
0s
4012s
Link attributes
Click a node in the graph to see its outgoing links
This page hasn't been scraped yet — no link data available.
Run scraper →
▶ show
#
Target URL
Text
Role
Placement
Rel
↗