Crawl Analyzer
Paths
URLs
Dead-ends
Frequency
Orphans
Budget
Verify
Tools
Session
WordPress-Pingback · AutomaticBarrier · Apr 16 01:09 · 10 hits
Scrape pages
Crawl path
⊕ Fit
2xx OK
3xx Redirect
4xx Error
5xx Server error
— edge = referer · dashed = no referer
? what do these mean
2xx OK
Page was successfully fetched. Bot read the content and may have followed links from it.
200
Standard success — full content returned.
304
Not Modified — bot already has this page cached, server skipped sending content. Saves bandwidth but counts as a crawl hit.
Graph edges & nodes
— solid edge
— URL was reached via a referer link from another page in this session.
- - dashed edge
— No referer. Bot fetched this URL from its own crawl queue, not via a link.
dashed border node
— Entry point. No other page in this session linked to it.
grey node
— External referer. Not part of this session — it's the page that triggered the crawl.
3xx Redirect
Bot was redirected to another URL. Each redirect costs one crawl budget hit.
301
Permanent — passes link equity to destination. Use for page moves.
302
Temporary — does not reliably pass link equity. Avoid for permanent moves.
307
Temporary (method-safe) — same SEO implications as 302.
308
Permanent (method-safe) — like 301 but preserves POST method.
4xx Error
Bot hit a dead end — no content returned, no links followed. Wastes crawl budget.
400
Bad Request — malformed URL or request.
401
Unauthorised — login required, bot can't access.
403
Forbidden — server blocked the bot (firewall, IP, robots).
404
Not Found — page doesn't exist. High volume = broken internal links.
410
Gone — page permanently removed. Tells Google to deindex faster than 404.
429
Too Many Requests — bot rate-limited. May reduce crawl frequency.
5xx Server error
Server failed to respond. Bot may retry but will deprioritise the site if errors persist.
500
Internal Server Error — unhandled exception on the server.
502
Bad Gateway — upstream server (e.g. PHP-FPM) returned an invalid response.
503
Service Unavailable — server overloaded or in maintenance mode.
504
Gateway Timeout — upstream server took too long to respond.
Crawl Sequence
[500]
/wp-cron.php?doing_wp_cron=1776330556.3245890140533447265625
+0s
[500]
/wp-cron.php?doing_wp_cron=1776330784.4930939674377441406250
+228s
[500]
/wp-cron.php?doing_wp_cron=1776332186.8989729881286621093750
+1630s
[500]
/wp-cron.php?doing_wp_cron=1776332941.2119200229644775390625
+2385s
[500]
/wp-cron.php?doing_wp_cron=1776333386.2978389263153076171875
+2830s
[500]
/wp-cron.php?doing_wp_cron=1776333538.5151119232177734375000
+2982s
[500]
/wp-cron.php?doing_wp_cron=1776333895.1556069850921630859375
+3339s
[500]
/wp-cron.php?doing_wp_cron=1776333968.8124320507049560546875
+3412s
[500]
/wp-cron.php?doing_wp_cron=1776335187.0261719226837158203125
+4631s
[500]
/wp-cron.php?doing_wp_cron=1776335786.9505450725555419921875
+5230s
0s
5230s
Link attributes
Click a node in the graph to see its outgoing links
This page hasn't been scraped yet — no link data available.
Run scraper →
▶ show
#
Target URL
Text
Role
Placement
Rel
↗