Crawler & URL Discovery

Starts a site crawl (same host, depth and page limits from the job), discovers internal links, and records HTTP status per URL. This is a lightweight crawler—not a full Screaming Frog replacement: advanced SEO checks (JS rendering, duplicate packs, custom extractions, sitemap builder) would be follow-on work on top of this pipeline.

No job yet

Start crawl

Domain or URL

Status

No status loaded.

Phase 2 — Compare crawls

Baseline a is usually the earlier crawl; b the later. URLs are matched by crawl URL hash. Rows: new_in_b, removed_in_a, changed (status, title, canonical, meta, word count, H1, content-type, robots meta, meta refresh, body hash, X-Robots-Tag, HTML lang, response time). With two jobs selected, this page updates the URL with compareA / compareB so you can bookmark or share the pair. With the diff table visible, filters and sort are also written to the URL (for example csk=status_delta to sort by HTTP status change between A and B).

No crawl jobs returned. Start a crawl above, or check the database connection.

Baseline crawl (A)Compare crawl (B)

Past crawl jobs

Remove individual jobs from the database (queue, audits, fetches for that job). Use Filter jobs in Phase 2 to narrow this list and the compare dropdowns. View loads that job into Status, reports, and Discovered URLs above.

Open job by ID

No jobs loaded. Use Reload list or start a crawl.

Phase 1 Reports

126 downloads (CSV exports + sitemap).

One ZIP: csv/ (125 files), xml/sitemap.xml, and json/summary.json. May take a minute on large jobs.

Discovered URLs

URL	Depth	HTTP	Title	Queue state
No URLs yet.