Skip to content

Crawl parameters

FieldTypeRequiredNotes
urlstringyesSeed URL.
limitnumbernoMaximum number of pages to scrape.
renderbooleannoRender JS while crawling.
regexOnFullURLstringnoFilter URLs using a full-URL regex.
deduplicateSimilarURLsbooleannoMerge near-duplicate URLs before scrape work.
callback_urlstringnoCallback on completion for async workflows.

Guidance

  • keep limit low until you know the target site structure
  • use regexOnFullURL on docs or changelog sites to stay on the exact subtree you need
  • enable de-duplication when the site emits tracking-heavy URL variants

Crawl parameter baseline

```bash curl https://api.faircompany.ai/v1/crawl/web/crawl \ -X POST \ -H "Authorization: Bearer fc_live_xxx" \ -H "Content-Type: application/json" \ -d '{ "url": "https://docs.stripe.com", "limit": 20, "render": false }' ```