Scrape overview

POST /v1/crawl/web/scrape is the default FairCrawl read path for one URL.

Use it when:

  • one page can answer the question
  • you need text, markdown, HTML, metadata, or screenshot output
  • the target may require JS rendering or premium routing
  • the source is a PDF or DOCX instead of HTML

Request shape

The scrape route accepts a JSON body with a required url and optional extraction controls such as format, render_js, only_main_content, screenshot, query, premium, proxy_country, pdf_mode, and max_pages.

Basic scrape request

```bash
curl https://api.faircompany.ai/v1/crawl/web/scrape \
  -X POST \
  -H "Authorization: Bearer fc_live_xxx" \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://stripe.com/pricing",
    "format": "markdown",
    "only_main_content": true
  }'
```
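
As a variation on the basic request, the sketch below wraps the call in two small shell functions, reads the API key from an environment variable instead of hard-coding it, and adds the optional render_js control. The function names build_scrape_body and scrape_url and the FAIRCRAWL_API_KEY variable are hypothetical, and treating render_js as a JSON boolean is an assumption this page does not confirm.

```bash
#!/usr/bin/env bash
# Hypothetical helpers: the function names and FAIRCRAWL_API_KEY are
# illustrative assumptions, not part of the FairCrawl API.

# Build the JSON request body.
# Assumes the URL contains no double quotes or backslashes, and that
# render_js accepts a JSON boolean (an assumption, not confirmed here).
build_scrape_body() {
  local url="$1" format="${2:-markdown}" render_js="${3:-false}"
  printf '{"url": "%s", "format": "%s", "render_js": %s, "only_main_content": true}' \
    "$url" "$format" "$render_js"
}

# Send the scrape request; --fail makes curl return non-zero on HTTP errors.
scrape_url() {
  curl --silent --show-error --fail \
    https://api.faircompany.ai/v1/crawl/web/scrape \
    -X POST \
    -H "Authorization: Bearer ${FAIRCRAWL_API_KEY:?set FAIRCRAWL_API_KEY first}" \
    -H "Content-Type: application/json" \
    -d "$(build_scrape_body "$@")"
}
```

With the key exported, a call such as scrape_url "https://stripe.com/pricing" markdown true would send the same request as above with JS rendering requested.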

Routing behavior

  • static HTML starts on the cheapest fetch path
  • JS-heavy pages can escalate to browser rendering
  • hard targets can escalate to premium or proxy-backed fetchers
  • PDFs and DOCX files use the document extraction path instead of the HTML parser
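
The routing rules above can be read as a simple decision ladder. The sketch below is only a mental model of that ordering, not FairCrawl's actual routing code: the function name pick_fetch_path, its three inputs, and the path labels it prints are all invented for illustration.

```bash
#!/usr/bin/env bash
# Illustrative model of the routing rules listed above.
# The function name, its arguments, and the printed labels are hypothetical.

# Usage: pick_fetch_path <kind: html|pdf|docx> <needs_js: true|false> <hard_target: true|false>
pick_fetch_path() {
  local kind="$1" needs_js="$2" hard_target="$3"

  # Documents bypass the HTML parser entirely.
  case "$kind" in
    pdf|docx) echo "document-extraction"; return ;;
  esac

  if [ "$hard_target" = "true" ]; then
    # Hard targets escalate furthest, to premium or proxy-backed fetchers.
    echo "premium-or-proxy"
  elif [ "$needs_js" = "true" ]; then
    # JS-heavy pages escalate to browser rendering.
    echo "browser-render"
  else
    # Static HTML stays on the cheapest fetch path.
    echo "static-fetch"
  fi
}
```

In this model, a plain HTML page that needs neither JS nor special routing stays on the cheapest path, and each added difficulty moves the request one rung up.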

Related pages

  • /docs/endpoints/scrape/parameters/
  • /docs/endpoints/scrape/response-shape/
  • /docs/endpoints/scrape/examples/