Documentation
CORSPORXY Logo

Documentation → features

Web Scraping

CorsProxy includes a built‑in content extraction feature so you can pull structured data from pages directly in the browser.

Business plan required: extract requires a Business plan and a valid API key.

Extract content from HTML

Enable extraction by adding extract=1:

https://corsproxy.io/?url=https://example.com&extract=1

Parameters

ParameterDescriptionExample
extractEnable extraction (1)extract=1
selectorCSS selector for main contentselector=article
titleSelectorCSS selector for titletitleSelector=h1
bylineSelectorCSS selector for author/bylinebylineSelector=.byline
stripComma‑separated selectors to removestrip=.ads,.promo
formatjson (default) or textformat=text
maxCharsMax characters in outputmaxChars=5000

Example (structured JSON)

https://corsproxy.io/?url=https://news.ycombinator.com&extract=1&selector=.titleline%20%3E%20a

Example (plain text)

https://corsproxy.io/?url=https://example.com&extract=1&format=text

Response content type is application/json;charset=UTF-8 (or text/plain;charset=UTF-8 when format=text).

CSV/XML/RSS conversion is documented separately in File Conversion.