Bright Data
brightdata.com
Enterprise proxy + scraping infrastructure. Web Unlocker, Scraping Browser, SERP API.
The interface is live, but the benchmark values are placeholders while the first real run is prepared.
Share of attempts that returned usable content, not just a transport-level success.
Median end-to-end response time for successful attempts in that track.
Estimated provider spend normalized to a standard successful-attempt volume.
Average attempts needed before a run is marked usable or failed.
Per-target logs, timestamps, failure reasons, byte counts, and hashes.
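The metric definitions above can be sketched as a small aggregation over per-attempt log records. This is an illustrative sketch, not the benchmark's actual scoring code: the `Attempt` fields, the `summarize` function, and the default normalization volume of 1,000 successful attempts are all assumptions made for the example.

```python
from dataclasses import dataclass
from statistics import median


@dataclass
class Attempt:
    # Hypothetical log record; field names are assumptions for this sketch.
    run_id: str        # one run = one target for one provider
    usable: bool       # content was usable, not just a transport-level success
    latency_ms: float  # end-to-end response time
    cost_usd: float    # estimated provider spend for this attempt


def summarize(attempts, per_successes=1000):
    """Aggregate the four headline metrics from a list of attempts."""
    successes = [a for a in attempts if a.usable]
    runs = {a.run_id for a in attempts}
    total_cost = sum(a.cost_usd for a in attempts)
    return {
        # Share of attempts that returned usable content.
        "success_rate": len(successes) / len(attempts) if attempts else None,
        # Median latency over successful attempts only.
        "median_latency_ms": (
            median(a.latency_ms for a in successes) if successes else None
        ),
        # Total spend scaled to a fixed number of successful attempts.
        "cost_per_n_successes": (
            total_cost / len(successes) * per_successes if successes else None
        ),
        # Average number of attempts made per run.
        "avg_attempts_per_run": len(attempts) / len(runs) if runs else None,
    }
```

Failed attempts still count toward spend in this sketch, so a provider that needs many retries shows a higher normalized cost even if each individual request is cheap.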
| Status | Provider | Score | Open |
|---|---|---|---|
| Preview | TBD | | |
| Preview | TBD | | |
| Preview | TBD | | |
| Preview | TBD | | |
| Preview | TBD | | |
| Preview | TBD | | |
| Preview | TBD | | |
| Preview | TBD | | |
| Preview | TBD | | |
| Preview | TBD | | |
HTML scrape: URL in → HTML/text out. Static pages, no JS, simple anti-bot.
JS Render: Pages requiring a real browser to populate the DOM.
Browser Session: Multi-step CDP flows: click, fill, scroll, persist cookies.
Structured Extract: Field-level accuracy against ground-truth schemas.
Agentic: LLM-controlled browser tasks. Non-deterministic; reported separately.
Every provider receives the same sampled URLs and task definitions for the published run.
Fetch APIs, render APIs, hosted browsers, extraction APIs, and agent flows are scored separately.
A response only counts when the returned page or fields are usable for the intended scraping task.
ScrapeDrive is operated by the team that runs ScrapingEvals. It appears in the benchmark and is not excluded when it loses.
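Two of the rules above, identical sampled URLs for every provider and counting only usable responses, can be illustrated with a short sketch. The function names, the seed value, and the marker lists below are hypothetical; the source does not describe how sampling or usability checks are actually implemented.

```python
import random


def sample_urls(pool, k, seed):
    """Draw the same k URLs for every provider, given the same seed."""
    # Sort first so the sample depends only on the seed, not on input order.
    rng = random.Random(seed)
    return rng.sample(sorted(pool), k)


def is_usable(status, body, required_markers,
              block_markers=("captcha", "access denied")):
    """Return True only if the response looks usable for the scraping task.

    The marker lists are assumptions for this sketch; a real check would be
    defined per target.
    """
    if status != 200:
        return False
    text = body.lower()
    # A 200 response that is actually a challenge page does not count.
    if any(marker in text for marker in block_markers):
        return False
    # Every expected piece of content must be present.
    return all(marker.lower() in text for marker in required_markers)
```

For example, `sample_urls(pool, 3, seed=42)` returns the same three URLs regardless of which provider adapter calls it, and `is_usable(200, "Please complete the CAPTCHA", ["acme widget"])` returns `False` even though the transport-level status is a success.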
| Anti-bot vendor | URLs |
|---|---|
| Akamai-protected | TBD |
| Cloudflare | TBD |
| PerimeterX | TBD |
| DataDome | TBD |
| Unprotected control | TBD |
| Total sampled | TBD |
| Target | Protection | Category | URLs |
|---|---|---|---|
| Amazon · product detail | Akamai | ecom | TBD |
| LinkedIn · company page | PerimeterX | social | TBD |
| Indeed · search results | Cloudflare | jobs | TBD |
| G2 · software listing | DataDome | review | TBD |
| Yelp · business page | PerimeterX | local | TBD |
| Zillow · listing detail | Akamai | realestate | TBD |
| TripAdvisor · hotel detail | Cloudflare | travel | TBD |
| eBay · search results | Akamai | ecom | TBD |
| Static news article (control) | control | control | TBD |
| SPA-only shop (control) | control | control | TBD |
Enterprise proxy + scraping infrastructure. Web Unlocker, Scraping Browser, SERP API.
Smart proxy with auto-rotation, ban detection, JS rendering. Scrapy creators.
URL → clean Markdown for LLMs. Crawl, scrape, extract.
Lightweight scraping API. Built by the team that runs ScrapingEvals.
Headless browser API with proxy rotation. Aimed at developers.
Rotating proxies + headless browsers. High-volume general scraping.
Hosted Chrome over CDP. Best for browser sessions and agentic flows.
ASP anti-scraping bypass, screenshots, sessions, extraction rules.
Simple URL-in / HTML-out API. Cheap, broad coverage.
Actor marketplace + crawler infrastructure. Wide capability surface.
ScrapingEvals is a public benchmark for scraping APIs. It compares providers on real-world scraping jobs instead of only listing features or pricing pages.
A raw HTML fetch API, JavaScript renderer, persistent browser session, structured extractor, and agent-controlled browser solve different jobs. One overall score would hide those differences.
Start with the track that matches your job. HTML scrape is for static pages, JS Render is for browser-built pages, Browser Session is for multi-step workflows, Structured Extract is for field accuracy, and Agentic is for LLM-driven browsing.
The site links run IDs, commit hashes, adapter source, target categories, and evidence logs so that another engineer can inspect how the numbers were produced.
No. ScrapeDrive is marked because the ScrapingEvals team operates it. The same tables, evidence view, tracks, and scoring language apply to every provider.