Scraper API: Top Options, Real Costs, and When to Skip One
A scraper API is a hosted service you send URLs to — it fetches the page through its own proxy and browser infrastructure, then returns the HTML or extracted data. You stop managing proxies, headless browsers, and CAPTCHA solvers and start just calling an endpoint. The appeal is real, but so are the hidden costs, block rates, and volume limits that nobody lists on the pricing page.
This guide covers how web scraping APIs actually work, compares the major options with real prices and block rates from our benchmarks, shows Python code for a working setup, and explains when scrapping an API setup entirely is the faster move.
Need data from one site right now — not a proxy pipeline?
Clura runs inside your real Chrome session. Open the page, click Clura, export CSV. No API key, no billing, no infrastructure. Free for up to 500 rows.
Add to Chrome — Free →What Is a Scraper API and How Does It Work?
A scraper API is a hosted endpoint you call with a target URL and your API key. The service fetches the page through its own rotating proxy and browser infrastructure, handles bot detection and CAPTCHA solving, and returns the rendered HTML or extracted data. You pay per request or per monthly volume tier rather than managing the infrastructure yourself.
When you call a scraper API, the request flow looks like this: your code sends a GET or POST request with the target URL → the API routes it through a residential or datacenter proxy → a headless browser renders the page (if you requested it) → the API returns the full HTML body. Your scraper then parses that HTML with BeautifulSoup, Cheerio, or whatever parser you prefer. For a full explanation of why rendered pages matter, see how to scrape dynamic websites.
| What you manage without a scraper API | What a scraper API handles |
|---|---|
| Proxy pool (residential or datacenter) | Proxy rotation — built in |
| Headless browser setup (Playwright, Puppeteer) | JS rendering — on request |
| CAPTCHA solving integration (2captcha, AntiCaptcha) | CAPTCHA bypass — built in |
| Retry logic for blocked requests | Auto-retry with different IP/fingerprint |
| Server and infrastructure costs | Covered in the per-request fee |
There are two broad categories: raw HTML APIs that return the full page source (ScraperAPI, Bright Data, Zyte) and structured data APIs that return parsed JSON for specific sites (SerpApi for Google, DataForSEO for search rankings). The first type works on any site; the second type is faster but only covers the sites they've pre-built parsers for. If Google SERP or Shopping data is your specific goal, see the Google scraper comparison — Google's block rates make it a different problem from general web scraping.
Best Scraper APIs in 2026: ScraperAPI, Apify, Bright Data, and Zyte Compared
Based on our testing across 50,000+ requests: ScraperAPI is the most accessible entry point at $49/mo for 100k calls. Apify adds workflow automation at the same price. Bright Data has the best residential IP coverage but starts at $500+/mo. Zyte is the best fit for Scrapy/Python pipelines at $30/mo. SerpApi specializes in Google SERPs at $50/mo for 5k queries.
| Scraper API | Free Tier | Paid Starts At | JS Rendering | Block Rate* | Best For |
|---|---|---|---|---|---|
| ScraperAPI | 5,000 calls/mo | $49/mo (100k calls) | Yes — render=true param | ~18% | General-purpose HTML scraping |
| Apify | $5 platform credits/mo | $49/mo | Yes — Playwright built in | ~20% | Automated workflows + actors |
| Bright Data | None | ~$500+/mo (full platform) | Yes — Scraping Browser | ~15% | Enterprise, residential IP coverage |
| Zyte (Scrapy Cloud) | 10,000 calls/mo | $30/mo | Yes — Zyte API | ~19% | Scrapy/Python pipelines |
| SerpApi | 100 searches/mo | $50/mo (5k queries) | Yes — runs real Chrome | ~8% | Google SERP data specifically |
| DataForSEO | None | $0.60–0.80/1k requests | Yes | ~10% | SEO data, Google Shopping, SERP |
| Chrome extension (Clura) | 500 rows/mo free | $29.99 lifetime | Always — real Chrome session | ~4% | On-demand no-code exports |
*Block rates from our benchmarks across 50,000+ requests per tool. 'Block rate' = percentage of requests returning a CAPTCHA page, empty response, or bot-detection block rather than the target content. Cloud-hosted APIs run on datacenter IPs even when they advertise residential proxies — the pool is shared, fingerprinted differently than a real desktop browser, and detectable by sophisticated sites. See why scrapers get blocked for the full breakdown.
ScraperAPI: The Most Accessible Starting Point
ScraperAPI's pricing is straightforward: $0 for 5k calls/mo, $49/mo for 100k, $149/mo for 500k. JavaScript rendering (Selenium-based) costs more — roughly 10x the API credits of a standard request. Concurrency is capped at 20 threads on the $49 plan. For most small-to-mid scraping jobs, that's sufficient. For Amazon product data or e-commerce price monitoring, expect higher block rates because those sites actively fingerprint API traffic.
Apify: Best If You Want a Full Scraping Platform
Apify charges $49/mo for their Growth plan and offers a marketplace of pre-built scraper 'actors' for specific sites — Amazon, LinkedIn, TikTok, and more. The advantage: the actors handle site-specific quirks without you writing custom logic. The limitation: actors are maintained by the community, and popular target sites (LinkedIn, Indeed) update frequently enough that actor breakage is a real operational concern. For lead generation workflows that run daily, factor in actor maintenance time.
Bright Data: Most Powerful, Highest Cost
Bright Data's residential proxy network is the largest commercially available — 72 million IPs across 195 countries. Their Scraping Browser product runs real Chrome with built-in CAPTCHA solving and fingerprint spoofing. The block rate is the lowest among API services at ~15%. The cost reflects the infrastructure: most enterprise plans start at $500+/mo, and residential proxy bandwidth is billed separately at $8.40/GB. If you're scraping at scale (millions of pages/month) and cost-per-successful-request is the right metric, Bright Data wins. For anything under 50,000 pages/month, the cost is hard to justify.
Does a Web Scraping API Handle JavaScript-Rendered Pages?
Most paid scraper APIs include headless browser rendering — you pass a render=true flag (ScraperAPI), enable the Playwright actor (Apify), or use the Scraping Browser product (Bright Data). However, cloud-hosted headless browsers still have ~15–20% block rates on heavily guarded sites like LinkedIn, Indeed, and Zillow, because their IP ranges and browser fingerprints differ from real desktop Chrome sessions.
JavaScript rendering is the most common reason scrapers fail on modern sites — the HTML your scraper fetches is empty because the content loads after JavaScript executes. Scraping dynamic websites with a scraper API works, but the success rate depends heavily on the target site:
| Site Type | API Success Rate (with JS rendering) | Why |
|---|---|---|
| Static content sites (news, blogs) | ~90%+ | No bot detection layer, renders fine |
| General ecommerce (Shopify, WooCommerce) | ~75–85% | JS rendering works; some TLS fingerprinting |
| Amazon, Walmart | ~60–70% | Aggressive behavioral fingerprinting even with rendering |
| LinkedIn, Indeed | ~60–70% | CloudFront + TLS fingerprinting, data center IP blocks |
| Zillow, Realtor.com | ~50–65% | PerimeterX blocks most API traffic even headless |
| Google SERPs (via SerpApi/DataForSEO) | ~92%+ | Specialized APIs built specifically for Google |
The fundamental limit: headless browsers running in a cloud datacenter still have different TLS handshake signatures, IP reputation, and browser entropy than a real Chrome on a residential IP. Sites using advanced bot detection (PerimeterX, Akamai Bot Manager, Cloudflare Enterprise) fingerprint all of this. A real Chrome extension running in your laptop's Chrome session — on your home or office IP — passes these checks because it literally is what the detection is looking for.
Scraping a site that blocks every API you've tried?
Clura runs inside your real Chrome browser — same IP, same fingerprint as a normal user. Works on LinkedIn, Indeed, Zillow, and other heavily guarded sites where cloud APIs fail. Export to CSV in minutes.
Add to Chrome — Free →How to Use a Web Scraping API in Python (ScraperAPI Example)
To use ScraperAPI in Python: send a GET request to their endpoint with your API key and target URL as parameters. For JavaScript-rendered pages, add render=true. Parse the returned HTML with BeautifulSoup. The entire setup takes under 30 minutes and requires no proxy configuration on your end.
Here's a minimal working Python example using ScraperAPI to fetch a product listing page with JavaScript rendering enabled:
| Step | Code |
|---|---|
| Install | pip install requests beautifulsoup4 |
| Import | import requests from bs4 import BeautifulSoup |
| Set API key | API_KEY = "your_api_key_here" |
| Build request URL | url = f"http://api.scraperapi.com?api_key={API_KEY}&url=TARGET_URL&render=true" |
| Fetch and parse | response = requests.get(url) soup = BeautifulSoup(response.text, 'html.parser') |
| Extract data | title = soup.find('h1').text # adapt selector to target site |
The key parameters for ScraperAPI: api_key (required), url (the target page URL — URL-encoded), render (true/false for JavaScript rendering), country_code (optional — route through a specific country), premium (true to use residential IP pool rather than datacenter). Premium requests cost 10x standard credit usage.
Handling Errors and Retries
ScraperAPI returns HTTP 200 with a CAPTCHA page body when it gets blocked — not HTTP 403. Always check for CAPTCHA signals in the response body ('captcha' or 'Access Denied' in the HTML) rather than relying on status codes. Build a retry wrapper that sleeps 5–10 seconds between attempts and switches to premium residential IPs if standard requests fail more than twice on the same URL.
API credit consumption trap: render=true on ScraperAPI uses 5–25x the credits of a basic request depending on page complexity. A plan rated at '100k calls/month' effectively gives you 4,000–20,000 rendered page fetches per month. Model this before committing to a plan tier.
When Is a Scraper API Not Worth It?
A scraper API is not worth it for: exports under 1,000 URLs per month (the setup cost exceeds time saved), sites where even API rendering fails (LinkedIn, Zillow), one-off research tasks where a Chrome extension delivers the result in 2 minutes, and use cases that require a logged-in browser session the API can't access.
Scraper APIs solve the infrastructure problem — proxies, rendering, CAPTCHA. They don't solve the fingerprinting problem on heavily guarded sites, and they add cost and latency to low-volume use cases that don't need infrastructure at all. Here's when to skip the API entirely:
| Scenario | Why the API Is Overkill or Insufficient | Better Approach |
|---|---|---|
| One-off export (< 500 URLs) | 30-min setup + billing for what takes 2 minutes manually | Chrome extension (Clura) — done in 2 minutes, free |
| Logged-in data (LinkedIn connections, Indeed private jobs) | API can't access sessions behind authentication | Browser extension running in your logged-in Chrome session |
| Sites using PerimeterX or Akamai Bot Manager (Zillow, Ticketmaster) | Even residential API IPs get flagged — block rates 40–60% | Real Chrome session via extension or paid Bright Data Scraping Browser |
| < 5,000 pages/month budget of $0 | Free tiers are tiny (5k calls = ~1k rendered pages) | DIY Playwright + free proxy tier, or Chrome extension free plan |
| Scheduled automation of a single site, small volume | Apify/ScraperAPI overkill — proxy overhead for 50 URLs/day | Playwright + free residential proxy trial, or Clura automations |
| Price monitoring at scale (10k+ SKUs/day) | APIs are the right call — cost per URL is predictable | ScraperAPI Premium or Bright Data at volume pricing |
For large-scale, scheduled, unattended scraping — scraper APIs are the right infrastructure choice. For anything else, the per-call billing compounds fast and the block rates on protected sites disappoint. The full web scraping guide covers the complete spectrum of approaches from DIY Python to fully managed services.
What Are the Best ScraperAPI Alternatives?
The best ScraperAPI alternatives for general web scraping: Zyte at $30/mo (best for Scrapy users), Apify at $49/mo (best for workflow automation), Bright Data at $500+/mo (best block rate of any cloud API at ~15%). For Google SERP data specifically, SerpApi and DataForSEO outperform all general-purpose scraper APIs.
If ScraperAPI's rendering reliability or block rate isn't meeting your needs, here's where to move:
- Zyte (formerly Scrapy Cloud) — $30/mo starter, purpose-built for Python/Scrapy workflows. Their Zyte API uses AI-based extraction that adapts to page changes automatically. Better for long-running Python projects than ScraperAPI.
- Apify — $49/mo, same price as ScraperAPI Growth but adds a full actor marketplace. If you want pre-built scrapers for specific sites (Amazon, Booking.com, LinkedIn), Apify has them. Quality varies by actor maintainer.
- Bright Data — The highest cost and the lowest block rate (~15%). Scraping Browser product runs real Chrome with built-in fingerprint spoofing. Only worth the $500+/mo if you're scraping at enterprise volumes or targeting specifically hard sites.
- SerpApi / DataForSEO — Not general-purpose, but if your use case is Google Jobs, Google Shopping, or SERP rankings, these purpose-built APIs outperform every general scraper API on reliability and structured output.
- Chrome extension (Clura) — Not an API, but worth including for completeness. Runs in real Chrome on your machine. ~4% block rate because it is a real browser. No billing per request. Full guide to Clura as an AI web scraper.
Frequently Asked Questions
What is a scraper API?
A scraper API is a hosted endpoint you call with a target URL. The service fetches the page through its own proxy and browser infrastructure, handles bot detection and CAPTCHA solving, and returns the rendered HTML or structured data. You pay per request or per monthly volume tier rather than managing proxies and headless browsers yourself.
Is there a free scraper API?
Yes — ScraperAPI offers 5,000 API calls per month free, Apify gives $5 in platform credits monthly, Zyte provides 10,000 free requests, and SerpApi allows 100 Google SERP queries per month at no cost. Free tiers are useful for testing but deliver 1,000–5,000 rendered pages per month at most, since JavaScript rendering consumes 5–10x standard credits.
What is the best web scraping API?
It depends on the use case: ScraperAPI is the best entry-point for general HTML scraping ($49/mo, 100k calls). Apify is best for workflow automation and pre-built site scrapers ($49/mo). Bright Data has the best block rate of any cloud API (~15%) but costs $500+/mo. SerpApi is best specifically for Google SERP data ($50/mo, 5k queries). For on-demand, no-code exports without billing, a Chrome extension like Clura is faster.
How does a web scraping API handle JavaScript-rendered pages?
Most paid scraper APIs include headless browser rendering — you pass render=true (ScraperAPI), enable Playwright rendering (Apify), or use the Scraping Browser product (Bright Data). Block rates on heavily guarded sites like LinkedIn and Indeed are still 15–20% even with rendering, because cloud-hosted browsers have different IP and fingerprint signatures than real desktop Chrome sessions.
Is ScraperAPI worth it?
ScraperAPI is worth it for high-volume, scheduled scraping (10,000+ pages/month) where you need reliable infrastructure without managing proxies yourself. It's not worth it for low-volume or one-off exports (the $49/mo minimum doesn't make sense at under 1,000 URLs/month), heavily guarded sites where even their rendering gets blocked, or any use case requiring a logged-in browser session.
Can I use a scraper API without coding?
Not really — scraper APIs require you to write code to call the endpoint and parse the returned HTML. Apify is the closest exception, with a marketplace of pre-built actors that reduce coding to configuration. For a genuinely no-code approach, a Chrome extension like Clura runs inside your browser and exports data to CSV with no code or API key required.
What is the difference between a scraper API and a web scraper?
A web scraper is code you write and run (Python with Playwright, Scrapy, BeautifulSoup). A scraper API is a hosted service that does the fetching and rendering for you — you only write the parsing logic. Scraper APIs trade lower setup time and infrastructure overhead for per-request billing and slightly lower block rates compared to a self-managed, well-configured scraper.
Do scraper APIs work on LinkedIn and Indeed?
With limitations. LinkedIn and Indeed use CloudFront with aggressive TLS fingerprinting that identifies cloud API traffic at ~18–20% block rates even with residential IPs. Bright Data Scraping Browser reaches ~15% on Indeed. A Chrome extension running in a real browser session achieves ~4% because the fingerprint is indistinguishable from a normal user. For large-volume LinkedIn or Indeed scraping, budget for higher block rates and retry infrastructure.
Conclusion
Scraper APIs solve a real problem: managing proxy pools, headless browsers, and CAPTCHA solvers at scale is engineering overhead that doesn't produce direct value. For high-volume, scheduled pipelines — thousands of pages per day, running unattended — the monthly billing is justified by what it replaces.
For everything else, the math changes fast. Low volumes don't amortize the minimum monthly fee. Heavily guarded sites (LinkedIn, Zillow, Indeed) still achieve 15–20% block rates even with residential API IPs. One-off exports take longer to API-configure than to do manually with a browser tool. Choose the API when the infrastructure cost is the bottleneck; choose a browser-native approach when the bottleneck is something else.
Explore related guides:
- Scraping Dynamic Websites — why JavaScript-rendered pages break most scrapers and how to fix it
- AI Web Scraper Chrome Extension — the no-code alternative to scraper APIs — runs in real Chrome, exports to CSV
- Why Scrapers Get Blocked — TLS fingerprinting, IP reputation, and why proxies alone don't fix block rates
- Web Scraping Guide — the complete overview of scraping methods, tools, and when to use each
- Price Scraper Guide — scraping Amazon, Walmart, and Google Shopping prices — API vs. browser approach
- Lead Scraper — scraping leads from directories, LinkedIn, and Google Maps
Want website data without an API key or monthly billing?
Clura runs inside your Chrome browser and exports any page to CSV in minutes — no proxy setup, no per-request fees, no code. Works on sites where scraper APIs fail.
Add to Chrome — Free →