Benchmark

Top 7 Video Scrapers: Tested & Ranked

updated on Jul 2, 2026

We tested the top 7 video scraping providers to see how they handle video metadata on the top video platform, totaling 6,000 requests, and measured their success rate, response time, and metadata fields.

Provider

For

Bright Data

Streamed batch scraping with 48 metadata fields

Oxylabs

Object-level YouTube scraping (metadata, subtitles, channels, downloads)

Decodo

Dedicated templates for metadata, search, subtitles, and channels

SerpApi

Fast delivery with parsed JSON

Video scraping benchmark results

To see how we calculated these metrics, read video scraping benchmark methodology.

What data you can scrape from video platforms

Different providers return different amounts of metadata for the same video URL. JSON providers give you parsed fields you can use directly; HTML providers return the rendered page, so you pull the fields you need with CSS selectors.

The table below lists the metadata fields each provider returned for one video URL, highlighting the ones unique to that provider.

Provider	Format	Provider-only fields	Example unique fields
Bright Data	JSON	37	formatted_transcript, audio_tracks, codecs, quality_label, ai_content_label, is_age_restricted, license, made_for_kids, music, category, recommended_videos
Apify	JSON	12	translatedTitle, translatedText, subtitles, descriptionLinks, collaborators, isMonetized, isMembersOnly, commentsTurnedOff, location, channelUsername, type
Decodo	JSON	5	formats (raw stream URLs by resolution), age_limit, is_live, is_transcript_available, generated_subtitle_languages
SerpApi	JSON	1	search_meta (the SerpApi-side metadata block)
Oxylabs	HTML	—	Parsed via CSS selectors on the rendered page
Nimble	HTML	—	Parsed via CSS selectors on the rendered page
Zyte	HTML	—	Parsed via CSS selectors on the rendered page

Beyond the unique fields shown, every JSON provider also returns the common video metadata you would expect: title, description, view count, like count, comment count, publish date, duration, channel name, channel URL, subscriber count, thumbnails, tags, and related videos. The HTML providers expose the same data, just through CSS selectors on the rendered page.

Video scrapers free trial

Vendor	Free trial
Bright Data	5K records per month
Oxylabs	7-day free trial
Decodo	3-day trial (100 MB)
SerpApi	250 searches per month
Zyte	$5 credits
Apify	$5 credits
Nimble	5,000 requests (one-time)

Get our team to automate one of your business processes with AI agents, free of charge.

Automate a process

Video scrapers & benchmark results

Bright Data

Bright Data returned 48 parsed fields per URL, the highest field count of any provider tested. Its Dataset API supports batch streaming, where a large set of URLs is submitted in a single trigger, scraped in parallel server-side, and streamed back to a webhook in chunks as each batch of results becomes ready. This is the API’s native mode and the way Bright Data is designed to be used at scale.

In video scraping benchmark, all 1,000 URLs were submitted in one call and the full run finished in about 17 minutes, giving an amortized per-URL time of 1 second, the fastest result in the benchmark. When called one URL at a time through the async trigger, poll, and fetch cycle instead, each request takes roughly 70 seconds.

Bright Data offers many video scrapers and ready made datasets on the Dataset Marketplace.

Ready to use datasets:

Videos Posts: titles, URLs, creators, length, likes, views, and comments
Channels: public channel info, including views, subscribers, and creation dates
Comments: comment text, likes, replies, and parent video details

Video scrapers:

Videos posts, collect by URL: pulls a video by its watch URL
Videos posts, discover by explore: discovers videos through the Explore page
Videos posts, discover by hashtag: collects videos tagged with a hashtag
Videos posts, discover by keyword: searches for videos by keyword
Videos posts, discover by podcast URL: surfaces podcast linked videos
Videos posts, discover by search filters: keyword search with video filters applied
Videos posts, discover by URL: discovers videos by channel URL

Channel scrapers:

Channels, collect by URL: extracts channel details from a channel URL
Channels, discover by keyword: finds channels via keyword search

Comment scrapers:

Comments, collect by URL: collects comments for a video by URL

For video scraping benchmark we used the Video posts, collect by url scraper.

Oxylabs

Oxylabs averaged 17 seconds per URL in the benchmark, returning the watch page as rendered HTML for the four target fields to be extracted client-side. Oxylabs provides a Web Scraper API with eight YouTube-specific sources, each targeting a different object on the platform:

search: up to 20 search results for a query
search_max: up to 700 search results for a query
metadata: metadata of a single video
subtitles: subtitle track of a single video
download: audio or video stream of a single video
video_trainability: whether a video is eligible for AI training
channel: full channel data including video list
autocomplete: search-bar suggestions for a term

There is also a universal scraper with render=html for cases where none of the dedicated sources fit, which renders the page in a headless browser and returns the HTML.

For the video scraping benchmark we sent each video URL through the universal source with render=html, then parsed the rendered watch page to pull title, channel, view count, and duration.

Decodo

Decodo is the second-fastest provider tested at 4 seconds per URL, returning 20 parsed fields, five of them exclusive to Decodo. It has four scraper templates dedicated to video platform, each covering a different object on the platform:

Metadata: titles, durations, views, channel info and more for a single video
Search: up to 20 search results for a query
Subtitles: full subtitles and captions of a video for analysis or indexing
Channel: channel metadata, video lists and engagement metrics for creator analysis

Metadata accepts a video ID via the query parameter and returns structured JSON containing title, channel, view count, duration, upload date, like count, and the remaining metadata fields. This is the template we used in the video scraping benchmark.

SerpApi

SerpApi‘s Video API was the fastest provider in the benchmark at 1 second per URL, returning 17 parsed fields. It exposes three YouTube engines, each available as a single GET against https://serpapi.com/search.json:

Video API : per-video details including title, channel, views, likes, published date, description, chapters, related videos, and pagination tokens for comments
Search API : search results for a query, with upload-date, length, and quality filters via the sp parameter
Video Transcript API : the transcript of a video by ID, with snippets, start/end timestamps, and language details

All three return parsed JSON in one synchronous call and accept gl (country) and hl (language) for localization. Video API accepts a video ID via the v parameter and returns the full payload in a single GET, and with no_cache=true added to bypass the one-hour SerpApi cache, this is the engine that powered SerpApi’s role in the video scraping benchmark.

Apify

Apify’s Video scraper took the longest at 21 seconds per URL but produced the richest payload of any provider tested, with 28 parsed fields.

Apify has six dedicated scraper actors in their marketplace, maintained by the Streamers team, each targeting a different object on the platform:

Video scraper: full per-video metadata including channel name, likes, views, and subscriber counts
Comments scraper: comment text, posting date, author username, and parent video info
Channel scraper: channel info such as subscriber count, total video count, total views, and creation date
Shorts scraper: short-form video data including caption, timestamps, likes, dislikes, views, and comment counts
Hashtag video scraper: video records discovered by hashtag, with the same per-video fields
Video downloader: MP4, MP3 and other format downloads pushed directly to cloud storage

Every actor accepts URLs or search terms as input and returns parsed JSON, CSV or Excel. The Video scraper is the actor we ran in the video scraping benchmark, called via the standard Apify /acts/{actor}/runs endpoint with a single video URL per startUrls entry, polled to completion, and read from the run’s dataset items.

Nimble

Nimble averaged 18 seconds per URL in the benchmark, returning rendered HTML rather than parsed fields. For web pages they offer the Extract API: any URL goes in, anti-bot evasion and proxy rotation happen on Nimble’s side, and a stealth browser driver (we picked vx10) renders the page before returning the HTML.

Pulling the metadata out of that response was a client-side job: locate the embedded ytInitialPlayerResponse JSON inside the HTML, walk into videoDetails, and read off title, channel author, view count, and duration in seconds.

Zyte

Zyte returned each URL in 9 seconds via its browserHtml mode, leaving metadata extraction to the client.

Zyte has a single Zyte API endpoint configured per request with payload flags. The httpResponseBody flag returns raw HTTP without running scripts, which works for static pages but misses content on a JS-hydrated video page. Switching to browserHtml: true boots a real browser, executes the page’s JavaScript, and returns the post-hydration HTML. From there the extraction matches what Nimble’s pipeline needed: grab ytInitialPlayerResponse from a <script> tag, balance-brace the JSON to its closing }, parse it, and lift the four target fields from videoDetails.

Video scraping benchmark methodology

We tested 6 video scraping providers on 1,000 unique video URLs, sending one URL per request and recording the response. All URLs were verified to be live at the time the benchmark was run, so a removed-video edge case did not need to be handled in the validation logic.

The 1,000 URLs were in canonical watch?v=… form. Channel pages, playlists, and short-form videos were excluded so every entry passed to every provider was the same kind of object.

Each provider was configured to use the URL-input mode its API supports:

Decodo: Video Metadata template, video ID passed via query, parsed JSON.
Bright Data: Video posts, collect by url scraper, running in the API’s native batch streaming setup. The full URL list went in as a single trigger with chunked webhook delivery, and per URL numbers are the batch throughput averaged across the run.
SerpApi: Video API engine, video ID passed via v, with no_cache=true so cached responses were never served.
Apify: Video scraper actor via /acts/{actor}/runs with the URL in startUrls. The run was polled until completion and the dataset items were read once it finished.
Oxylabs: Web Scraper API with source=universal and render=html. The previously documented youtube_metadata source now returns an unsupported-source error, so the universal scraper with rendered HTML was used instead.
Nimble: Extract API with render=true and the vx10 stealth browser driver, returning rendered HTML.
Zyte: Zyte API with browserHtml: true, returning post-hydration HTML.

A response was counted as valid when at least one of four fields was returned in a usable format: title as a non-empty string, view_count as a non-negative integer (or a string that parses as one), duration as either an MM:SS string or an integer of seconds, or published as a date string (either an exact date or a relative phrase such as “3 weeks ago”). A single field in correct form was enough to count the call as successful, because that already shows the provider reached the page and completed the scrape.

Three of the seven providers returned rendered HTML rather than parsed JSON. For those responses, the validator located the embedded ytInitialPlayerResponse script and read the videoDetails object, applying the same check to its four fields: title, author, viewCount, and lengthSeconds.

HTTP 429 responses triggered a 30-second back-off and were retried up to three times. For each call, the wall-clock time from submission to a usable response was recorded, then averaged across the 1,000 URLs to produce the per-provider end-to-end time. The boolean validation result was averaged the same way to produce the per-provider success rate.

See more of our benchmarks and data-driven insights in Google Search.

Add as preferred source

FAQs

None of the providers expose a time series of past view counts directly. You can build one by scraping the same video URL on a schedule and storing the snapshots yourself; daily or hourly cron is usually enough for trend analysis.

Search returns a ranked list of videos for a keyword, with shallow metadata per result. URL scraping returns deep metadata for a specific video you already know about. Search is for discovery; URL scraping is for monitoring a known set of items.

Public, non-personal data is generally legal to scrape in most jurisdictions, but every platform’s Terms of Service forbid automated access. The legal risk increases if you scrape personal data (comments tied to identifiable users), if you redistribute the raw video content, or if you bypass authentication. Consult a lawyer for high-stakes use cases.

No. Every provider in the benchmark manages its own proxy pool and anti-bot evasion. You authenticate with an API key and send the target URL or video ID; the proxy layer is invisible to the caller.

Cite this benchmark

Pick the format that matches where you're publishing. Pasting the link version into your CMS preserves the backlink.

Nazlı Şipi (2026) - "Top 7 Video Scrapers: Tested & Ranked". Published online at AIMultiple.com. Retrieved July 2, 2026, from: https://aimultiple.com/video-scraper [Online Resource]

Şipi, N. (2026, July 2). Top 7 Video Scrapers: Tested & Ranked. AIMultiple. https://aimultiple.com/video-scraper

@misc{sipi2026,
  author = {Şipi, Nazlı},
  title  = {{Top 7 Video Scrapers: Tested & Ranked}},
  year   = {2026},
  month  = jul,
  howpublished    = {\url{https://aimultiple.com/video-scraper}},
  note   = {AIMultiple. Retrieved July 2, 2026}
}

Nazlı Şipi

AI Researcher

Follow On

Nazlı is a data analyst at AIMultiple. She has prior experience in data analysis across various industries, where she worked on transforming complex datasets into actionable insights.

View Full Profile