Extraction Connectors

Connectors for turning public URLs into structured content, metadata, and readable summaries without building your own scraper stack.

Who it's for

•Researchers extracting article content for analysis
•Marketing teams collecting source material from public URLs
•Agents that need normalized page content in downstream steps
•Operators avoiding custom scraper maintenance and auth setup

Available connectors

URL Extract

Extract readable main content from Xiaohongshu and other web pages.

View details →

Frequently asked questions

Can extraction connectors read private pages?

No. They only work on publicly accessible URLs that Vernclaw can reach from the managed runtime.

Do they return raw HTML?

The goal is normalized content and metadata, not a full raw HTML dump.

What happens on JavaScript-heavy pages?

Some JavaScript-heavy pages may return partial content if rendering fails or the readable body cannot be reconstructed.

Explore more

All connectors SEO Intelligence Social Readers Content Generation Web Search Connector docs CLI guide