Run AI-powered, unblockable web scraping, data extraction with natural language via the MrScraper API
数据来源:ClawHub。 在 ClawSkills 查看
选择你使用的 Agent
方法一:命令行安装(推荐)
推荐(无需提前安装 clawhub)
npx clawhub@latest --dir ~/.claude/skills install mrscraper或使用 clawhub CLI(需提前安装)
clawhub --dir ~/.claude/skills install mrscraper⚠️ 需要 Node.js 18+,没有 Node?请使用下方方法二直接下载 ZIP。 安装 Node.js →
方法二:手动下载安装(无需 Node)
下载 ZIP,解压后将文件夹放到以下路径,重启 Agent 即可:
安装路径
~/.claude/skills/mrscraper/💡解压后将文件夹放到上方路径,重启 Agent 即可生效
--- name: mrscraper description: Run AI-powered, unblockable web scraping, data extraction with natural language via the MrScraper API tags: [scraping, data-extraction, web-crawling, stealth-browser, web-automation]
homepage: https://mrscraper.com/ vendor: MrScraper support_email: [email protected]
required_env_vars: [MRSCRAPER_API_TOKEN] primary_credential: MRSCRAPER_API_TOKEN
metadata: {"openclaw":{"requires":{"env":["MRSCRAPER_API_TOKEN"]},"primaryEnv":"MRSCRAPER_API_TOKEN"}}
network: {"allowed_hosts":["api.mrscraper.com","api.app.mrscraper.com","sync.scraper.mrscraper.com"]} ---
Run AI-powered, unblockable web scraping, data extraction with natural language via the MrScraper API
This skill supports:
This skill is API-only and does not depend on bundled local scripts.
https://api.mrscraper.comhttps://api.app.mrscraper.comUse query-param auth on unblocker endpoint:
token=Use header-based auth on platform endpoints:
x-api-token: <MRSCRAPER_API_TOKEN>
accept: application/json
content-type: application/json
MRSCRAPER_API_TOKEN?An API token lets your applications securely interact with MrScraper APIs and rerun scrapers created in the dashboard.
Follow these steps in the dashboard:
MRSCRAPER_API_TOKEN.x-api-token header.Security rule:
Notes from the auth docs:
sync.scraper.mrscraper.com.[email protected].scripts/ are required.api.app.mrscraper.com and api.mrscraper.com.GEThttps://api.mrscraper.comtoken query parameterOpens a target URL through stealth browsing and IP rotation, then returns HTML. Use this when direct access is blocked by captcha or anti-bot protections.
| Field | Type | Required | Default | Description | | ---------------- | --------- | -------- | ------- | --------------------------------------- | | token | string | Yes | — | Unblocker token (MRSCRAPER_API_TOKEN) | | url | string | Yes | — | URL-encoded target URL | | timeout | number | No | 60 | Max wait in seconds (example 120) | | geoCode | string | No | None | Geographic routing code (example SG) | | blockResources | boolean | No | false | Block non-essential resources |
curl --location 'https://api.mrscraper.com?token=<MRSCRAPER_API_TOKEN>&timeout=120&geoCode=SG&url=https%3A%2F%2Fwww.lazada.sg%2Fproducts%2Fpdp-i111650098-s23209659764.html&blockResources=false'
<!doctype html>
<html>
<head>...</head>
<body>...</body>
</html>
geoCode and practical timeouts for repeatable behavior.POSThttps://api.app.mrscraper.com/api/v1/scrapers-aix-api-tokenCreate a new AI scraper run from natural-language instructions.
agent: general or agent: listing):| Field | Type | Required | Default | Description | | -------------- | -------- | -------- | -------- | ---------------------------------------------------------- | | url | string | Yes | — | Target URL | | message | string | Yes | — | Extraction instruction | | agent | string | No | general | The AI agent type to use for scraping: general, listing, or map | | proxyCountry | string | No | None | ISO country code for proxy-based scraping |
agent: map):| Field | Type | Required | Default | Description | | ----------------- | -------- | -------- | --------- | ------------------------------------------------------------------------------------------------------------- | | url | string | Yes | — | Target URL | | agent | string | No | map | The AI agent type to use for scraping (for this case it is map) | | maxDepth | number | No | 2 | Maximum depth level for crawling links from the starting URL.
0 = only the starting URL, 1 = +direct links | | maxPages | number | No | 50 | Maximum number of pages to scrape during the crawling process. | | limit | number | No | 1000 | Maximum number of data records to extract across all pages. Scraping stops when this limit is reached. | | includePatterns | string | No | "" | Regex patterns to include (separate multiple with \|\|) | | excludePatterns | string | No | "" | Regex patterns to exclude (separate multiple with \|\|) |
curl -X POST "https://api.app.mrscraper.com/api/v1/scrapers-ai" \
-H "x-api-token: <MRSCRAPER_API_TOKEN>" \
-H "Content-Type: application/json" \
-d '{
"url": "https://books.toscrape.com/catalogue/a-light-in-the-attic_1000/index.html",
"message": "Extract title, price, stocks, and rating",
"agent": "general"
}'
{
"id": "497f6eca-6276-4993-bfeb-53cbbbba6f08",
"createdAt": "2019-08-24T14:15:22Z",
"createdById": "e13e432a-5323-4484-a91d-b5969bc564d9",
"updatedAt": "2019-08-24T14:15:22Z",
"updatedById": "d8bc6076-4141-4a88-80b9-0eb31643066f",
"deletedAt": "2019-08-24T14:15:22Z",
"deletedById": "8ef578ad-7f1e-4656-b48b-b1b4a9aaa1cb",
"userId": "2c4a230c-5085-4924-a3e1-25fb4fc5965b",
"scraperId": "6695bf87-aaa6-46b0-b1ee-88586b222b0b",
"type": "AI",
"url": "http://example.com",
"status": "Finished",
"error": "string",
"tokenUsage": 0,
"runtime": 0,
"data": {}, // MAIN SCRAPED DATA
"htmlPath": "string",
"recordingPath": "string",
"screenshotPath": "string",
"dataPath": "string"
}
...
安装 MrScraper 后,可以对 AI 说这些话来触发它
Help me get started with MrScraper
Explains what MrScraper does, walks through the setup, and runs a quick demo based on your current project
Use MrScraper to run AI-powered, unblockable web scraping, data extraction with natu...
Invokes MrScraper with the right parameters and returns the result directly in the conversation
What can I do with MrScraper in my data & analytics workflow?
Lists the top use cases for MrScraper, with example commands for each scenario
将技能文件夹放到 ~/.claude/skills/mrscraper/ 目录(个人级,所有项目可用),或 .claude/skills/mrscraper/(项目级)。重启 AI 客户端后,用 /mrscraper 主动调用,或让 AI 根据上下文自动发现并使用。
MrScraper 支持 Claude、Cursor、OpenClaw,可与这些 AI 平台无缝集成,扩展其能力。
MrScraper 可免费安装使用。请查阅仓库了解许可证信息。
Run AI-powered, unblockable web scraping, data extraction with natural language via the MrScraper API
MrScraper 属于「Data & Analytics」分类,该分类的技能帮助 AI 智能体在此领域执行专业任务。
Automate my data & analytics tasks using MrScraper
Identifies repetitive steps in your workflow and sets up MrScraper to handle them automatically