Convert any webpage into structured JSON data using AI. Scrape websites, extract data into custom JSON schemas, and call saved APIs programmatically. Useful for web scraping, data extraction, content monitoring, lead generation, price tracking, and building data pipelines.
Source: ClawHub.
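Method 1: Command-line install (recommended)
Recommended (no need to install clawhub beforehand):
npx clawhub@latest --dir ~/.claude/skills install webscraper-pulpminer
Or use the clawhub CLI (must be installed beforehand):
clawhub --dir ~/.claude/skills install webscraper-pulpminer
⚠️ Requires Node.js 18+. If you don't have Node, use Method 2 below to download the ZIP directly.
Method 2: Manual download and install (no Node required)
Download the ZIP, unzip it, place the folder at the path below, and restart the Agent for it to take effect:
Install path
~/.claude/skills/webscraper-pulpminer/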
---
name: pulpminer
description: "Convert any webpage into structured JSON data using AI. Scrape websites, extract data into custom JSON schemas, and call saved APIs programmatically. Useful for web scraping, data extraction, content monitoring, lead generation, price tracking, and building data pipelines."
emoji: ⛏️
homepage: https://pulpminer.com
metadata: {"clawdbot":{"requires":{"env":["PULPMINER_API_KEY"]},"config":["pulpminer_api_key"]}}
---
PulpMiner converts any webpage into structured JSON using AI. You provide a URL and optionally a JSON template, and PulpMiner scrapes the page, runs it through an LLM, and returns clean structured data.
All API calls require the apikey header:
apikey: <PULPMINER_API_KEY>
Get your API key from https://pulpminer.com/api — click "Regenerate Key" if you don't have one.
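PulpMiner works in two phases: you first create and save an API at https://pulpminer.com/api, then you call that saved API by its ID. To call a saved static API: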
curl -X GET "https://api.pulpminer.com/external/<apiId>" \
-H "apikey: <PULPMINER_API_KEY>"
Returns JSON extracted from the configured webpage.
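The same static call can be made from Python. The following is a minimal sketch using only the standard library; the helper names `build_static_request` and `fetch_json` and the 60-second timeout are illustrative assumptions, not part of PulpMiner's documentation.

```python
import json
import urllib.request

BASE_URL = "https://api.pulpminer.com/external"


def build_static_request(api_id: str, api_key: str) -> urllib.request.Request:
    """Build the GET request for a saved static API (hypothetical helper)."""
    return urllib.request.Request(
        f"{BASE_URL}/{api_id}",
        headers={"apikey": api_key},
        method="GET",
    )


def fetch_json(request: urllib.request.Request, timeout: float = 60.0) -> dict:
    """Send the request and decode the JSON body (timeout is an arbitrary choice)."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

A call would then look like `fetch_json(build_static_request("<apiId>", os.environ["PULPMINER_API_KEY"]))`, with `os` imported and the key read from the environment rather than hard-coded.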
For APIs saved with template URLs like https://example.com/search?q={{query}}&page={{page}}:
curl -X POST "https://api.pulpminer.com/external/<apiId>" \
-H "apikey: <PULPMINER_API_KEY>" \
-H "Content-Type: application/json" \
-d '{"query": "javascript frameworks", "page": "1"}'
The {{variable}} placeholders in the saved URL get replaced with the values you provide.
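As a rough Python equivalent of the dynamic call, the sketch below builds the POST request and checks locally that every placeholder in the saved URL has a value. Both helper names are hypothetical, and the placeholder pattern assumes variable names consist of word characters with no spaces inside the braces; the actual substitution happens on PulpMiner's side.

```python
import json
import re
import urllib.request

BASE_URL = "https://api.pulpminer.com/external"
# Assumed placeholder shape: {{name}} with a word-character name.
PLACEHOLDER = re.compile(r"\{\{(\w+)\}\}")


def missing_variables(template_url: str, variables: dict) -> list:
    """Return placeholder names in the saved URL that have no value supplied."""
    return [name for name in PLACEHOLDER.findall(template_url) if name not in variables]


def build_dynamic_request(api_id: str, api_key: str, variables: dict) -> urllib.request.Request:
    """Build the POST request for a saved dynamic API (hypothetical helper)."""
    return urllib.request.Request(
        f"{BASE_URL}/{api_id}",
        data=json.dumps(variables).encode("utf-8"),
        headers={"apikey": api_key, "Content-Type": "application/json"},
        method="POST",
    )
```

Running `missing_variables` before sending the request catches a forgotten variable early instead of relying on the API to report it.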
Successful responses return:
{
"data": { ... },
"errors": null
}
Error responses return:
{
"data": null,
"errors": "Error message describing what went wrong"
}
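Because both response shapes share the `data` and `errors` keys, a caller can unwrap them in one place. A minimal sketch in Python, assuming the body has already been decoded into a dict; `PulpMinerError` and `unwrap` are hypothetical names introduced here for illustration.

```python
class PulpMinerError(Exception):
    """Raised when the response envelope carries an error message."""


def unwrap(payload: dict):
    """Return payload["data"], or raise if "errors" is set."""
    errors = payload.get("errors")
    if errors is not None:
        raise PulpMinerError(str(errors))
    return payload.get("data")
```

Raising on a non-null `errors` value keeps downstream code from silently processing a `null` data field.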
When creating a saved API at https://pulpminer.com/api, you can configure:
| Option | Description |
|--------|-------------|
| URL | The webpage to scrape |
| JSON Template | Optional JSON structure for the LLM to follow (e.g., {"name": "", "price": ""}) |
| Render JS | Enable for SPAs and JS-heavy pages (uses headless browser) |
| CSS Selector | Extract only a specific part of the page (e.g., .product-list, #main-content) |
| Extra Instructions | Additional guidance for the AI (e.g., "Only extract items with prices above $50") |
| Dynamic URL | Enable template variables in the URL with {{variable}} syntax |
| Cache | Toggle response caching on/off |
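Since the JSON Template is guidance for an LLM, it may be worth checking that the returned data actually contains the keys you asked for. The sketch below is a hypothetical helper, not something PulpMiner provides; it assumes the extracted data is a single JSON object and only inspects top-level keys.

```python
def missing_template_keys(template: dict, data: dict) -> list:
    """List top-level template keys that are absent from the returned data."""
    return [key for key in template if key not in data]
```

For the template `{"name": "", "price": ""}`, a result that lacks `price` would be reported as `["price"]`, which a pipeline could treat as a signal to retry or to tighten the Extra Instructions.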
For async scraping in Zapier workflows:
# Static API
curl -X POST "https://api.pulpminer.com/external/zapier/get/<apiId>" \
-H "apikey: <PULPMINER_API_KEY>" \
-d '{"callbackURL": "https://hooks.zapier.com/..."}'
# Dynamic API
curl -X POST "https://api.pulpminer.com/external/zapier/post/<apiId>" \
-H "apikey: <PULPMINER_API_KEY>" \
-d '{"callbackURL": "https://hooks.zapier.com/...", "query": "value"}'
Returns 201 immediately. Sends scraped data to the callback URL when complete.
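The two Zapier calls differ only in the path segment and in whether URL variables accompany `callbackURL`. A minimal Python sketch of that distinction follows; `build_zapier_request` is a hypothetical helper, and the `Content-Type: application/json` header is an assumption, since the curl examples above do not set it.

```python
import json
import urllib.request

BASE_URL = "https://api.pulpminer.com/external/zapier"


def build_zapier_request(api_id, api_key, callback_url, variables=None):
    """Build the async request: /get/<apiId> for static APIs, /post/<apiId> for dynamic ones."""
    mode = "post" if variables else "get"
    # The callback URL and any URL variables share one flat JSON body.
    body = {"callbackURL": callback_url, **(variables or {})}
    return urllib.request.Request(
        f"{BASE_URL}/{mode}/{api_id}",
        data=json.dumps(body).encode("utf-8"),
        headers={"apikey": api_key, "Content-Type": "application/json"},  # Content-Type is assumed
        method="POST",
    )
```

Both variants are sent with POST, matching the curl commands; the scraped data arrives later at the callback URL rather than in the response to this request.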
For n8n workflows, verify authentication first:
curl -X GET "https://api.pulpminer.com/external/n8n/auth" \
-H "apikey: <PULPMINER_API_KEY>"
Then use the standard /external/ endpoints for data fetching.
After installing PulpMiner Web Scraper - Convert Any Webpage to Realtime JSON API, you can trigger it by saying things like the following to your AI:
Help me get started with PulpMiner Web Scraper - Convert Any Webpage to Realtime JSON API
Explains what PulpMiner Web Scraper - Convert Any Webpage to Realtime JSON API does, walks through the setup, and runs a quick demo based on your current project
Use PulpMiner Web Scraper - Convert Any Webpage to Realtime JSON API to convert any webpage into structured JSON data using AI
Invokes PulpMiner Web Scraper - Convert Any Webpage to Realtime JSON API with the right parameters and returns the result directly in the conversation
What can I do with PulpMiner Web Scraper - Convert Any Webpage to Realtime JSON API in my developer & devops workflow?
Lists the top use cases for PulpMiner Web Scraper - Convert Any Webpage to Realtime JSON API, with example commands for each scenario
Automate my developer & devops tasks using PulpMiner Web Scraper - Convert Any Webpage to Realtime JSON API
Identifies repetitive steps in your workflow and sets up PulpMiner Web Scraper - Convert Any Webpage to Realtime JSON API to handle them automatically
Place the skill folder in ~/.claude/skills/webscraper-pulpminer/ (personal level, available to all projects) or in .claude/skills/webscraper-pulpminer/ (project level). After restarting the AI client, invoke it explicitly with /webscraper-pulpminer, or let the AI discover and use it automatically based on context.
PulpMiner Web Scraper - Convert Any Webpage to Realtime JSON API supports Claude, Cursor, and OpenClaw, and integrates with these AI platforms to extend their capabilities.
PulpMiner Web Scraper - Convert Any Webpage to Realtime JSON API is free to install and use. Check the repository for license information.
PulpMiner Web Scraper - Convert Any Webpage to Realtime JSON API belongs to the "Developer & DevOps" category; skills in this category help AI agents perform specialized tasks in this domain.