P

pdf-parser-mineru

PDF document parsing tool based on local MinerU, supports converting PDF to Markdown, JSON, and other machine-readable formats.

数据来源：ClawHub。在 ClawSkills 查看

1.4k下载量

0收藏数

1浏览量

安装

选择你使用的 Agent

方法一：命令行安装（推荐）

关于 pdf-parser-mineru

--- name: pdf-process-mineru description: PDF document parsing tool based on local MinerU, supports converting PDF to Markdown, JSON, and other machine-readable formats. ---

Tool List

1. pdf_to_markdown

Convert PDF documents to Markdown format, preserving document structure, formulas, tables, and images.

Description: Use MinerU to parse PDF documents and output in Markdown format, supporting OCR, formula recognition, table extraction, and other features.

Parameters:

file_path (string, required): Absolute path to the PDF file
output_dir (string, required): Absolute path to the output directory
backend (string, optional): Parsing backend, options: hybrid-auto-engine (default), pipeline, vlm-auto-engine
language (string, optional): OCR language code, such as en (English), ch (Chinese), ja (Japanese), etc., defaults to auto-detection
enable_formula (boolean, optional): Whether to enable formula recognition, defaults to true
enable_table (boolean, optional): Whether to enable table extraction, defaults to true
start_page (integer, optional): Start page number (starting from 0), defaults to 0
end_page (integer, optional): End page number (starting from 0), defaults to -1 meaning parse all pages

Return Value:

{
  "success": true,
  "output_path": "/path/to/output",
  "markdown_content": "Converted Markdown content...",
  "images": ["List of image paths"],
  "tables": ["List of table information"],
  "formula_count": 10
}

Examples:

python .claude/skills/pdf-process/script/pdf_parser.py \
  '{"name": "pdf_to_markdown", "arguments": {"file_path": "/path/to/document.pdf", "output_dir": "/path/to/output"}}'

# Use specific backend
python .claude/skills/pdf-process/script/pdf_parser.py \
  '{"name": "pdf_to_markdown", "arguments": {"file_path": "/path/to/document.pdf", "output_dir": "/path/to/output", "backend": "pipeline"}}'

# Parse specific pages
python .claude/skills/pdf-process/script/pdf_parser.py \
  '{"name": "pdf_to_markdown", "arguments": {"file_path": "/path/to/document.pdf", "output_dir": "/path/to/output", "start_page": 0, "end_page": 5}}'

---

2. pdf_to_json

Convert PDF documents to JSON format, including detailed layout and structural information.

Description: Use MinerU to parse PDF documents and output in JSON format, containing structured information such as text blocks, images, tables, formulas, etc.

Parameters:

file_path (string, required): Absolute path to the PDF file
output_dir (string, required): Absolute path to the output directory
backend (string, optional): Parsing backend, options: hybrid-auto-engine (default), pipeline, vlm-auto-engine
language (string, optional): OCR language code, such as en (English), ch (Chinese), ja (Japanese), etc., defaults to auto-detection
enable_formula (boolean, optional): Whether to enable formula recognition, defaults to true
enable_table (boolean, optional): Whether to enable table extraction, defaults to true
start_page (integer, optional): Start page number (starting from 0), defaults to 0
end_page (integer, optional): End page number (starting from 0), defaults to -1 meaning parse all pages

Return Value:

{
  "success": true,
  "output_path": "/path/to/output.json",
  "pages": [
    {
      "page_no": 0,
      "page_size": [595, 842],
      "blocks": [
        {
          "type": "text",
          "text": "Text content",
          "bbox": [x, y, x, y]
        }
      ],
      "images": [],
      "tables": [],
      "formulas": []
    }
  ],
  "metadata": {
    "total_pages": 10,
    "author": "Author",
    "title": "Title"
  }
}

Examples:

python .claude/skills/pdf-process/script/pdf_parser.py \
  '{"name": "pdf_to_json", "arguments": {"file_path": "/path/to/document.pdf", "output_dir": "/path/to/output"}}'

# Use specific backend and language
python .claude/skills/pdf-process/script/pdf_parser.py \
  '{"name": "pdf_to_json", "arguments": {"file_path": "/path/to/document.pdf", "output_dir": "/path/to/output", "backend": "hybrid-auto-engine", "language": "ch"}}'

---

Installation Instructions

1. Install MinerU

# Update pip and install uv
pip install --upgrade pip
pip install uv

# Install MinerU (including all features)
uv pip install -U "mineru[all]"

2. Verify Installation

# Check if MinerU is installed successfully
mineru --version

# Test basic functionality
mineru --help

3. System Requirements

Python Version: 3.10-3.13
Operating System: Linux / Windows / macOS 14.0+
Memory:

- Using pipeline backend: minimum 16GB, recommended 32GB+ - Using hybrid/vlm backend: minimum 16GB, recommended 32GB+

Disk Space: minimum 20GB (SSD recommended)
GPU (optional):

- pipeline backend: supports CPU-only - hybrid/vlm backend: requires NVIDIA GPU (Volta architecture and above) or Apple Silicon

Use Cases

Academic Paper Parsing: Extract structured content such as formulas, tables, and images
Technical Document Conversion: Convert PDF documents to Markdown for version control and online publishing
OCR Processing: Process scanned PDFs and garbled PDFs
Multilingual Documents: Supports OCR recognition for 109 languages
Batch Processing: Batch convert multiple PDF documents

Backend Selection Recommendations

hybrid-auto-engine (default): Balanced accuracy and speed, suitable for most scenarios
pipeline: Suitable for CPU-only environments, best compatibility
vlm-auto-engine: Highest accuracy, requires GPU acceleration

Notes

File Paths: All paths must be absolute paths
Output Directory: Non-existent directories will be created automatically
Performance: Using GPU can significantly improve parsing speed
Page Numbers: Page numbers start counting from 0
Memory: Processing large documents may consume more memory

Troubleshooting

Common Issues

Installation Failure:

- Ensure using Python 3.10-3.13 - Windows only supports Python 3.10-3.12 (ray does not support 3.13) - Using uv pip install can resolve most dependency conflicts

Insufficient Memory:

- Use pipeline backend - Limit parsing pages: start_page and end_page - Reduce virtual memory allocation

Slow Parsing Speed:

- Enable GPU acceleration - Use hybrid-auto-engine backend - Disable unnecessary features (formulas, tables)

Low OCR Accuracy:

- Specify the correct document language - Ensure the backend supports OCR (use pipeline or hybrid-*)

Related Resources

MinerU Official Documentation: https://opendatalab.github.io/MinerU/
MinerU GitHub: https://github.com/opendatalab/MinerU
Online Demo: https://mineru.net/

Prompt 示例

安装 pdf-parser-mineru 后，可以对 AI 说这些话来触发它

U

Help me get started with pdf-parser-mineru

A

Explains what pdf-parser-mineru does, walks through the setup, and runs a quick demo based on your current project

U

Use pdf-parser-mineru to pDF document parsing tool based on local MinerU, supports convertin...

A

Invokes pdf-parser-mineru with the right parameters and returns the result directly in the conversation

U

What can I do with pdf-parser-mineru in my documents & notes workflow?

A

Lists the top use cases for pdf-parser-mineru, with example commands for each scenario

常见问题

如何安装 pdf-parser-mineru？▾

将技能文件夹放到 ~/.claude/skills/pdf-parser-mineru/ 目录（个人级，所有项目可用），或 .claude/skills/pdf-parser-mineru/（项目级）。重启 AI 客户端后，用 /pdf-parser-mineru 主动调用，或让 AI 根据上下文自动发现并使用。

pdf-parser-mineru 支持哪些 AI 平台？▾

pdf-parser-mineru 支持 Claude、Cursor、OpenClaw，可与这些 AI 平台无缝集成，扩展其能力。

pdf-parser-mineru 是免费的吗？▾

pdf-parser-mineru 可免费安装使用。请查阅仓库了解许可证信息。

pdf-parser-mineru 有什么功能？▾

PDF document parsing tool based on local MinerU, supports converting PDF to Markdown, JSON, and other machine-readable formats.

pdf-parser-mineru 属于哪个分类？▾

pdf-parser-mineru 属于「Documents & Notes」分类，该分类的技能帮助 AI 智能体在此领域执行专业任务。

使用场景

Getting Started with pdf-parser-mineru→Automate Documents & Notes Workflows with pdf-parser-mineru→Team Collaboration with pdf-parser-mineru→