Use Case
pdf-ocr-layout is a powerful AI agent skill that extends your assistant with new capabilities. Multi-modal document in-depth analysis tool based on GLM-OCR, GLM-4.7 and GLM-4.6V. Use when: - Need to extract tables in documents (PDF/pictures) with high accuracy and convert them to Markdown format - Need to automatically crop and extract illustrations and charts from document pages into independent files - Requires deep semantic understanding of extracted charts (based on GLM-4.6V visual analysis) - Requires logical analysis of extracted tabular data (based on GLM-4.7 text analysis) Core architecture: 1. Visual extraction: GLM-OCR 2. Semantic understanding: GLM-4.7 (plain text/table) + GLM-4.6V (multi-modal/image) This guide walks you through installing pdf-ocr-layout, configuring it for your setup, and running your first commands β so you can start getting value in minutes.
Install pdf-ocr-layout: npx clawhub@latest --dir ~/.claude/skills install pdf-ocr-layout
Restart your AI client (Claude Code, Cursor, Gemini CLI, or OpenClaw)
Type a natural language request related to documents & notes to trigger pdf-ocr-layout
Review the output and refine your prompt for better results
Combine pdf-ocr-layout with other skills to build multi-step workflows
Copy these prompts and use them with your AI agent after installing pdf-ocr-layout
Help me get started with pdf-ocr-layout
What can pdf-ocr-layout do for my documents & notes workflow?
Show me an example of using pdf-ocr-layout
Select your agent
Option 1: Install via CLI (recommended)
Recommended (no pre-install needed)
npx clawhub@latest --dir ~/.claude/skills install pdf-ocr-layoutOr via clawhub CLI (if already installed)
clawhub --dir ~/.claude/skills install pdf-ocr-layoutβ οΈ Requires Node.js 18+. No Node? Use Option 2 below to download the ZIP instead. Install Node.js β
Option 2: Manual install (no Node required)
Download the ZIP, extract it, and place the folder at the path below. Restart your agent to activate.
Install path
~/.claude/skills/pdf-ocr-layout/π‘Extract and place the folder at the path above, then restart your agent.