Use Case
pdf-ocr-layout isn't just for solo use: teams can share skills and build consistent AI workflows across the organization.
pdf-ocr-layout is a multi-modal document analysis tool based on GLM-OCR, GLM-4.7, and GLM-4.6V.
Use when:
- You need to extract tables from documents (PDFs or images) with high accuracy and convert them to Markdown
- You need to automatically crop illustrations and charts from document pages and save them as independent files
- You need deep semantic understanding of extracted charts (GLM-4.6V visual analysis)
- You need logical analysis of extracted tabular data (GLM-4.7 text analysis)
Core architecture:
1. Visual extraction: GLM-OCR
2. Semantic understanding: GLM-4.7 (plain text/tables) + GLM-4.6V (multi-modal/images)
This guide covers how to deploy pdf-ocr-layout for your team, standardize prompts, and create shared workflows that everyone can use.
Install pdf-ocr-layout in your project directory: .claude/skills/pdf-ocr-layout/
Commit the skill folder to your repository so the whole team has access
Document your team's standard prompts in a shared README
Use pdf-ocr-layout in code reviews, standups, and planning sessions
Iterate: collect feedback from the team and refine your prompts
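The first two steps above (install into the project directory, then commit the folder) can be sketched as a small shell helper. This is a minimal sketch, not an official workflow: the function name share_skill and the commit message are hypothetical, and it assumes the skill already sits at the project-level path .claude/skills/pdf-ocr-layout/ inside a Git repository.

```shell
# share_skill [REPO_ROOT]
# Hypothetical helper: stages and commits the project-level skill folder
# so teammates receive it on their next pull. REPO_ROOT defaults to ".".
share_skill() {
  repo_root="${1:-.}"
  skill_path=".claude/skills/pdf-ocr-layout"

  # Refuse to commit if the skill has not been installed into the project yet.
  if [ ! -d "$repo_root/$skill_path" ]; then
    echo "error: $skill_path not found under $repo_root; install the skill first" >&2
    return 1
  fi

  # Stage only the skill folder, then commit it.
  git -C "$repo_root" add "$skill_path" &&
    git -C "$repo_root" commit -m "Add pdf-ocr-layout skill for the team"
}
```

Run share_skill from the repository root (or pass the root as the first argument), then push as usual. Staging only the skill folder keeps unrelated local changes out of the commit.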
Copy these prompts and use them with your AI agent after installing pdf-ocr-layout
How can my team use pdf-ocr-layout together?
Set up pdf-ocr-layout for our project so everyone can use it
Create a shared workflow using pdf-ocr-layout for our team
Option 1: Install via CLI (recommended)
Recommended (no pre-install needed)
npx clawhub@latest --dir ~/.claude/skills install pdf-ocr-layout
Or via clawhub CLI (if already installed)
clawhub --dir ~/.claude/skills install pdf-ocr-layout
⚠️ Requires Node.js 18+. No Node? Use Option 2 below to download the ZIP instead.
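Because Option 1 depends on Node.js 18+, it can help to check the installed version before choosing an install option. The following is a minimal sketch; the helper name node_version_ok is hypothetical, and it relies only on node --version printing a string such as v18.19.0.

```shell
# node_version_ok VERSION
# Hypothetical helper: succeeds when VERSION (e.g. "v18.19.0") has a
# major version of 18 or higher.
node_version_ok() {
  major="${1#v}"        # drop the leading "v"
  major="${major%%.*}"  # keep only the major component
  [ "$major" -ge 18 ] 2>/dev/null
}

if command -v node >/dev/null 2>&1 && node_version_ok "$(node --version)"; then
  echo "Node.js is new enough: use Option 1 (CLI install)"
else
  echo "Node.js 18+ not found: use Option 2 (manual ZIP install)"
fi
```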
Option 2: Manual install (no Node required)
Download the ZIP, extract it, and place the folder at the path below. Restart your agent to activate.
Install path
~/.claude/skills/pdf-ocr-layout/
💡 Extract and place the folder at the path above, then restart your agent.
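The manual placement step can be sketched in shell as follows. This is an illustration under assumptions, not part of the official instructions: the function name place_skill is hypothetical, and it expects that you have already extracted the downloaded ZIP into a local folder.

```shell
# place_skill SRC_DIR [SKILLS_DIR]
# Hypothetical helper: copies an already-extracted skill folder to
# SKILLS_DIR/pdf-ocr-layout. SKILLS_DIR defaults to ~/.claude/skills.
place_skill() {
  src_dir="$1"
  skills_dir="${2:-$HOME/.claude/skills}"

  if [ ! -d "$src_dir" ]; then
    echo "error: $src_dir is not a directory" >&2
    return 1
  fi

  mkdir -p "$skills_dir/pdf-ocr-layout"
  # Copy the folder contents (including dotfiles) into the install path.
  cp -R "$src_dir"/. "$skills_dir/pdf-ocr-layout/"
  echo "Installed to $skills_dir/pdf-ocr-layout/ (restart your agent to activate)"
}
```

For example, if the ZIP was extracted to ./pdf-ocr-layout (the folder name is an assumption), run place_skill ./pdf-ocr-layout and then restart your agent.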