从图片或 PDF 文档中识别并提取文字内容,支持多种图片格式和 PDF 文件,自动判断是否包含文字并保留原始格式输出结构化结果;当用户需要从图片或 PDF 提取文字、进行 OCR 识别、处理含文字的文档或转换为可编辑文本时使用
Security Analysis
medium confidenceOverall the skill is coherent with its stated purpose (PDF text extraction); it requires no credentials or network access and the provided script only reads local PDFs and prints JSON, but it delegates image OCR to an external 'read_image' tool and the included script does not perform OCR on scanned images.
The skill claims to handle both images and PDFs. The included script (scripts/pdf_text_extractor.py) only processes text-based PDF content via PyMuPDF; image OCR is delegated in SKILL.md to a platform 'read_image' tool. SKILL.md also sometimes implies support for scanned PDFs but the script itself does not perform OCR — it will return empty text for image-only pages and advise using OCR. This is a minor mismatch but explainable (PDF handling vs OCR delegation).
Runtime instructions are narrow: obtain image URL or local PDF path, call the platform 'read_image' tool for images, and run the bundled Python script for PDFs. The script reads only the provided file path, validates suffix, extracts text with PyMuPDF, returns JSON, and does not access other files, environment variables, or external endpoints.
No automated install spec is included (instruction-only). SKILL.md documents a single dependency (pymupdf) and suggests 'pip install pymupdf>=1.23.0'. There are no downloads from arbitrary URLs or archive extraction steps in the package.
The skill requests no environment variables, credentials, or config paths. The code does not reference any secrets or external services. This is proportionate to the stated function.
The skill is not always-enabled, is user-invocable, and does not modify other skills or system settings. The script does not write persistent state; SKILL.md states files are only used in-session.
Guidance
This skill appears to do what it says for text-based PDFs and is not requesting credentials or making external network calls. Before installing, confirm two things: (1) your agent/platform supplies a 'read_image' tool (SKILL.md relies on it for image OCR) or you have a separate OCR tool — the bundled script does not perform OCR on scanned/image pages; (2) you are comfortable installing the PyMuPDF (pymupdf) Python package. Also note the package source/homepage is unknown — if provenance matters to you, request the author's origin or review the code yourself (the included script is short and local). If you need automatic OCR of scanned PDFs or images, verify the agent has an OCR tool or extend the skill to include one.
Latest Release
v1.0.0
PDF和图片文字提取器初始版本发布: - 支持从图片(常见格式)和 PDF 文档中自动识别并结构化提取文字内容 - 保持原有文本结构,输出支持 Markdown 格式 - 自动判别图片/PDF 是否包含可提取文字,并给出清晰反馈 - 提供标准流程用于图片、普通 PDF 和扫描 PDF 的处理 - 集成 pymupdf,支持扫描版、文字版 PDF 及中英多语言识别 - 包含详细使用说明、依赖和注意事项
Popular Skills
Published by @yuanyi-github on ClawHub